ragflow

mirror of https://github.com/infiniflow/ragflow.git synced 2026-05-20 16:26:42 +08:00

Author	SHA1	Message	Date
bitloi	853021ff2a	feat: support multiple canvas_types for agent templates and remove duplicate files (#14030 ) ### What problem does this PR solve? Closes #13907 The template catalog had duplicate files (e.g. `*_r.json`) only to place the same template into multiple sidebar groups. This increases maintenance cost and makes template updates error-prone. This PR adds first-class support for multiple template categories in a single file via `canvas_types`, then removes duplicate template files. What changed: - Added `canvas_types` to `CanvasTemplate` model and DB migration. - Added normalization logic when loading templates: - accepts legacy `canvas_type` - accepts new `canvas_types` - merges/deduplicates values - preserves backward compatibility by keeping `canvas_type` as first normalized value. - Updated template import flow to load only `.json` files and in stable sorted order. - Updated frontend template filtering to match on `canvas_types` first, with fallback to legacy `canvas_type`. - Consolidated duplicated template pairs into single files and removed: - `deep_search_r.json` - `reflective_academic_paper_generator_r.json` - `seo_article_writer_r.json` - Added regression/edge-case tests for category normalization and route serialization expectations. ### Type of change - [ ] Bug Fix (non-breaking change which fixes an issue) - [x] New Feature (non-breaking change which adds functionality) - [ ] Documentation Update - [ ] Refactoring - [ ] Performance Improvement - [ ] Other (please describe):	2026-04-13 20:26:30 +08:00
eason	aa92abe73c	fix: close file handles properly in json.load() calls (#13997 ) ## Summary Fixes #13996 Replace `json.load(open(...))` with `with open(...) as f: json.load(f)` in two files to ensure file descriptors are properly closed. Affected files: - `common/doc_store/infinity_conn_base.py` — schema loading for Infinity doc store - `api/db/init_data.py` — agent template loading at startup ## Why this matters In a long-running server process like RAGFlow, leaked file descriptors from `json.load(open(...))` can accumulate over time. While CPython's refcounting usually cleans these up, it's not guaranteed (especially under memory pressure or with alternative Python runtimes), and can lead to `OSError: [Errno 24] Too many open files`. ## Test plan - [ ] Verify Infinity doc store schema loading still works correctly - [ ] Verify agent templates load correctly on startup ## Summary by CodeRabbit * Refactor * Improved file handling in internal data processing to ensure proper resource cleanup. Co-authored-by: easonysliu <easonysliu@tencent.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-10 12:16:49 +08:00
Lynn	0214257886	Fix: init func (#13430 ) ### What problem does this PR solve? Fix update_cnt add error in init_data. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-03-06 11:42:31 +08:00
Lynn	62cb292635	Feat/tenant model (#13072 ) ### What problem does this PR solve? Add id for table tenant_llm and apply in LLMBundle. ### Type of change - [x] Refactoring --------- Co-authored-by: Yingfeng <yingfeng.zhang@gmail.com> Co-authored-by: Liu An <asiro@qq.com>	2026-03-05 17:27:17 +08:00
as-ondewo	194e076e26	Fix: init superuser can create duplicate users (#13221 ) ### What problem does this PR solve? This PR fixes 2 bugs related to RAGFlow's init superuser functionality. #### Bug 1 When the RAGFlow server was started with the `--init-superuser` option, it would always create a new admin user even if one already existed, resulting in duplicate users. To fix this, I added an additional check before creating the superuser and added a unique constraint to the email column of the database to mitigate potential TOCTOU race conditions. Since existing databases could contain duplicate emails, I added email de-duplication to the database migration. #### Bug 2 When the RAGFlow server was started with the `--init-superuser` option but without configured default LLM and embedding models, it would fail to start because the `init_superuser` function would always make test requests to the models even if they were not set. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-02-27 19:55:51 +08:00
as-ondewo	91d1a81937	fix: error during admin tenant creation when using Postgres (#13164 ) ### What problem does this PR solve? This fixes the bug described in #13130. When starting RAGFlow with Postgres the admin tenant create failed because the rerank model was not set. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-02-24 10:57:31 +08:00
Lynn	f3923452df	Fix: add tokenized content (#12793 ) ### What problem does this PR solve? Add tokenized content es field to query zh message. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-01-23 16:56:03 +08:00
Jin Hai	ac9113b0ef	feature: add system setting service (#12408 ) ### What problem does this PR solve? #12409 ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-01-04 14:21:39 +08:00
Lynn	6e9691a419	Feat: message manage (#12196 ) ### What problem does this PR solve? Manage message and use in agent. Issue #4213 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-12-25 21:18:13 +08:00
Kevin Hu	ea4a5cd665	Fix: tokenizer issue. (#11902 ) #11786 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-12-11 17:38:17 +08:00
buua436	3cb72377d7	Refa:remove sensitive information (#11873 ) ### What problem does this PR solve? change: remove sensitive information ### Type of change - [x] Refactoring	2025-12-10 19:08:45 +08:00
zhipeng	d5f8548200	Allow create super user when start rag server. (#10634 ) ### What problem does this PR solve? New options for rag server scripts to create the super admin user when start server. ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Co-authored-by: Zhichang Yu <yuzhichang@gmail.com> Co-authored-by: Jin Hai <haijin.chn@gmail.com>	2025-11-24 19:02:08 +08:00
Kevin Hu	e9de25c973	Docs: update `latest updates`. (#11188 ) ### Type of change - [x] Documentation Update	2025-11-12 10:38:33 +08:00
Kevin Hu	c30ffb5716	Fix: ollama model list issue. (#11175 ) ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-11-11 19:46:41 +08:00
Jin Hai	f98b24c9bf	Move api.settings to common.settings (#11036 ) ### What problem does this PR solve? As title ### Type of change - [x] Refactoring --------- Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2025-11-06 09:36:38 +08:00
Jin Hai	1a9215bc6f	Move some vars to globals (#11017 ) ### What problem does this PR solve? As title. ### Type of change - [x] Refactoring --------- Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2025-11-05 14:14:38 +08:00
Jin Hai	bab3fce136	Move some constants to common (#11004 ) ### What problem does this PR solve? As title. ### Type of change - [x] Refactoring --------- Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2025-11-05 08:01:39 +08:00
Jin Hai	44f2d6f5da	Move 'get_project_base_directory' to common directory (#10940 ) ### What problem does this PR solve? As title ### Type of change - [x] Refactoring --------- Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2025-11-02 21:05:28 +08:00
Jin Hai	62b7c655c5	Refactor: migrate the function to specific file (#10201 ) ### What problem does this PR solve? Move base64 related function to api/common/base64.py ### Type of change - [x] Refactoring --------- Signed-off-by: jinhai <haijin.chn@gmail.com> Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2025-09-25 23:37:50 +08:00
He Wang	7ccca2143c	perf: add get_all_kb_doc_count func to simplify kb.doc_num updating (#10169 ) ### What problem does this PR solve? Add get_all_kb_doc_count func to simplify kb.doc_num updating. ### Type of change - [x] Performance Improvement	2025-09-19 19:11:50 +08:00
Kevin Hu	5e8cd693a5	Refa: split services about llm. (#9450 ) ### What problem does this PR solve? ### Type of change - [x] Refactoring	2025-08-13 16:41:01 +08:00
Yongteng Lei	421657f64b	Feat: allows setting multiple types of default models in service config (#9404 ) ### What problem does this PR solve? Allows set multiple types of default models in service config. ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-08-13 09:46:05 +08:00
Kevin Hu	d9fe279dde	Feat: Redesign and refactor agent module (#9113 ) ### What problem does this PR solve? #9082 #6365 <u> WARNING: it's not compatible with the older version of `Agent` module, which means that `Agent` from older versions can not work anymore.</u> ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-07-30 19:41:09 +08:00
Jin Hai	4a2ff633e0	Fix typo in code (#8327 ) ### What problem does this PR solve? Fix typo in code ### Type of change - [x] Refactoring --------- Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2025-06-18 09:41:09 +08:00
Kevin Hu	9849230a04	Fix: remove deprecated novitaAI. (#7511 ) ### What problem does this PR solve? #7484 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-05-07 19:36:16 +08:00
Yongteng Lei	98670c3755	Fix: KB update_time changed whenever system relaunched (#6959 ) ### What problem does this PR solve? Fix KB update_time changed whenever system relaunched. #6953 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-04-11 20:10:49 +08:00
Kevin Hu	7463241896	Fix: empty doc id validation. (#6064 ) ### What problem does this PR solve? #6031 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-03-14 11:45:44 +08:00
utopia2077	2d4a60cae6	Fix: Reduce excessive IO operations by loading LLM factory configurations (#6047 ) ### What problem does this PR solve? This PR fixes an issue where the application was repeatedly reading the llm_factories.json file from disk in multiple places, which could lead to "Too many open files" errors under high load conditions. The fix centralizes the file reading operation in the settings.py module and stores the data in a global variable that can be accessed by other modules. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) - [ ] New Feature (non-breaking change which adds functionality) - [ ] Documentation Update - [ ] Refactoring - [x] Performance Improvement - [ ] Other (please describe):	2025-03-14 09:54:38 +08:00
donblack01	b1a46d5adc	Fix:when start with source code not in docker env report 'UnicodeDec… (#5802 ) ### What problem does this PR solve? fix:when start with source code not in docker env report "UnicodeDecodeError: 'gbk' codec can't decode byte 0xad in position 5: illegal multibyte sequence" in windows ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) Co-authored-by: tangyu <1@1.com>	2025-03-10 11:22:06 +08:00
Kevin Hu	dd0ebbea35	Light GraphRAG (#4585 ) ### What problem does this PR solve? #4543 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-01-22 19:43:14 +08:00
Kevin Hu	c5da3cdd97	Tagging (#4426 ) ### What problem does this PR solve? #4367 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-01-09 17:07:21 +08:00
Zhichang Yu	0d68a6cd1b	Fix errors detected by Ruff (#3918 ) ### What problem does this PR solve? Fix errors detected by Ruff ### Type of change - [x] Refactoring	2024-12-08 14:21:12 +08:00
Jin Hai	1e90a1bf36	Move settings initialization after module init phase (#3438 ) ### What problem does this PR solve? 1. Module init won't connect database any more. 2. Config in settings need to be used with settings.CONFIG_NAME ### Type of change - [x] Refactoring Signed-off-by: jinhai <haijin.chn@gmail.com>	2024-11-15 17:30:56 +08:00
Zhichang Yu	30f6421760	Use consistent log file names, introduced initLogger (#3403 ) ### What problem does this PR solve? Use consistent log file names, introduced initLogger ### Type of change - [ ] Bug Fix (non-breaking change which fixes an issue) - [ ] New Feature (non-breaking change which adds functionality) - [ ] Documentation Update - [x] Refactoring - [ ] Performance Improvement - [ ] Other (please describe):	2024-11-14 17:13:48 +08:00
Kevin Hu	e44e3a67b0	adapt to lower case cohere (#3392 ) ### What problem does this PR solve? #3384 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2024-11-14 10:18:25 +08:00
Zhichang Yu	a2a5631da4	Rework logging (#3358 ) Unified all log files into one. ### What problem does this PR solve? Unified all log files into one. ### Type of change - [x] Refactoring	2024-11-12 17:35:13 +08:00
Kevin Hu	5e7c1fb23a	reduce rerank batch size (#2801 ) ### What problem does this PR solve? ### Type of change - [x] Performance Improvement	2024-10-11 11:29:19 +08:00
Kevin Hu	e6da0c7c7b	deprecate init a super user (#2589 ) ### What problem does this PR solve? #2295 ### Type of change - [x] Refactoring	2024-09-25 18:30:27 +08:00
Dada Hsueh	2484e26cb5	fix `superuser` password not base64 encoded (#2475 ) ### What problem does this PR solve? Fixes the _superuser_ `admin@ragflow.io` not being accessible due to how entered passwords are used. Unless this is expected behavior? ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2024-09-18 14:30:45 +08:00
Jin Hai	6b3a40be5c	Format file format from Windows/dos to Unix (#1949 ) ### What problem does this PR solve? Related source file is in Windows/DOS format, they are format to Unix format. ### Type of change - [x] Refactoring Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2024-08-15 09:17:36 +08:00
Kevin Hu	54fc6dcf01	refine llm init (#1938 ) ### What problem does this PR solve? ### Type of change - [x] Performance Improvement	2024-08-14 13:28:55 +08:00
Kevin Hu	a313b77cdd	rm qwen-v1-max (#1894 ) ### What problem does this PR solve? #1748 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2024-08-09 18:41:44 +08:00
黄腾	ede733e130	add support for eml file parser (#1768 ) ### What problem does this PR solve? add support for eml file parser #1363 ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Co-authored-by: Zhedong Cen <cenzhedong2@126.com> Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	2024-08-06 16:42:14 +08:00
Kevin Hu	152072f900	Add graphrag (#1793 ) ### What problem does this PR solve? #1594 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2024-08-02 18:51:14 +08:00
H	ac7a0d4fbf	Add ParsertType Audio (#1637 ) ### What problem does this PR solve? #1514 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2024-07-22 19:17:30 +08:00
Kevin Hu	a5306e6345	fix minimax init error (#1537 ) ### What problem does this PR solve? ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2024-07-16 16:55:31 +08:00
黄腾	75086f41a9	'load llm infomation from a json file and add support for OpenRouter' (#1533 ) ### What problem does this PR solve? #1467 ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Co-authored-by: Zhedong Cen <cenzhedong2@126.com>	2024-07-16 15:19:43 +08:00
Kevin Hu	607de74ace	fix minimax bug (#1528 ) ### What problem does this PR solve? #1353 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2024-07-16 10:55:33 +08:00
paresh2806	ddeac9ab3d	added SVG for Groq model model providers (#1470 ) #1432 #1447 This PR adds support for the GROQ LLM (Large Language Model). Groq is an AI solutions company delivering ultra-low latency inference with the first-ever LPU™ Inference Engine. The Groq API enables developers to integrate state-of-the-art LLMs, such as Llama-2 and llama3-70b-8192, into low latency applications with the request limits specified below. Learn more at [groq.com](https://groq.com/). Supported Models \| ID \| Requests per Minute \| Requests per Day \| Tokens per Minute \| \|----------------------\|---------------------\|------------------\|-------------------\| \| gemma-7b-it \| 30 \| 14,400 \| 15,000 \| \| gemma2-9b-it \| 30 \| 14,400 \| 15,000 \| \| llama3-70b-8192 \| 30 \| 14,400 \| 6,000 \| \| llama3-8b-8192 \| 30 \| 14,400 \| 30,000 \| \| mixtral-8x7b-32768 \| 30 \| 14,400 \| 5,000 \| --------- Co-authored-by: paresh0628 <paresh.tuvoc@gmail.com> Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	2024-07-12 09:25:44 +08:00
zhuhao	009e18f094	feat: support xinference rerank model (#1466 ) ### What problem does this PR solve? support xinference rerank model #1455 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2024-07-11 18:37:41 +08:00
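
The file-handle fix described in commit aa92abe73c (#13997) replaces `json.load(open(...))` with a `with` block. A minimal sketch of that pattern follows. The helper name `load_json_file` and the sample file are hypothetical and are not taken from the repository. Passing `encoding="utf-8"` is an added assumption here; it is not stated in that commit, though it would also avoid the platform-default decoding problem reported in #5802.

```python
import json
import os
import tempfile


def load_json_file(path):
    """Read a JSON file and close its file descriptor deterministically.

    Hypothetical helper for illustration; not the repository's code.
    """
    # The with-block closes the handle on exit, even if json.load raises.
    # encoding="utf-8" is an assumption made for this sketch.
    with open(path, "r", encoding="utf-8") as f:
        return json.load(f)


# Usage with a throwaway file in a temporary directory.
with tempfile.TemporaryDirectory() as tmp:
    sample_path = os.path.join(tmp, "sample.json")
    with open(sample_path, "w", encoding="utf-8") as f:
        json.dump({"title": "demo"}, f)
    loaded = load_json_file(sample_path)
```

Unlike `json.load(open(path))`, this form does not rely on the garbage collector to release the descriptor.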

97 Commits
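
The category normalization described in commit 853021ff2a (#14030) accepts the legacy `canvas_type` and the new `canvas_types`, merges and de-duplicates them, and keeps `canvas_type` as the first normalized value. It might look roughly like the sketch below. The function name `normalize_canvas_types`, the merge order (legacy value first), and the category names in the usage example are assumptions made for illustration; the actual implementation may differ.

```python
def normalize_canvas_types(template):
    """Return a copy of `template` with merged, de-duplicated categories.

    Hypothetical helper; the merge order (legacy value first) is an assumption.
    """
    new_value = template.get("canvas_types") or []
    if isinstance(new_value, str):
        # Tolerate a single string where a list was expected.
        new_value = [new_value]

    legacy = template.get("canvas_type")
    candidates = ([legacy] if legacy else []) + list(new_value)

    merged = []
    for value in candidates:
        if not isinstance(value, str):
            continue  # ignore malformed entries
        value = value.strip()
        if value and value not in merged:
            merged.append(value)

    result = dict(template)
    result["canvas_types"] = merged
    # Backward compatibility: the legacy field mirrors the first category.
    result["canvas_type"] = merged[0] if merged else None
    return result


# Usage: one template file that belongs to two sidebar groups.
normalized = normalize_canvas_types(
    {"title": "Deep Search", "canvas_type": "Agent",
     "canvas_types": ["Recommended", "Agent"]}
)
legacy_only = normalize_canvas_types({"title": "Old", "canvas_type": "Agent"})
```

With this shape, a consumer that only reads `canvas_type` keeps working, while newer code can filter on `canvas_types`.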