ragflow

mirror of https://github.com/infiniflow/ragflow.git synced 2026-05-27 11:15:59 +08:00

Author	SHA1	Message	Date
buua436	0501134820	Fix: support tool call config (#14616 ) ### What problem does this PR solve? support tool call config ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-05-07 15:54:57 +08:00
buua436	5b162a0c46	Fix: preserve doc generator download metadata in message (#14626 ) ### What problem does this PR solve? preserve doc generator download metadata ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-05-07 15:48:36 +08:00
buua436	3e396c0a72	Fix: add base64 to doc generator output (#14599 ) ### What problem does this PR solve? add base64 to doc generator output ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-05-06 20:33:08 +08:00
buua436	e6e80041f5	Fix: agent toolcall null response & schema validation & DeepSeek think history (#14425 ) ### What problem does this PR solve? agent toolcall null response & schema validation & DeepSeek think history ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-04-28 17:09:08 +08:00
buua436	c949096db0	Refactor: optimize agent reset conversation variable defaults (#14401 ) ### What problem does this PR solve? optimize agent reset conversation variable defaults ### Type of change - [x] Refactoring	2026-04-27 19:57:56 +08:00
buua436	82313020c7	Refa: align list operations and strict mode (#14387 ) ### What problem does this PR solve? align list operations and strict mode ### Type of change - [x] Refactoring	2026-04-27 19:13:00 +08:00
buua436	4f6651968a	Fix: prioritize explore session ID and reset default conversation variables (#14399 ) ### What problem does this PR solve? prioritize explore session ID and reset default conversation variables ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-04-27 18:52:40 +08:00
Jack	290f0294d6	Refactor: migrate artifact API (#14348 ) ### What problem does this PR solve? Before migration: GET /v1/document/artifact/<filename> After migration: GET /api/v1/documents/artifact/<filename> ### Type of change - [x] Refactoring	2026-04-27 15:19:41 +08:00
Xing Hong	fb95136f39	Fix: validate URL scheme and resolved IP before crawling to prevent SSRF (#14090 ) ### What problem does this PR solve? The POST /upload_info?url=<url> endpoint accepted a user-supplied URL and passed it directly to AsyncWebCrawler without any validation. There were no restrictions on URL scheme, destination hostname, or resolved IP address. This allowed any authenticated user to instruct the server to make outbound HTTP requests to internal infrastructure — including RFC 1918 private networks, loopback addresses, and cloud metadata services such as http://169.254.169.254 — effectively using the server as a proxy for internal network reconnaissance or credential theft. This PR adds an SSRF guard (_validate_url_for_crawl) that runs before any crawl is initiated. It enforces an allowlist of safe schemes (http/https), resolves the hostname at validation time, and rejects any URL whose resolved IP falls within a private or reserved network range. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-04-25 14:30:15 +08:00
Cocoon-Break	620088be2f	fix: check isinstance before len in VariableAssigner _remove_first/_remove_last (#14281 ) fix: check isinstance before len in VariableAssigner _remove_first/_remove_last	2026-04-24 19:09:44 +08:00
Magicbook1108	75a5548b85	Feat: optimize title chunk (#14325 ) ### What problem does this PR solve? Feat: optimize title chunk 1. Add a new button to enable "Use root chunk as H0 heading", so that the first chunk is carried on to all remaining chunks. 2. Update resume agent template ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2026-04-23 18:55:55 +08:00
Magicbook1108	9c7c105007	Fix: Doc generator (#14223 ) ### What problem does this PR solve? Doc generator ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-04-20 16:37:33 +08:00
Magicbook1108	d053317c4d	Fix: variable in doc generator (#14180 ) ### What problem does this PR solve? Fix: variable in doc generator ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-04-20 14:19:42 +08:00
LeonTung	f554f6ae85	chore(docs): tips for installing CN fonts (#14189 ) ### What problem does this PR solve? Add tips for installing Chinese fonts under code sandbox. Otherwise, `matplotlib` won't render Chinese correctly. ### Type of change - [x] Documentation Update	2026-04-20 12:11:23 +08:00
LeonTung	c3bf8d9d60	feat(templates): add a data analysis agent template (#14130 ) ### What problem does this PR solve? Add a new agent template that demonstrates how to leverage the `CodeExec` component to do the data analysis. ### Type of change - [x] Other (please describe): Agent template	2026-04-17 11:32:04 +08:00
writinwaters	0df5d830d4	Refact: Updated agent template descriptions. (#14175 ) ### What problem does this PR solve? Updated ingestion pipeline template descriptions for better technical accuracy and readability. ### Type of change - [x] Refactoring	2026-04-17 10:46:06 +08:00
Magicbook1108	901023a80a	Fix: literal eval http request input (#14145 ) ### What problem does this PR solve? Fix: literal eval http request input ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-04-16 16:52:34 +08:00
Yongteng Lei	356ba5650a	Fix: sandbox don't attach attachment metadata (#14135 ) ### What problem does this PR solve? Sandbox don't attach attachment metadata ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-04-16 12:08:54 +08:00
Magicbook1108	d51789e2be	Feat: update templates && add resume template (#14124 ) ### What problem does this PR solve? Feat: update templates && add resume template ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2026-04-15 18:42:29 +08:00
Magicbook1108	1376c004a9	Fix: update docs generator (#14070 ) ### What problem does this PR solve? Refactor: update docs generator ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) 1. Support multiple document generator components and correctly display messages in the message component. The document generator will not overwrite other messages. 2. Support Chinese content and ensure correct Markdown rendering in PDF and DOCX 3. Simplify configuration page and support more output formats 4. Hide download from other components except for message 5. Sanitize filename 6. And more changes on usability	2026-04-14 15:24:43 +08:00
Jin Hai	2b6c50734f	Sync code from EE (#14080 ) ### What problem does this PR solve? As title. ### Type of change - [x] Refactoring --------- Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-04-14 15:03:46 +08:00
Magicbook1108	8723c3aa86	Feat: more templates (#14075 ) ### What problem does this PR solve? Feat: more templates ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Co-authored-by: writinwaters <93570324+writinwaters@users.noreply.github.com>	2026-04-14 10:00:55 +08:00
天海蒼灆	356d45fda1	Feat: add cell type coercion for Excel export (#13808 ) ### What problem does this PR solve? - Implemented a helper function to convert markdown cell text to native numeric types for Excel output. - Ensured that leading zeros are preserved and handled various numeric formats, including those with thousand separators and scientific notation. ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2026-04-13 20:54:57 +08:00
bitloi	853021ff2a	feat: support multiple canvas_types for agent templates and remove duplicate files (#14030 ) ### What problem does this PR solve? Closes #13907 The template catalog had duplicate files (e.g. `*_r.json`) only to place the same template into multiple sidebar groups. This increases maintenance cost and makes template updates error-prone. This PR adds first-class support for multiple template categories in a single file via `canvas_types`, then removes duplicate template files. What changed: - Added `canvas_types` to `CanvasTemplate` model and DB migration. - Added normalization logic when loading templates: - accepts legacy `canvas_type` - accepts new `canvas_types` - merges/deduplicates values - preserves backward compatibility by keeping `canvas_type` as first normalized value. - Updated template import flow to load only `.json` files and in stable sorted order. - Updated frontend template filtering to match on `canvas_types` first, with fallback to legacy `canvas_type`. - Consolidated duplicated template pairs into single files and removed: - `deep_search_r.json` - `reflective_academic_paper_generator_r.json` - `seo_article_writer_r.json` - Added regression/edge-case tests for category normalization and route serialization expectations. ### Type of change - [ ] Bug Fix (non-breaking change which fixes an issue) - [x] New Feature (non-breaking change which adds functionality) - [ ] Documentation Update - [ ] Refactoring - [ ] Performance Improvement - [ ] Other (please describe):	2026-04-13 20:26:30 +08:00
Yongteng Lei	1638083e18	Fix: sandbox cannot accept large args list (#14063 ) ### What problem does this PR solve? Sandbox cannot accept large args list. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-04-13 14:14:08 +08:00
akie	3911d90993	Fix: agent application can not show Cite (#14047 ) Close #14018 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) ### Problem In Agent applications, even with the cite option enabled, only inline [ID: x] citation markers are visible (showing chunk content on hover). The Agent does not display the referenced file cards below the response, unlike Chat applications. ### Root Cause The Agent's Retrieval tool (agent/tools/retrieval.py) calls retriever.retrieval() with aggs=False, which means the retrieval results do not include doc_aggs (document aggregation) data. Without doc_aggs, the frontend ReferenceDocumentList component has no data to render the file cards. In contrast, the Chat application (api/db/services/dialog_service.py) calls the same retriever.retrieval() method with aggs=True. ### Fix Changed aggs=False to aggs=True in agent/tools/retrieval.py so that document aggregation data is returned along with the retrieved chunks.	2026-04-13 11:06:14 +08:00
Magicbook1108	82d74fd276	Refact: update pipeline template (#14036 ) ### What problem does this PR solve? Refact: update pipeline template ### Type of change - [x] Refactoring	2026-04-10 19:04:52 +08:00
Magicbook1108	9ce293a736	Refact: update exesql notification (#14027 ) ### What problem does this PR solve? Refact: update exesql notification ### Type of change - [x] Refactoring	2026-04-10 13:42:57 +08:00
Ricardo-M-L	c13f8856a1	fix: correct typos in agent component filename and templates (#13930 ) ## Summary - Rename misspelled file `varaiable_aggregator.py` → `variable_aggregator.py` - Fix `unkown` → `unknown` in template and frontend constant (3 instances) - Fix `Finale` → `Final` in customer feedback template (2 instances) ## Test plan - [ ] Verify variable aggregator component loads correctly - [ ] Verify agent templates render properly 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-authored-by: yuj <yuj@ztjzsoft.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com> Co-authored-by: Jin Hai <haijin.chn@gmail.com>	2026-04-09 11:06:01 +08:00
Yongteng Lei	3064895bbb	Fix: import error in sandbox provider (#13971 ) ### What problem does this PR solve? Fix import error in sandbox provider. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) ## Summary by CodeRabbit * Chores * Updated internal configuration import mechanism for sandbox provider initialization. No end-user impact.	2026-04-08 15:35:30 +08:00
balibabu	38acf34724	Fix: The agent selected a knowledge base, but the API returned the error: "No dataset is selected". (#13950 ) ### What problem does this PR solve? Fix: The agent selected a knowledge base, but the API returned the error: "No dataset is selected". ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) --------- Co-authored-by: balibabu <assassin_cike@163.com>	2026-04-07 14:16:37 +08:00
Yongteng Lei	112007243d	Refa: refine code_exec component (#13925 ) ### What problem does this PR solve? Refine code_exec component. ### Type of change - [x] Refactoring	2026-04-07 11:48:29 +08:00
Magicbook1108	69264b3a70	Feat: Refact pipeline (#13826 ) ### What problem does this PR solve? ### Type of change - [x] New Feature (non-breaking change which adds functionality) - [x] Refactoring --------- Co-authored-by: Zhichang Yu <yuzhichang@gmail.com> Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-03 19:26:45 +08:00
LeonTung	0b724be521	chore(templates): Update the customer feedback dispatcher template (#13919 ) ### What problem does this PR solve? Update the customer feedback dispatcher template and introduce a new operator `Variable Aggregator`. ### Type of change - [x] Other (please describe): Template change --------- Co-authored-by: Jin Hai <haijin.chn@gmail.com>	2026-04-03 16:51:39 +08:00
Ricardo-M-L	354108922b	fix: use f-string with separator in switch operator error message (#13915 ) `switch.py` line 137 concatenates the operator directly after the text without separator: `'Not supported operator' + operator` → produces `"Not supported operatorXXX"` Changed to: `f'Not supported operator: {operator}'`	2026-04-03 16:49:28 +08:00
writinwaters	6263857c1e	Agent templates regrouped and renamed (#13873 ) ### What problem does this PR solve? Regrouped and renamed agent templates to increase user engagement. ### Type of change - [x] Refactoring	2026-04-03 13:43:25 +08:00
Ricardo-M-L	09a09a5b20	fix: correct typo in IterationItem name check and incomplete error message (#13890 ) Two small fixes: 1. iterationitem.py line 72: Typo "interationitem" → "iterationitem" (missing 't'). The component name check never matched IterationItem components. 2. raptor.py line 94: Error message "Embedding error: " had a trailing colon with no details. Changed to "Embedding error: empty embeddings returned".	2026-04-02 10:35:28 +08:00
Baki Burak Öğün	8a4da41406	docs: add Turkish README translation (README_tr.md) (#13750 ) ## Summary Add a complete Turkish translation of the README and include a Turkish language badge across all existing README files. ## Changes - New file: `README_tr.md` - Full Turkish translation of README.md, covering all sections (What is RAGFlow, Demo, Latest Updates, Key Features, System Architecture, Get Started, Configurations, Docker Image, Development from Source, Documentation, Roadmap, Community, Contributing) - Updated 9 existing README files (README.md, README_zh.md, README_tzh.md, README_ja.md, README_ko.md, README_id.md, README_pt_br.md, README_fr.md, README_ar.md) to include the Turkish language badge in the language selector ## Impact - 10 files changed, 417 insertions - Follows the same structure and conventions as other language-specific README files (README_ja.md, README_ko.md, etc.) - Turkish badge uses the same styling pattern (highlighted with DBEDFA in README_tr.md, standard DFE0E5 in others) --------- Co-authored-by: bakiburakogun <bakiburakogun@users.noreply.github.com>	2026-03-24 19:00:48 +08:00
Lynn	db57155b30	Fix: get user_id from variables (#13716 ) ### What problem does this PR solve? Get user_id from canvas variable when input a {} pattern value. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-03-20 23:39:34 +08:00
Yongteng Lei	dd839f30e8	Fix: code supports matplotlib (#13724 ) ### What problem does this PR solve? Code as "final" node: ![img_v3_02vs_aece4caf-8403-4939-9e68-9845a22c2cfg](https://github.com/user-attachments/assets/9d87b8df-da6b-401c-bf6d-8b807fe92c22) Code as "mid" node: ![img_v3_02vv_f74f331f-d755-44ab-a18c-96fff8cbd34g](https://github.com/user-attachments/assets/c94ef3f9-2a6c-47cb-9d2b-19703d2752e4) ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2026-03-20 20:32:00 +08:00
Daniil Sivak	dee68c571b	Feat: support variable interpolation in headers (#13680 ) Closes #13277 ### What problem does this PR solve? Adds `{variable_name}` (and `{component@variable}`) interpolation support to HTTP header values in the `Invoke` component, matching the existing URL interpolation behavior. ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com>	2026-03-18 22:38:20 +08:00
Yongteng Lei	53e395ca2e	Fix: cannot debug invoke component (#13649 ) ### What problem does this PR solve? Cannot debug invoke component. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-03-18 14:22:13 +08:00
Lynn	02070bab2a	Feat: record user_id in memory (#13585 ) ### What problem does this PR solve? Get user_id from canvas and record it. ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2026-03-13 15:38:35 +08:00
Yongteng Lei	13a34d7689	Feat: inject sys.date into canvas (#13567 ) ### What problem does this PR solve? Inject sys.date into canvas. ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2026-03-12 17:49:13 +08:00
JiangNan	2634cfc06f	Fix: undefined variable and wrong method name in agent components (#13462 ) ## Summary This PR fixes two runtime bugs in agent components: Bug 1: `agent/component/invoke.py` — `NameError` in POST + `clean_html` path The POST method's `clean_html` branch uses the variable `sections` without ever defining it. Both the GET and PUT branches correctly call `sections = HtmlParser()(None, response.content)` before referencing `sections`, but this line was missing from the POST branch (copy-paste omission). This causes a `NameError` whenever a user configures an Invoke component with `method="post"` and `clean_html=True`. Bug 2: `agent/component/data_operations.py` — `AttributeError` in `_recursive_eval` The `_recursive_eval` method recursively calls `self.recursive_eval()` (without the leading underscore) instead of `self._recursive_eval()`. Since the method is defined as `_recursive_eval`, this causes an `AttributeError` at runtime when the `literal_eval` operation processes nested dicts or lists. ## Test plan - [ ] Configure an Invoke node with `method=post` and `clean_html=True`, verify HTML is parsed correctly without `NameError` - [ ] Configure a DataOperations node with `operations=literal_eval` on nested data, verify no `AttributeError` --------- Signed-off-by: JiangNan <1394485448@qq.com>	2026-03-09 11:09:47 +08:00
Eden	ab6ca75245	fix(agent): ensure database connections are properly closed in ExeSQL tool (#13427 ) ## Summary Fix a database connection and cursor resource leak in the ExeSQL agent tool. When SQL execution raises an exception (for example syntax error or missing table), the existing code path skips `cursor.close()` and `db.close()`, causing database connections to accumulate over time. This can eventually lead to connection exhaustion in long-running agent workflows. ## Root Cause The cleanup logic for database cursors and connections is placed after the SQL execution loop without `try/finally` protection. If an exception occurs during `cursor.execute()`, `fetchmany()`, or result processing, the cleanup code is not reached and the connection remains open. The same issue also exists in the IBM DB2 execution path where `ibm_db.close(conn)` may be skipped when exceptions occur. ## Fix - Wrap SQL execution logic in `try/finally` blocks to guarantee resource cleanup. - Ensure `cursor.close()` and `db.close()` are always executed. - Add explicit `db.close()` when `db.cursor()` creation fails. - Remove redundant close calls in early-return branches since `finally` now handles cleanup. ## Impact - No change to normal execution behavior. - Ensures database resources are always released when errors occur. - Prevents connection leaks in long-running workflows. - Only affects `agent/tools/exesql.py`. ## Testing Manual test scenarios: 1. Valid SQL execution 2. SQL syntax error 3. Query against a non-existing table 4. Execution cancellation during query In all scenarios the database cursor and connection are properly closed. Code quality checks: - `ruff check` passed - No new warnings introduced	2026-03-09 10:36:02 +08:00
Lynn	0214257886	Fix: init func (#13430 ) ### What problem does this PR solve? Fix update_cnt add error in init_data. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-03-06 11:42:31 +08:00
guptas6est	c35b210c3a	fix(security): upgrade requests to 2.32.5 in agent/sandbox to fix CVE-2024-47081 (#13424 ) ### What problem does this PR solve? This PR remediates CVE-2024-47081 (MEDIUM severity) in the agent/sandbox component by upgrading the requests library from version 2.32.3 to 2.32.5. The vulnerability allows .netrc credentials to leak via malicious URLs. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-03-06 09:29:18 +08:00
Lynn	62cb292635	Feat/tenant model (#13072 ) ### What problem does this PR solve? Add id for table tenant_llm and apply in LLMBundle. ### Type of change - [x] Refactoring --------- Co-authored-by: Yingfeng <yingfeng.zhang@gmail.com> Co-authored-by: Liu An <asiro@qq.com>	2026-03-05 17:27:17 +08:00
guptas6est	8c9b080499	fix: update axios to 1.13.5+ to remediate CVE-2026-25639 DoS vulnerability (#13380 ) ### What problem does this PR solve? This PR remediates CVE-2026-25639, a HIGH severity Denial of Service vulnerability in axios caused by __proto__ pollution in the mergeConfig function. The vulnerability affects both the web frontend and the sandbox nodejs environment. Trivy security scan identified axios versions below 1.13.5 as vulnerable. This PR updates axios to secure versions (1.13.6 in web, 1.13.5 in sandbox) to eliminate the security risk. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-03-05 17:26:04 +08:00
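
Commit fb95136f39 (#14090) in the log above describes an SSRF guard that allowlists URL schemes, resolves the hostname at validation time, and rejects private or reserved addresses. The following is a minimal sketch of that idea, not the code from the PR: the names `validate_url_for_crawl` and `is_safe_crawl_url` are hypothetical, and the use of `ipaddress`'s `is_global` flag is an assumption that is somewhat stricter than the "private or reserved" wording in the commit message.

```python
import ipaddress
import socket
from urllib.parse import urlparse

# Assumption: only plain web schemes are considered safe to crawl.
ALLOWED_SCHEMES = {"http", "https"}


def validate_url_for_crawl(url: str) -> None:
    """Raise ValueError if the URL should not be fetched server-side."""
    parsed = urlparse(url)
    if parsed.scheme.lower() not in ALLOWED_SCHEMES:
        raise ValueError(f"scheme not allowed: {parsed.scheme!r}")
    host = parsed.hostname
    if not host:
        raise ValueError("URL has no hostname")
    try:
        # Resolve now, so the check applies to the addresses the name maps to.
        infos = socket.getaddrinfo(host, parsed.port or 80, type=socket.SOCK_STREAM)
    except socket.gaierror as exc:
        raise ValueError(f"cannot resolve host: {host}") from exc
    for info in infos:
        # Drop an IPv6 zone suffix such as "%eth0" before parsing.
        address = info[4][0].split("%", 1)[0]
        ip = ipaddress.ip_address(address)
        if not ip.is_global:
            raise ValueError(f"resolves to non-public address: {ip}")


def is_safe_crawl_url(url: str) -> bool:
    """Boolean convenience wrapper around validate_url_for_crawl."""
    try:
        validate_url_for_crawl(url)
    except ValueError:
        return False
    return True
```

A check like this only covers the moment of validation. If the crawler resolves the hostname again later, a DNS change between the two lookups could still bypass it, so a real deployment may also want to pin the validated address.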

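Commit 356d45fda1 (#13808) in the log above mentions a helper that converts Markdown cell text to native numeric types for Excel output while preserving leading zeros and handling thousand separators and scientific notation. A rough sketch of such a helper could look like this. The function name `coerce_cell` and the exact set of accepted formats are assumptions for illustration, not the PR's implementation.

```python
import re
from typing import Union

# Plain or comma-grouped integer part, optional fraction, optional exponent.
_NUMBER_RE = re.compile(
    r"^[+-]?(?:\d{1,3}(?:,\d{3})+|\d+)(?:\.\d+)?(?:[eE][+-]?\d+)?$"
)
# Strings such as "007" look numeric but are usually codes or identifiers.
_LEADING_ZERO_RE = re.compile(r"^[+-]?0\d")


def coerce_cell(text: str) -> Union[int, float, str]:
    """Return an int or float for numeric-looking text, else the text itself."""
    stripped = text.strip()
    if not _NUMBER_RE.match(stripped) or _LEADING_ZERO_RE.match(stripped):
        return text  # keep as a string so leading zeros survive
    cleaned = stripped.replace(",", "")
    if re.fullmatch(r"[+-]?\d+", cleaned):
        return int(cleaned)
    return float(cleaned)
```

With these rules, `coerce_cell("1,234")` gives the integer `1234`, `coerce_cell("1.5e3")` gives the float `1500.0`, and `coerce_cell("007")` stays the string `"007"`.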

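Commit ab6ca75245 (#13427) in the log above fixes a connection leak by wrapping SQL execution in `try/finally` so that the cursor and connection are closed even when a statement fails. The pattern can be sketched as below. The standard-library `sqlite3` module stands in for whichever database driver is actually used, and `run_queries` is a hypothetical helper, not code from `agent/tools/exesql.py`.

```python
import sqlite3
from typing import Any, List, Tuple


def run_queries(db_path: str, statements: List[str]) -> List[List[Tuple[Any, ...]]]:
    """Run each statement and return up to 100 rows per statement."""
    db = sqlite3.connect(db_path)
    try:
        cursor = db.cursor()
        try:
            results = []
            for sql in statements:
                cursor.execute(sql)  # may raise on a syntax error or missing table
                results.append(cursor.fetchmany(100))
            return results
        finally:
            cursor.close()  # runs on success and on error
    finally:
        db.close()  # runs even if cursor creation or execution failed
```

Calling `run_queries(":memory:", ["SELECT * FROM missing_table"])` raises `sqlite3.OperationalError`, but both `finally` blocks still run before the exception propagates, so no connection is left open.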
437 Commits