ragflow

mirror of https://github.com/infiniflow/ragflow.git synced 2026-05-26 10:47:21 +08:00

Author	SHA1	Message	Date
buua436	e6e80041f5	Fix: agent toolcall null response & schema validation & DeepSeek think history (#14425 ) ### What problem does this PR solve? agent toolcall null response & schema validation & DeepSeek think history ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-04-28 17:09:08 +08:00
buua436	82313020c7	Refa: align list operations and strict mode (#14387 ) ### What problem does this PR solve? align list operations and strict mode ### Type of change - [x] Refactoring	2026-04-27 19:13:00 +08:00
Xing Hong	fb95136f39	Fix: validate URL scheme and resolved IP before crawling to prevent SSRF (#14090 ) ### What problem does this PR solve? The POST /upload_info?url=<url> endpoint accepted a user-supplied URL and passed it directly to AsyncWebCrawler without any validation. There were no restrictions on URL scheme, destination hostname, or resolved IP address. This allowed any authenticated user to instruct the server to make outbound HTTP requests to internal infrastructure — including RFC 1918 private networks, loopback addresses, and cloud metadata services such as http://169.254.169.254 — effectively using the server as a proxy for internal network reconnaissance or credential theft. This PR adds an SSRF guard (_validate_url_for_crawl) that runs before any crawl is initiated. It enforces an allowlist of safe schemes (http/https), resolves the hostname at validation time, and rejects any URL whose resolved IP falls within a private or reserved network range. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-04-25 14:30:15 +08:00
Cocoon-Break	620088be2f	fix: check isinstance before len in VariableAssigner _remove_first/_remove_last (#14281 ) fix: check isinstance before len in VariableAssigner _remove_first/_remove_last	2026-04-24 19:09:44 +08:00
Magicbook1108	9c7c105007	Fix: Doc generator (#14223 ) ### What problem does this PR solve? Doc generator ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-04-20 16:37:33 +08:00
Magicbook1108	d053317c4d	Fix: variable in doc generator (#14180 ) ### What problem does this PR solve? Fix: variable in doc generator ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-04-20 14:19:42 +08:00
Magicbook1108	901023a80a	Fix: literal eval http request input (#14145 ) ### What problem does this PR solve? Fix: literal eval http request input ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) <img width="700" alt="img_v3_0210q_f4b49ff7-e670-4054-ab0e-9443a09215fg" src="https://github.com/user-attachments/assets/089300be-06f9-4bb6-97af-61bf5f4a5e8c" /> <img width="700" alt="img_v3_0210q_398cd52a-2ad9-42be-8d5b-4e6e68a7d22g" src="https://github.com/user-attachments/assets/239b43cd-a2a5-49d8-9200-991bb26336c8" />	2026-04-16 16:52:34 +08:00
Yongteng Lei	356ba5650a	Fix: sandbox don't attach attachment metadata (#14135 ) ### What problem does this PR solve? Sandbox don't attach attachment metadata ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-04-16 12:08:54 +08:00
Magicbook1108	1376c004a9	Fix: update docs generator (#14070 ) ### What problem does this PR solve? Refactor: update docs generator ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) 1. Support multiple document generator components and correctly display messages in the message component. The document generator will not overwrite other messages. <img width="700" alt="Screenshot from 2026-04-13 13-56-17" src="https://github.com/user-attachments/assets/3f3e06e8-33ce-4df1-8b05-510c86af70a4" /> 2. Support Chinese content and ensure correct Markdown rendering in PDF and DOCX <img width="700" alt="image" src="https://github.com/user-attachments/assets/69bf1f7b-261d-48e5-a9f3-8e94462b90ed" /> 3. Simplify configuration page and support more output format <img height="700" alt="image" src="https://github.com/user-attachments/assets/8647374c-c055-4daa-ad71-cd9052eb138e" /> 4. Hide download from other components except for message <img width="700" alt="image" src="https://github.com/user-attachments/assets/a723dfcb-b60d-4eb5-b2f6-d41ca5955eb4" /> <img width="700" alt="image" src="https://github.com/user-attachments/assets/a8762ac4-807b-4f0b-9287-65f82f7c9c98" /> 5. Sanitize filename <img width="700" alt="image" src="https://github.com/user-attachments/assets/df49509f-37c0-40f9-b03d-bd6ce7fdefa8" /> 6. And more changes on usability	2026-04-14 15:24:43 +08:00
Jin Hai	2b6c50734f	Sync code from EE (#14080 ) ### What problem does this PR solve? As title. ### Type of change - [x] Refactoring --------- Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-04-14 15:03:46 +08:00
天海蒼灆	356d45fda1	Feat: add cell type coercion for Excel export (#13808 ) ### What problem does this PR solve? - Implemented a helper function to convert markdown cell text to native numeric types for Excel output. - Ensured that leading zeros are preserved and handled various numeric formats, including those with thousand separators and scientific notation. ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2026-04-13 20:54:57 +08:00
Ricardo-M-L	c13f8856a1	fix: correct typos in agent component filename and templates (#13930 ) ## Summary - Rename misspelled file `varaiable_aggregator.py` → `variable_aggregator.py` - Fix `unkown` → `unknown` in template and frontend constant (3 instances) - Fix `Finale` → `Final` in customer feedback template (2 instances) ## Test plan - [ ] Verify variable aggregator component loads correctly - [ ] Verify agent templates render properly 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-authored-by: yuj <yuj@ztjzsoft.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com> Co-authored-by: Jin Hai <haijin.chn@gmail.com>	2026-04-09 11:06:01 +08:00
Ricardo-M-L	354108922b	fix: use f-string with separator in switch operator error message (#13915 ) \`switch.py\` line 137 concatenates the operator directly after the text without separator: \`'Not supported operator' + operator\` → produces \`"Not supported operatorXXX"\` Changed to: \`f'Not supported operator: {operator}'\`	2026-04-03 16:49:28 +08:00
Ricardo-M-L	09a09a5b20	fix: correct typo in IterationItem name check and incomplete error message (#13890 ) Two small fixes: 1. iterationitem.py line 72: Typo "interationitem" → "iterationitem" (missing 't'). The component name check never matched IterationItem components. 2. raptor.py line 94: Error message "Embedding error: " had a trailing colon with no details. Changed to "Embedding error: empty embeddings returned".	2026-04-02 10:35:28 +08:00
Lynn	db57155b30	Fix: get user_id from variables (#13716 ) ### What problem does this PR solve? Get user_id from canvas variable when input a {} pattern value. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-03-20 23:39:34 +08:00
Yongteng Lei	dd839f30e8	Fix: code supports matplotlib (#13724 ) ### What problem does this PR solve? Code as "final" node: ![img_v3_02vs_aece4caf-8403-4939-9e68-9845a22c2cfg](https://github.com/user-attachments/assets/9d87b8df-da6b-401c-bf6d-8b807fe92c22) Code as "mid" node: ![img_v3_02vv_f74f331f-d755-44ab-a18c-96fff8cbd34g](https://github.com/user-attachments/assets/c94ef3f9-2a6c-47cb-9d2b-19703d2752e4) ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2026-03-20 20:32:00 +08:00
Daniil Sivak	dee68c571b	Feat: support variable interpolation in headers (#13680 ) Closes #13277 ### What problem does this PR solve? Adds `{variable_name}` (and `{component@variable}`) interpolation support to HTTP header values in the `Invoke` component, matching the existing URL interpolation behavior. ### Type of change - [x] New Feature (non-breaking change which adds functionality) <img width="1280" height="867" alt="image" src="https://github.com/user-attachments/assets/8ab7b4e9-7cc0-4a7f-8a5f-f838a15a5fda" /> --------- Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com>	2026-03-18 22:38:20 +08:00
Yongteng Lei	53e395ca2e	Fix: cannot debug invoke component (#13649 ) ### What problem does this PR solve? Cannot debug invoke component. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-03-18 14:22:13 +08:00
Lynn	02070bab2a	Feat: record user_id in memory (#13585 ) ### What problem does this PR solve? Get user_id from canvas and record it. ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2026-03-13 15:38:35 +08:00
JiangNan	2634cfc06f	Fix: undefined variable and wrong method name in agent components (#13462 ) ## Summary This PR fixes two runtime bugs in agent components: Bug 1: `agent/component/invoke.py` — `NameError` in POST + `clean_html` path The POST method's `clean_html` branch uses the variable `sections` without ever defining it. Both the GET and PUT branches correctly call `sections = HtmlParser()(None, response.content)` before referencing `sections`, but this line was missing from the POST branch (copy-paste omission). This causes a `NameError` whenever a user configures an Invoke component with `method="post"` and `clean_html=True`. Bug 2: `agent/component/data_operations.py` — `AttributeError` in `_recursive_eval` The `_recursive_eval` method recursively calls `self.recursive_eval()` (without the leading underscore) instead of `self._recursive_eval()`. Since the method is defined as `_recursive_eval`, this causes an `AttributeError` at runtime when the `literal_eval` operation processes nested dicts or lists. ## Test plan - [ ] Configure an Invoke node with `method=post` and `clean_html=True`, verify HTML is parsed correctly without `NameError` - [ ] Configure a DataOperations node with `operations=literal_eval` on nested data, verify no `AttributeError` --------- Signed-off-by: JiangNan <1394485448@qq.com>	2026-03-09 11:09:47 +08:00
Lynn	0214257886	Fix: init func (#13430 ) ### What problem does this PR solve? Fix update_cnt add error in init_data. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-03-06 11:42:31 +08:00
Lynn	62cb292635	Feat/tenant model (#13072 ) ### What problem does this PR solve? Add id for table tenant_llm and apply in LLMBundle. ### Type of change - [x] Refactoring --------- Co-authored-by: Yingfeng <yingfeng.zhang@gmail.com> Co-authored-by: Liu An <asiro@qq.com>	2026-03-05 17:27:17 +08:00
statxc	839b603768	feat: Add PDF parser selection to Agent Begin and Await Response comp… (#13325 ) ### Issue: #12756 ### What problem does this PR solve? When users upload files through Agent's Begin or Await Response components, the parsing is hardcoded to "Plain Text", ignoring all other available parsers (DeepDOC, TCADP, Docling, MinerU, PaddleOCR). This PR adds a PDF parser dropdown to these components so users can select the appropriate parser for their file inputs. ### Changes Backend - `agent/component/fillup.py` - Added `layout_recognize` param to `UserFillUpParam`, forwarded to `FileService.get_files()` - `agent/component/begin.py` - Same forwarding in `Begin._invoke()` - `agent/canvas.py` - Extract Begin's `layout_recognize` for `sys.files` parsing, added param to `get_files_async()` / `get_files()` - `api/db/services/file_service.py` - Added `layout_recognize` param to `parse()` and `get_files()`, replacing hardcoded `"Plain Text"` - `rag/app/naive.py` - Added `"plain text"` and `"tcadp parser"` aliases to PARSERS dict to match dropdown values after `.lower()` Frontend - `web/src/pages/agent/form/begin-form/index.tsx` - Show `LayoutRecognizeFormField` dropdown when file inputs exist - `web/src/pages/agent/form/begin-form/schema.ts` - Added `layout_recognize` to Zod schema - `web/src/pages/agent/form/user-fill-up-form/index.tsx` - Same dropdown for Await Response component ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2026-03-04 11:09:33 +08:00
balibabu	eca60208e3	Fix: The document generation node cannot generate the output content of a large model to a file. #13321 (#13326 ) ### What problem does this PR solve? Fix: The document generation node cannot generate the output content of a large model to a file. #13321 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-03-03 11:05:24 +08:00
Yihang Wang	7fc97da610	security: Adopt Jinja2 SandboxedEnvironment for template rendering. (#13305 )	2026-03-02 13:17:29 +08:00
tuandang-diag	d89ad8b79d	fix: handle null response in LLM and improve JSON parsing in agent (#13187 ) Fixes AttributeError in _remove_reasoning_content() when LLM returns None, and improves JSON parsing regex for markdown code fences in agent_with_tools.py	2026-02-24 13:15:09 +08:00
Lynn	67befc9119	Fix: add back MCP tool custom header (#13188 ) ### What problem does this PR solve? Add back custom header when use MCP. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-02-24 13:14:21 +08:00
Magicbook1108	98e1d5aa5c	Refact: switch from google-generativeai to google-genai (#13140 ) ### What problem does this PR solve? Refact: switch from oogle-generativeai to google-genai #13132 Refact: commnet out unused pywencai. ### Type of change - [x] Refactoring	2026-02-24 10:28:33 +08:00
Magicbook1108	109441628b	Fix: upload image files (#13071 ) ### What problem does this PR solve? Fix: upload image files ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-02-11 09:47:33 +08:00
BitToby	4d4b5a978d	feat: enable multi-file upload for chat and agent workflows (#12977 ) ### Closes: #12921 ### What problem does this PR solve? Previously, multi-file upload was not working correctly across the application: - Chat: UI displayed "Upload max 5 files" but only the first file was actually uploaded - Agent conversational mode: Frontend sent multiple files but backend only processed one - Agent task-mode file inputs: Explicitly limited to single file only This PR enables proper multi-file upload support for both chat and agent workflows, allowing users to upload and process multiple files (up to 5) as the UI originally suggested. Changes: - `web/src/pages/next-chats/hooks/use-upload-file.ts`: Process all files instead of only `files[0]` - `api/apps/canvas_app.py`: Handle multiple files via `files.getlist("file")` - `web/src/pages/agent/debug-content/uploader.tsx`: Allow up to 5 files with `multiple={true}` - `agent/component/begin.py` & `fillup.py`: Support file arrays while maintaining backward compatibility ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2026-02-04 18:03:21 +08:00
zhanglei	7cbe8b5b53	feat: Add a custom header to the SDK for chatting with the agent. (#12430 ) ### What problem does this PR solve? As title. ### Type of change - [x] New Feature (non-breaking change which adds functionality) Co-authored-by: Liu An <asiro@qq.com>	2026-02-03 11:01:18 +08:00
Kevin Hu	08c01b76d5	Fix: missing parent chunk issue. (#12789 ) ### What problem does this PR solve? Close #12783 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-01-23 12:54:08 +08:00
Kevin Hu	927db0b373	Refa: asyncio.to_thread to ThreadPoolExecutor to break thread limitat… (#12716 ) ### Type of change - [x] Refactoring	2026-01-20 13:29:37 +08:00
Yongteng Lei	941651a16f	Fix: wrong input trace in Category component (#12590 ) ### What problem does this PR solve? Wrong input trace in Category component ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-01-13 17:54:57 +08:00
Lynn	4a6d37f0e8	Fix: use async task to save memory (#12308 ) ### What problem does this PR solve? Use async task to save memory. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) --------- Co-authored-by: Jin Hai <haijin.chn@gmail.com>	2025-12-30 11:41:38 +08:00
Lynn	d285d8cd97	Fix: memory (#12230 ) ### What problem does this PR solve? Judge has attr memory_ids ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-12-26 14:42:47 +08:00
Lynn	7498bc63a3	Fix: judge retrieval from (#12223 ) ### What problem does this PR solve? Judge retrieval from in retrieval component, and fix bug in message component ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-12-26 13:01:46 +08:00
Lynn	6e9691a419	Feat: message manage (#12196 ) ### What problem does this PR solve? Manage message and use in agent. Issue #4213 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-12-25 21:18:13 +08:00
Kevin Hu	f0dac1d90e	Fix: loopitem None issue. (#12166 ) ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-12-25 12:12:38 +08:00
TeslaZY	badd5aa101	Fix: LLM tool does not exist in multiple retrieval case (#12143 ) ### What problem does this PR solve? Fix LLM tool does not exist in multiple retrieval case ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-12-25 11:53:51 +08:00
buua436	957bc021eb	Fix:remove duplicate tool_meta (#12139 ) ### What problem does this PR solve? pr:#12117 change:remove duplicate tool_meta ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-12-25 11:53:24 +08:00
Kevin Hu	8197f9a873	Fix: table tag on chunks. (#12126 ) ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-12-25 11:25:38 +08:00
TeslaZY	d1bc7ad2ee	Fix only one of multiple retrieval tools is effective (#12110 ) ### What problem does this PR solve? Fix only one of multiple retrieval tools is effective ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-12-23 14:08:25 +08:00
buua436	321474fb97	Fix: update method call to use simplified async tool reaction (#12108 ) ### What problem does this PR solve? pr:#12091 change:update method call to use simplified async tool reaction ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-12-23 13:36:58 +08:00
buua436	1444de981c	Feat: enhance webhook response to include status and success fields and simplify ReAct agent (#12091 ) ### What problem does this PR solve? change： enhance webhook response to include status and success fields and simplify ReAct agent ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-12-23 09:36:08 +08:00
buua436	57edc215d7	Feat:update webhook component (#11739 ) ### What problem does this PR solve? issue: https://github.com/infiniflow/ragflow/issues/10427 https://github.com/infiniflow/ragflow/issues/8115 change: - Support for Multiple HTTP Methods (POST / GET / PUT / PATCH / DELETE / HEAD) - Security Validation 1. max_body_size 2. IP whitelist 3. rate limit 4. token / basic / jwt authentication - File Upload Support - Unified Content-Type Handling - Full Schema-Based Extraction & Type Validation - Two Execution Modes: Immediately / Streaming ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-12-18 19:34:39 +08:00
Jin Hai	d38f8a1562	Add license and Fix IDE warnings (#11985 ) ### What problem does this PR solve? - Add license - Fix IDE warnings ### Type of change - [x] Refactoring --------- Signed-off-by: Jin Hai <haijin.chn@gmail.com> Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	2025-12-17 17:04:44 +08:00
Jin Hai	0e8b9588ba	Fix error and format issue (#11975 ) ### What problem does this PR solve? 1. Fix error of book chunking. 2. Fix format issues. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) - [x] Refactoring --------- Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2025-12-16 19:29:37 +08:00
shivam johri	5bba562048	Feature/excel export fix (#11914 ) ### PR details feat: Add Excel export support and fix variable reference regex Changes: - Add Excel export output format option to Message component - Apply nest_asyncio patch to handle nested event loops - Fix async generator iteration in canvas_app.py debug endpoint - Add underscore support in variable reference regex pattern ### What problem does this PR solve? ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Co-authored-by: Shivam Johri <shivamjohri@Shivams-MacBook-Air.local>	2025-12-16 13:15:52 +08:00
PentaFDevs	f9510edbbc	Feature/docs generator (#11858 ) ### Type of change - [x] New Feature (non-breaking change which adds functionality) ### What problem does this PR solve? This PR introduces a new Docs Generator agent component for producing downloadable PDF, DOCX, or TXT files from Markdown content generated within a RAGFlow workflow. ### Key Features Backend - New component: DocsGenerator (agent/component/docs_generator.py) - - Markdown → PDF/DOCX/TXT conversion - - Supports tables, lists, code blocks, headings, and rich formatting - - Configurable document style (fonts, margins, colors, page size, orientation) - - Optional header logo and footer with page numbers/timestamps - Frontend - New configuration UI for the Docs Generator - - Download button integrated into the chat interface - - Output wired to the Message component - - Full i18n support Documentation Added component guide: docs/guides/agent/agent_component_reference/docs_generator.md Usage Add the Docs Generator to a workflow, connect Markdown output from an upstream component, configure metadata/style, and feed its output into the Message component. Users will see a document download button directly in the chat. Contributor Note We have been following RAGFlow since more than a year and half now and have worked extensively on personalizing the framework and integrating it into several of our internal systems. Over the past year and a half, we have built multiple platforms that rely on RAGFlow as a core component, which has given us a strong appreciation for how flexible and powerful the project is. We also previously contributed the full Italian translation, and we were glad to see it accepted. This new Docs Generator component was created for our own production needs, and we believe that it may be useful for many others in the community as well. We want to sincerely thank the entire RAGFlow team for the remarkable work you have done and continue to do. If there are opportunities to contribute further, we would be glad to help whenever we have time available. It would be a pleasure to support the project in any way we can. If appropriate, we would be glad to be listed among the project’s contributors, but in any case we look forward to continuing to support and contribute to the project. PentaFrame Development Team --------- Co-authored-by: PentaFrame <info@pentaframe.it> Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	2025-12-12 14:59:43 +08:00

1 2 3 4 5 ...

285 Commits