ragflow

mirror of https://github.com/infiniflow/ragflow.git synced 2026-04-28 06:17:49 +08:00

Author	SHA1	Message	Date
balibabu	4e9407b4ae	Refactor: Refactoring AzureOpenAIModal using shadcn. #10427 (#12436 ) ### What problem does this PR solve? Refactor: Refactoring AzureOpenAIModal using shadcn. #10427 ### Type of change - [x] Refactoring	2026-01-05 14:09:55 +08:00
chanx	a8a060676a	Refactor: UmiJs -> Vite (#12410 ) ### What problem does this PR solve? Refactor: UmiJs -> Vite+React ### Type of change - [x] Refactoring --------- Co-authored-by: Liu An <asiro@qq.com>	2026-01-04 19:14:20 +08:00
balibabu	10c28c5ecd	Feat: Refactoring the documentation page using shadcn. #10427 (#12376 ) ### What problem does this PR solve? Feat: Refactoring the documentation page using shadcn. #10427 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-12-31 19:00:37 +08:00
chanx	a7e466142d	Fix: Dataset parse logic (#12330 ) ### What problem does this PR solve? Fix: Dataset logic of parser ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-12-30 19:53:00 +08:00
balibabu	2fccf3924d	Feat: Adapt the theme of the documentation page. #10427 (#12337 ) ### What problem does this PR solve? Feat: Adapt the theme of the documentation page. #10427 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-12-30 19:35:44 +08:00
balibabu	109e782493	Feat: On the agent page and chat page, you can only select knowledge bases that use the same embedding model. #12320 (#12321 ) ### What problem does this PR solve? Feat: On the agent page and chat page, you can only select knowledge bases that use the same embedding model. #12320 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-12-30 17:08:30 +08:00
chanx	dccda35f65	Fix: S3 parameter error (#12290 ) ### What problem does this PR solve? Fix: S3 parameter error ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) --------- Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	2025-12-29 17:38:01 +08:00
balibabu	a24fc8291b	Fix: If there is an error message on the chat page, the subsequent message references will not display correctly. #12252 (#12283 ) ### What problem does this PR solve? Fix: If there is an error message on the chat page, the subsequent message references will not display correctly. #12252 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-12-29 12:58:12 +08:00
lys1313013	37e4485415	feat: add MDX file support (#12261 ) Feat: add MDX file support #12057 ### What problem does this PR solve? <img width="1055" height="270" alt="image" src="https://github.com/user-attachments/assets/a0ab49f9-7806-41cd-8a96-f593591ab36b" /> The page states that MDX files are supported, but uploading fails with the error: "x.mdx: This type of file has not been supported yet!" <img width="381" height="110" alt="image" src="https://github.com/user-attachments/assets/4bbb7d08-cb47-416a-95fc-bc90b90fcc39" /> ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-12-29 12:54:31 +08:00
chanx	647fb115a0	Fix: Data-source S3 page style (#12255 ) ### What problem does this PR solve? Fix: Data-source S3 page style ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-12-29 09:46:35 +08:00
Yongteng Lei	a1ed4430ce	Fix: frontend cannot sync document window context (#12256 ) ### What problem does this PR solve? Frontend cannot sync document window context. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) Co-authored-by: Liu An <asiro@qq.com>	2025-12-26 20:55:22 +08:00
Yongteng Lei	51bc41b2e8	Refa: improve image table context (#12244 ) ### What problem does this PR solve? Improve image table context. Current strategy in attach_media_context: - Order by position when possible: if any chunk has page/position info, sort by (page, top, left), otherwise keep original order. - Apply only to media chunks: images use image_context_size, tables use table_context_size. - Primary matching: on the same page, choose a text chunk whose vertical span overlaps the media, then pick the one with the closest vertical midpoint. - Fallback matching: if no overlap on that page, choose the nearest text chunk on the same page (page-head uses the next text; page-tail uses the previous text). - Context extraction: inside the chosen text chunk, find a mid-sentence boundary near the text midpoint, then take context_size tokens split before/after (total budget). - No multi-chunk stitching: context comes from a single text chunk to avoid mixing unrelated segments. ### Type of change - [x] Refactoring --------- Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	2025-12-26 17:55:32 +08:00
chanx	c4a66204f0	Fix: Memory-related bug fixes (#12238 ) ### What problem does this PR solve? Fix: Memory-related bug fixes - Forget memory button text - Adjust memory storage interface ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-12-26 15:56:41 +08:00
Jin Hai	5714895291	Fix message duration (#12233 ) ### What problem does this PR solve? As title ### Type of change - [x] Refactoring Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2025-12-26 14:40:46 +08:00
Jin Hai	a33936e8ff	Fix small issues on UI (#12231 ) ### What problem does this PR solve? As title ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2025-12-26 14:21:59 +08:00
balibabu	52dbacc506	Feat: Preview the image at the bottom of the message #12076 (#12225 ) ### What problem does this PR solve? Feat: Preview the image at the bottom of the message #12076 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-12-26 12:11:19 +08:00
chanx	5fb38ecc2a	Fix: Can not select LLM in memory page (#12219 ) ### What problem does this PR solve? Fix: Can not select LLM in memory page ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-12-26 11:00:11 +08:00
balibabu	c7b5bfb809	Feat: An image carousel is displayed at the bottom of the agent's chat messages. #12076 (#12215 ) ### What problem does this PR solve? Feat: An image carousel is displayed at the bottom of the agent's chat messages. #12076 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-12-25 19:02:49 +08:00
chanx	cfd1250615	Fix: Api key modal bug (#12213 ) ### What problem does this PR solve? Fix: Api key modal bug ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-12-25 19:01:55 +08:00
chanx	2817be14d5	Fix: Metadata tips info (#12209 ) ### What problem does this PR solve? Fix: Metadata tips info ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-12-25 15:55:06 +08:00
balibabu	f6217bb990	Feat: Images referenced in chat messages are displayed as a carousel at the bottom of the message. #12076 (#12207 ) ### What problem does this PR solve? Feat: Images referenced in chat messages are displayed as a carousel at the bottom of the message. #12076 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-12-25 15:54:07 +08:00
chanx	89ea760e67	Fix: Add a no-data filter condition to MetaData (#12189 ) ### What problem does this PR solve? Fix: Add a no-data filter condition to MetaData ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-12-25 12:13:18 +08:00
chanx	4a2978150c	Fix：Metadata saving, copywriting and other related issues (#12169 ) ### What problem does this PR solve? Fix：Bugs Fixed - Text overflow issues that caused rendering problems - Metadata saving, copywriting and other related issues ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-12-25 12:12:32 +08:00
chanx	9a5c5c46f2	Fix: Add prompts when merging or deleting metadata. (#12138 ) ### What problem does this PR solve? Fix: Add prompts when merging or deleting metadata. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) --------- Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	2025-12-25 11:53:06 +08:00
Yongteng Lei	6c93157b14	Refa: image table context window (#12132 ) ### What problem does this PR solve? Image table context window ### Type of change - [x] Refactoring	2025-12-23 19:51:01 +08:00
balibabu	033029eaa1	Fix: The form waiting for input is not displayed in the dialog message. #12129 (#12130 ) ### What problem does this PR solve? Fix: The form waiting for input is not displayed in the dialog message. #12129 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-12-23 17:59:55 +08:00
chanx	8e6ddd7c1b	Fix: Metadata bugs. (#12111 ) ### What problem does this PR solve? Fix: Metadata bugs. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) --------- Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	2025-12-23 14:16:57 +08:00
balibabu	9e31631d8f	Feat: Add memory multi-select dropdown to recall and message operator forms. #4213 (#12106 ) ### What problem does this PR solve? Feat: Add memory multi-select dropdown to recall and message operator forms. #4213 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-12-23 11:54:32 +08:00
chanx	bd4eb19393	Fix:Bugs fix (Reduce metadata saving steps ...) (#12095 ) ### What problem does this PR solve? Fix:Bugs fix - Configure memory and metadata (in Chinese) - Add indexing modal - Reduce metadata saving steps ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) --------- Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	2025-12-23 11:50:35 +08:00
Jimmy Ben Klieve	38ac6a7c27	feat: add image context window in dataset config (#12094 ) ### What problem does this PR solve? Add image context window configuration in Dataset > Configduration and Dataset > Files > Parse > Ingestion Pipeline (Chunk Method modal) ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-12-22 19:51:23 +08:00
Jimmy Ben Klieve	6d3d3a40ab	fix: hide drop-zone upload button when picked an image (#12088 ) ### What problem does this PR solve? Hide drop-zone upload button when picked an image in chunk editor dialog ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-12-22 19:04:44 +08:00
chanx	51b12841d6	Feature/1217 (#12087 ) ### What problem does this PR solve? feature: Complete metadata functionality ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-12-22 17:35:12 +08:00
Jimmy Ben Klieve	b42b5fcf65	feat: display chunk type in chunk editor and dialog (#12086 ) ### What problem does this PR solve? Display chunk type in chunk editor and dialog, may be one of below: - Image - Table - Text ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-12-22 16:45:47 +08:00
balibabu	2ddfcc7cf6	Images that appear consecutively in the dialogue are displayed using a carousel. #12076 (#12077 ) ### What problem does this PR solve? Images that appear consecutively in the dialogue are displayed using a carousel. #12076 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-12-22 14:41:02 +08:00
balibabu	5ba51b21c9	Feat: When the webhook returns a field in streaming format, the message displays the status field. #10427 (#12075 ) ### What problem does this PR solve? Feat: When the webhook returns a field in streaming format, the message displays the status field. #10427 ### Type of change - [x] New Feature (non-breaking change which adds functionality) Co-authored-by: balibabu <assassin_cike@163.com>	2025-12-22 14:37:39 +08:00
Jimmy Ben Klieve	8dd2394e93	feat: add optional cache busting for image (#12055 ) ### What problem does this PR solve? Add optional cache busting for image #12003 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-12-22 09:36:45 +08:00
Jimmy Ben Klieve	47005ebe10	feat: supports multiple retrieval tool under an agent (#12046 ) ### What problem does this PR solve? Add support for multiple Retrieval tools under an agent ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-12-22 09:35:34 +08:00
chanx	eeb36a5ce7	Feature: Implement metadata functionality (#12049 ) ### What problem does this PR solve? Feature: Implement metadata functionality ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-12-19 19:13:33 +08:00
balibabu	aceca266ff	Feat: Images appearing consecutively in the dialogue are merged and displayed in a carousel. #10427 (#12051 ) ### What problem does this PR solve? Feat: Images appearing consecutively in the dialogue are merged and displayed in a carousel. #10427 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-12-19 19:13:18 +08:00
balibabu	4cbe470089	Feat: Display error messages from intermediate nodes of the webhook. #10427 (#11954 ) ### What problem does this PR solve? Feat: Remove HMAC from the webhook #10427 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-12-19 12:56:56 +08:00
Jimmy Ben Klieve	ce161f09cc	feat: add image uploader in edit chunk dialog (#12003 ) ### What problem does this PR solve? Add image uploader in edit chunk dialog for replacing image chunk ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-12-18 09:33:52 +08:00
Yongteng Lei	3820de916c	Fix: duplicated PDF parser (#12000 ) ### What problem does this PR solve? Fix duplicated PDF parser. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-12-17 19:48:10 +08:00
Jimmy Ben Klieve	4046bffaf1	fix: unable to save ingestion pipeline config without modifying children delimiter (#11991 ) …ildren delimiter ### What problem does this PR solve? Fix the issue of unable to save Files > Ingestion Pipeline (Modal) config without modifying children delimiter ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-12-17 15:37:28 +08:00
Yongteng Lei	03f9be7cbb	Refa: only support MinerU-API now (#11977 ) ### What problem does this PR solve? Only support MinerU-API now, still need to complete frontend for pipeline to allow the configuration of MinerU options. ### Type of change - [x] Refactoring	2025-12-17 12:58:48 +08:00
chanx	205a6483f5	Feature：memory function complete (#11982 ) ### What problem does this PR solve? memory function complete ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-12-17 12:35:26 +08:00
Jimmy Ben Klieve	2595644dfd	feat: add ingestion pipeline children delimiters configs (#11979 ) ### What problem does this PR solve? Add children delimiters for Ingestion pipeline config ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2025-12-17 11:18:54 +08:00
concertdictate	49c74d08e8	Feature/mineru improvements (#11938 ) 我已在下面的评论中用中文重复说明。 ### What problem does this PR solve? ## Summary This PR enhances the MinerU document parser with additional configuration options, giving users more control over PDF parsing behavior and improving support for multilingual documents. ## Changes ### Backend (`deepdoc/parser/mineru_parser.py`) - Added configurable parsing options: - Parse Method: `auto`, `txt`, or `ocr` — allows users to choose the extraction strategy - Formula Recognition: Toggle for enabling/disabling formula extraction (useful to disable for Cyrillic documents where it may cause issues) - Table Recognition: Toggle for enabling/disabling table extraction - Added language code mapping (`LANGUAGE_TO_MINERU_MAP`) to translate RAGFlow language settings to MinerU-compatible language codes for better OCR accuracy - Improved parser configuration handling to pass these options through the processing pipeline ### Frontend (`web/`) - Created new `MinerUOptionsFormField` component that conditionally renders when MinerU is selected as the layout recognition engine - Added UI controls for: - Parse method selection (dropdown) - Formula recognition toggle (switch) - Table recognition toggle (switch) - Added i18n translations for English and Chinese - Integrated the options into both the dataset creation dialog and dataset settings page ### Integration - Updated `rag/app/naive.py` to forward MinerU options to the parser - Updated task service to handle the new configuration parameters ## Why MinerU is a powerful document parser, but the default settings don't work well for all document types. This PR allows users to: 1. Choose the best parsing method for their documents 2. Disable formula recognition for Cyrillic/non-Latin scripts where it causes issues 3. Control table extraction based on document needs 4. Benefit from automatic language detection for better OCR results ## Testing - [x] Tested MinerU parsing with different parse methods - [x] Verified UI renders correctly when MinerU is selected/deselected - [x] Confirmed settings persist correctly in dataset configuration ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) - [x] New Feature (non-breaking change which adds functionality) - [ ] Documentation Update - [x] Refactoring - [ ] Performance Improvement - [ ] Other (please describe): --------- Co-authored-by: user210 <user210@rt> Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	2025-12-16 13:15:25 +08:00
chanx	a98887d4ca	Fix: Bug fixes (#11960 ) ### What problem does this PR solve? Fix: Bug fixes New search popup style modification Fixed multilingual settings not updating immediately on personal center page Changed overlapped percent to percentage format, with maximum value of 30% ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2025-12-16 09:44:06 +08:00
PentaFDevs	f9510edbbc	Feature/docs generator (#11858 ) ### Type of change - [x] New Feature (non-breaking change which adds functionality) ### What problem does this PR solve? This PR introduces a new Docs Generator agent component for producing downloadable PDF, DOCX, or TXT files from Markdown content generated within a RAGFlow workflow. ### Key Features Backend - New component: DocsGenerator (agent/component/docs_generator.py) - - Markdown → PDF/DOCX/TXT conversion - - Supports tables, lists, code blocks, headings, and rich formatting - - Configurable document style (fonts, margins, colors, page size, orientation) - - Optional header logo and footer with page numbers/timestamps - Frontend - New configuration UI for the Docs Generator - - Download button integrated into the chat interface - - Output wired to the Message component - - Full i18n support Documentation Added component guide: docs/guides/agent/agent_component_reference/docs_generator.md Usage Add the Docs Generator to a workflow, connect Markdown output from an upstream component, configure metadata/style, and feed its output into the Message component. Users will see a document download button directly in the chat. Contributor Note We have been following RAGFlow since more than a year and half now and have worked extensively on personalizing the framework and integrating it into several of our internal systems. Over the past year and a half, we have built multiple platforms that rely on RAGFlow as a core component, which has given us a strong appreciation for how flexible and powerful the project is. We also previously contributed the full Italian translation, and we were glad to see it accepted. This new Docs Generator component was created for our own production needs, and we believe that it may be useful for many others in the community as well. We want to sincerely thank the entire RAGFlow team for the remarkable work you have done and continue to do. If there are opportunities to contribute further, we would be glad to help whenever we have time available. It would be a pleasure to support the project in any way we can. If appropriate, we would be glad to be listed among the project’s contributors, but in any case we look forward to continuing to support and contribute to the project. PentaFrame Development Team --------- Co-authored-by: PentaFrame <info@pentaframe.it> Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	2025-12-12 14:59:43 +08:00
balibabu	22a51a3868	Feat: Add mineru as a model manufacturer to the system. #10621 (#11903 ) ### What problem does this PR solve? Feat: Add mineru as a model manufacturer to the system. #10621 ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Co-authored-by: balibabu <assassin_cike@163.com>	2025-12-11 17:37:10 +08:00

1 2 3 4 5 ...

581 Commits