ragflow

mirror of https://github.com/infiniflow/ragflow.git synced 2026-05-26 10:47:21 +08:00

Author	SHA1	Message	Date
Jin Hai	393efa9b7c	Refactor variable of front end (#13953 ) ### What problem does this PR solve? api_host -> webAPI ExternalApi -> restAPIv1 ### Type of change - [x] Refactoring <!-- This is an auto-generated comment: release notes by coderabbit.ai --> ## Summary by CodeRabbit * Refactor * Updated internal API endpoint configuration to use consolidated base URL constants for improved maintainability and consistency across the application. * Chores * Updated server-side protocol validation for admin connectivity checks. <!-- end of auto-generated comment: release notes by coderabbit.ai --> --------- Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-04-07 15:08:11 +08:00
balibabu	38acf34724	Fix: The agent selected a knowledge base, but the API returned the error: "No dataset is selected". (#13950 ) ### What problem does this PR solve? Fix: The agent selected a knowledge base, but the API returned the error: "No dataset is selected". ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) --------- Co-authored-by: balibabu <assassin_cike@163.com>	2026-04-07 14:16:37 +08:00
Ricardo-M-L	29cf8aba48	fix: correct typos in locale files and search hooks (#13932 ) ## Summary - Fix `Refrence` → `Reference` in zh, id, zh-traditional locale files (en.ts already correct) - Fix `from from` → `from` and `this files` → `this file` in en.ts - Fix variable name `reponse` → `response` in search hooks ## Test plan - [ ] Verify UI strings display correctly - [ ] Verify search functionality works with renamed variable 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-authored-by: yuj <yuj@ztjzsoft.com> Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com> Co-authored-by: Jin Hai <haijin.chn@gmail.com>	2026-04-07 12:26:25 +08:00
Yongteng Lei	112007243d	Refa: refine code_exec component (#13925 ) ### What problem does this PR solve? Refine code_exec component. ### Type of change - [x] Refactoring	2026-04-07 11:48:29 +08:00
buildearth	a0be7c7ca7	Fix(connector): expose id_column, timestamp_column, metadata_columns for MySQL/PostgreSQL incremental sync (#13849 ) ### What problem does this PR solve? The MySQL and PostgreSQL sync classes in `sync_data_source.py` were not passing `id_column`, `timestamp_column`, and `metadata_columns` to `RDBMSConnector`, making incremental sync and document update impossible even when configured. - Without `id_column`: updated records generate new documents instead of overwriting existing ones (doc ID is derived from content hash, so any change produces a new ID). - Without `timestamp_column`: `poll_source` always falls back to full sync, ignoring the configured time range. - The three fields existed in the frontend default values but had no form inputs, so users had no way to fill them in. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) - [x] New Feature (non-breaking change which adds functionality) ### Changes - Backend (`rag/svr/sync_data_source.py`): pass `id_column`, `timestamp_column`, and `metadata_columns` from `self.conf` to `RDBMSConnector` for both `MySQL` and `PostgreSQL` sync classes. - Frontend (`web/src/pages/user-setting/data-source/constant/index.tsx`): add `ID Column`, `Timestamp Column`, and `Metadata Columns` form fields to MySQL and PostgreSQL data source configuration UI with tooltips. Signed-off-by: lixintao <lixintao@uniontech.com> Co-authored-by: lixintao <lixintao@uniontech.com>	2026-04-07 10:24:30 +08:00
Magicbook1108	69264b3a70	Feat: Refact pipeline (#13826 ) ### What problem does this PR solve? ### Type of change - [x] New Feature (non-breaking change which adds functionality) - [x] Refactoring --------- Co-authored-by: Zhichang Yu <yuzhichang@gmail.com> Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-03 19:26:45 +08:00
akie	35b2a714f9	Fix: tag datasets not visible in tag sets dropdown (#13921 ) ## Problem Description When a user creates Dataset A using the Tag parser (for CSV/Excel files with tag definitions), and then creates Dataset B, the Tag Sets dropdown in Dataset B's Configuration page cannot display Dataset A. ### Steps to Reproduce 1. Create Dataset A with Tag as the chunking method 2. Upload a CSV file to Dataset A to generate tags 3. Create Dataset B 4. Navigate to Dataset B → Configuration → Tag Sets 5. Expected: Dataset A should appear in the dropdown 6. Actual: The dropdown is empty, Dataset A is not visible --- ## Root Cause Analysis After thorough code review, the original code logic is correct. The `chunk_method` field flows properly through the system: ### Data Flow ```mermaid sequenceDiagram participant Frontend participant Pydantic participant API participant Database Note over Frontend,Database: Creating a Tag Dataset Frontend->>Pydantic: POST {chunk_method: "tag"} Pydantic->>API: serialization_alias converts<br/>chunk_method → parser_id API->>Database: INSERT {parser_id: "tag"} Note over Frontend,Database: Querying Datasets Frontend->>API: GET /api/v1/datasets API->>Database: SELECT parser_id, ... Database-->>API: Returns {parser_id: "tag"} API->>API: remap_dictionary_keys()<br/>parser_id → chunk_method API-->>Frontend: {chunk_method: "tag"} Note over Frontend: Filter: x.chunk_method === 'tag' Note over Frontend: ✅ Match found! ``` ### Field Mapping Location: `api/utils/api_utils.py:657-662` ```python DEFAULT_KEY_MAP = { "chunk_num": "chunk_count", "doc_num": "document_count", "parser_id": "chunk_method", # Maps DB field to API response "embd_id": "embedding_model", } ``` ### Frontend Filtering (Already Correct) Location: `web/src/pages/dataset/dataset-setting/components/tag-item.tsx:24` ```typescript const knowledgeOptions = knowledgeList .filter((x) => x.chunk_method === 'tag') // ✅ Correct field .map((x) => ({...})); ``` --- ## Actual Issue The most likely causes for the "bug" are: 1. Browser Cache: Old data cached before proper deployment 2. Stale Data: Datasets created before the code was fully deployed 3. Container Not Restarted: Changes not applied to running container --- ## Resolution No code changes are needed. The existing code correctly: 1. Accepts `chunk_method` from frontend 2. Converts to `parser_id` via Pydantic serialization_alias 3. Stores in database as `parser_id` 4. Maps back to `chunk_method` in API response 5. Frontend filters by `chunk_method === 'tag'`	2026-04-03 17:29:10 +08:00
balibabu	5b43c7cf16	Feat: Place the language configuration in web/.env for easy user configuration. (#13920 ) ### What problem does this PR solve? Feat: Place the language configuration in web/.env for easy user configuration. ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2026-04-03 16:50:18 +08:00
Yongteng Lei	b7daf6285b	Refa: Chat conversations /convsersation API to RESTFul (#13893 ) ### What problem does this PR solve? Chat conversations /convsersation API to RESTFul. ### Type of change - [x] Refactoring	2026-04-02 20:49:23 +08:00
Sank	1c2c4b337e	[RU] Add schema synchronization and translate (#13891 ) ### What problem does this PR solve? Add schema synchronization and translate ### Type of change - [x] Translation into Russian	2026-04-02 11:18:27 +08:00
balibabu	af40be68c3	Fix: The dataset on the list page cannot be renamed. (#13886 ) ### What problem does this PR solve? Fix: The dataset on the list page cannot be renamed. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-04-01 20:53:05 +08:00
Yongteng Lei	b622c47ed6	Refa: Chats /chat API to RESTFul (#13881 ) ### What problem does this PR solve? Refactor Chats /chat API to RESTFul. ### Type of change - [x] Refactoring	2026-04-01 20:10:37 +08:00
Liu An	b1d28b5898	Revert "Refa: Chats /chat API to RESTFul (#13871 )" (#13877 ) ### What problem does this PR solve? This reverts commit `1a608ac411`. ### Type of change - [x] Other (please describe):	2026-04-01 11:05:29 +08:00
Yongteng Lei	1a608ac411	Refa: Chats /chat API to RESTFul (#13871 ) ### What problem does this PR solve? Chats /chat API to RESTFul. ### Type of change - [x] Refactoring	2026-04-01 10:50:22 +08:00
balibabu	00b62dd587	Feat: If a model configured in the agent is deleted from the user center, a notification will be displayed on the canvas with a red border. (#13872 ) ### What problem does this PR solve? Feat: If a model configured in the agent is deleted from the user center, a notification will be displayed on the canvas with a red border. ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2026-03-31 18:43:24 +08:00
Sank	68b4287892	Add translate [RU] for MinerU (#13832 ) add translate for MinerU in knowledgeConfiguration ### Type of change - [X] Other (please describe):	2026-03-31 17:03:31 +08:00
balibabu	36513313f8	Fix: The agent form sheet will be obscured by the message log sheet. (#13870 ) ### What problem does this PR solve? Fix: The agent form sheet will be obscured by the message log sheet. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-03-31 16:18:43 +08:00
balibabu	4f27090289	Fix: Unable to reconnect after deleting the connection between begin and parser #13868 (#13869 ) ### What problem does this PR solve? Fix: Unable to reconnect after deleting the connection between begin and parser #13868 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-03-31 14:44:47 +08:00
balibabu	3a4f0d1a83	Fix: The chat settings are not displayed correctly on the first page load. (#13855 ) ### What problem does this PR solve? Fix: The chat settings are not displayed correctly on the first page load. ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2026-03-30 20:16:52 +08:00
KeJun	cb78ce0a7b	feat: support rss datasource (#13721 ) ### What problem does this PR solve? Supporting public RSS/Atom feed URLs as data sources for RagFlow. link https://github.com/infiniflow/ragflow/issues/12313 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2026-03-27 22:58:44 +08:00
balibabu	308b3a1299	Feat: Remove antd-related code and upgrade lucide-react to the latest version. (#13830 ) ### What problem does this PR solve? Feat: Remove antd-related code and upgrade lucide-react to the latest version. ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2026-03-27 19:24:52 +08:00
Sank	992a15146d	Add translate autoMetadata (#13807 ) ### What problem does this PR solve? add translate autoMetadata in Russia ### Type of change - [x] Other	2026-03-27 09:31:20 +08:00
Renzo	f3aa3381a2	Fix username line break in SharedBadge component (#13794 ) ## Summary - Added Tailwind truncation classes (`inline-block max-w-[120px] truncate align-middle`) to the username `<span>` in `SharedBadge` to prevent long usernames from wrapping onto multiple lines - Added `title` attribute to show the full username on hover when truncated ![ragflow](https://github.com/user-attachments/assets/8b3d8c03-d605-4957-bcf0-8b4d81fc4e70) ## Test plan - [x] Verify long usernames display truncated with ellipsis (`...`) - [x] Verify hovering over a truncated username shows the full name as a tooltip - [x] Verify short usernames display normally without truncation Closes #13748	2026-03-27 09:31:08 +08:00
balibabu	8402fcac6b	Fix: The chunk method of the knowledge base cannot be saved. (#13813 ) ### What problem does this PR solve? Fix: The chunk method of the knowledge base cannot be saved. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-03-26 19:05:49 +08:00
Syed Shahmeer Ali	ff92b5575b	Fix: /file2document/convert blocks event loop on large folders causing 504 timeout (#13784 ) Problem The /file2document/convert endpoint ran all file lookups, document deletions, and insertions synchronously inside the request cycle. Linking a large folder (~1.7GB with many files) caused 504 Gateway Timeout because the blocking DB loop held the HTTP connection open for too long. Fix - Extracted the heavy DB work into a plain sync function _convert_files - Inputs are validated and folder file IDs expanded upfront (fast path) - The blocking work is dispatched to a thread pool via get_running_loop().run_in_executor() and the endpoint returns 200 immediately - Frontend only checks data.code === 0 so the response change (file2documents list → True) has no impact Fixes #13781 --------- Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-26 16:45:10 +08:00
Yongteng Lei	d19ca71b43	Refa: Searches /search API to RESTFul (#13770 ) ### What problem does this PR solve? Searches /search API to RESTFul ### Type of change - [x] Documentation Update - [x] Refactoring Co-authored-by: Jin Hai <haijin.chn@gmail.com> Co-authored-by: Yingfeng <yingfeng.zhang@gmail.com>	2026-03-26 01:07:41 +08:00
balibabu	33948b9dd8	Fix: Fix the issue of errors when creating datasets. (#13787 ) ### What problem does this PR solve? Fix: Fix the issue of errors when creating datasets. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) Co-authored-by: Jin Hai <haijin.chn@gmail.com> Co-authored-by: Yingfeng <yingfeng.zhang@gmail.com>	2026-03-25 21:37:58 +08:00
balibabu	8f45398422	Fix: Using AvatarUpload in a dialog and pressing Enter will cause a file selection pop-up to appear. #13779 (#13780 ) ### What problem does this PR solve? Fix: Using AvatarUpload in a dialog and pressing Enter will cause a file selection pop-up to appear. #13779 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) --------- Co-authored-by: Jin Hai <haijin.chn@gmail.com>	2026-03-25 19:02:51 +08:00
chanx	a2e6daa8d6	Fix: Metadata,chunk,dataset Related bugs (#13760 ) ### What problem does this PR solve? Fix: Metadata,chunk,dataset Related bugs - metadata not show add button #13731 - chunk edit question style - dataset modified chunk method bug ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-03-25 10:47:34 +08:00
balibabu	d84b688b91	Fix: This resolves the issue where selecting a knowledge base in chat could not differentiate between different users. (#13764 ) ### What problem does this PR solve? Fix: This resolves the issue where selecting a knowledge base in chat could not differentiate between different users. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-03-24 20:07:06 +08:00
Yongteng Lei	3d10e2075c	Refa: files /file API to RESTFul style (#13741 ) ### What problem does this PR solve? Files /file API to RESTFul style. ### Type of change - [x] Documentation Update - [x] Refactoring --------- Co-authored-by: writinwaters <cai.keith@gmail.com> Co-authored-by: Liu An <asiro@qq.com>	2026-03-24 19:24:41 +08:00
Baki Burak Öğün	1319a25416	feat: complete Turkish localization (#13749 ) ## Summary Complete and improve the existing Turkish (tr.ts) localization to fully match the English (en.ts) reference file. ## Changes - Translate 6 English model tips in the setting section (chatModelTip, embeddingModelTip, img2txtModelTip, sequence2txtModelTip, rerankModelTip, ttsModelTip) to Turkish - Expand all 13 truncated parser HTML descriptions (book, laws, manual, naive, paper, presentation, qa, resume, table, picture, one, knowledgeGraph, tag) to match the full en.ts structure - Expand shortened tooltips across knowledgeDetails, knowledgeConfiguration, chat, and setting sections (~40+ tooltips expanded) - Add missing translation details for data source connectors (SeaFile, Jira, Gmail, Moodle, Dropbox, Google Drive, etc.) ## Impact - 182 insertions, 71 deletions in web/src/locales/tr.ts - No structural changes, only translation content improvements - All application terminology maintained consistently Co-authored-by: bakiburakogun <bakiburakogun@users.noreply.github.com> Co-authored-by: Liu An <asiro@qq.com>	2026-03-24 18:58:58 +08:00
balibabu	48c60b8ce5	Fix: Fixed the issue where agent log time could not be selected. (#13756 ) ### What problem does this PR solve? Fix: Fixed the issue where agent log time could not be selected. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-03-24 16:02:26 +08:00
Yongteng Lei	dd839f30e8	Fix: code supports matplotlib (#13724 ) ### What problem does this PR solve? Code as "final" node: ![img_v3_02vs_aece4caf-8403-4939-9e68-9845a22c2cfg](https://github.com/user-attachments/assets/9d87b8df-da6b-401c-bf6d-8b807fe92c22) Code as "mid" node: ![img_v3_02vv_f74f331f-d755-44ab-a18c-96fff8cbd34g](https://github.com/user-attachments/assets/c94ef3f9-2a6c-47cb-9d2b-19703d2752e4) ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2026-03-20 20:32:00 +08:00
balibabu	0507463f4e	Fix: The retrieval_test interface is continuously requested when the user enters a question. #13719 (#13720 ) ### What problem does this PR solve? Fix: The retrieval_test interface is continuously requested when the user enters a question. #13719 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-03-20 15:46:41 +08:00
tmimmanuel	13d0df1562	feat: add Perplexity contextualized embeddings API as a new model provider (#13709 ) ### What problem does this PR solve? Adds Perplexity contextualized embeddings API as a new model provider, as requested in #13610. - `PerplexityEmbed` provider in `rag/llm/embedding_model.py` supporting both standard (`/v1/embeddings`) and contextualized (`/v1/contextualizedembeddings`) endpoints - All 4 Perplexity embedding models registered in `conf/llm_factories.json`: `pplx-embed-v1-0.6b`, `pplx-embed-v1-4b`, `pplx-embed-context-v1-0.6b`, `pplx-embed-context-v1-4b` - Frontend entries (enum, icon mapping, API key URL) in `web/src/constants/llm.ts` - Updated `docs/guides/models/supported_models.mdx` - 22 unit tests in `test/unit_test/rag/llm/test_perplexity_embed.py` Perplexity's API returns `base64_int8` encoded embeddings (not OpenAI-compatible), so this uses a custom `requests`-based implementation. Contextualized vs standard model is auto-detected from the model name. Closes #13610 ### Type of change - [x] New Feature (non-breaking change which adds functionality) - [x] Documentation Update	2026-03-20 10:47:48 +08:00
Zhicheng Wu	456b1bbf66	fix: row selection leaks across pages in dataset and file list tables (#13668 ) ### What problem does this PR solve? When using pagination in the Dataset file list or File Manager, selecting row N on page 1 would incorrectly cause row N on page 2 (and subsequent pages) to also appear selected. This is a state pollution bug. ### Root Cause TanStack React Table defaults to using array indices (0, 1, 2...) as `rowSelection` keys. With server-side (manual) pagination, each page's rows start from index 0, so a selection like `{2: true}` on page 1 also matches index 2 on every other page. ### Fix - Added `getRowId: (row) => row.id` to `useReactTable` in both `DatasetTable` and `FilesTable`, so selection state is keyed by unique document/file IDs instead of positional indices. - Updated the `useSelectedIds` helper to support ID-based selection keys while maintaining backward compatibility with index-based keys. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) ### Files Changed \| File \| Change \| \|------\|--------\| \| `web/src/pages/dataset/dataset/dataset-table.tsx` \| Added `getRowId` to table config \| \| `web/src/pages/files/files-table.tsx` \| Added `getRowId` to table config \| \| `web/src/hooks/logic-hooks/use-row-selection.ts` \| Updated `useSelectedIds` to handle ID-based selection \|	2026-03-19 21:08:09 +08:00
writinwaters	b5e0b37d69	Refact: Renamed 'Agent flow' to 'Workflow' (#13705 ) ### What problem does this PR solve? 'Agent flow' rebranded. ### Type of change - [x] Refactoring	2026-03-19 20:17:25 +08:00
Lynn	4bb1acaa5b	Refactor: dataset / kb API to RESTFul style (#13690 ) ### What problem does this PR solve? 1. Split dataset api to gateway and service, and modify web UI to use restful http api. 2. Old KB releated APIs are commented. ### Type of change - [x] Refactoring --------- Co-authored-by: Yingfeng <yingfeng.zhang@gmail.com>	2026-03-19 14:41:36 +08:00
Mustafa YILDIZ	e4d8cdaff3	feat: add Turkish language support (#13670 ) ### What problem does this PR solve? RAGFlow had no Turkish language support. This PR adds Turkish (tr) locale translations to the UI. ### Type of change - [x] New Feature (non-breaking change which adds functionality) ### What problem does this PR solve? Co-authored-by: Mustafa YILDIZ <mustafa.yildiz@cilek.com>	2026-03-18 21:09:32 +08:00
Yongteng Lei	53e395ca2e	Fix: cannot debug invoke component (#13649 ) ### What problem does this PR solve? Cannot debug invoke component. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-03-18 14:22:13 +08:00
balibabu	6cae364ac2	Feat: Export Agent Logs. (#13658 ) ### What problem does this PR solve? Feat: Export Agent Logs. ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Co-authored-by: balibabu <assassin_cike@163.com>	2026-03-17 18:51:26 +08:00
balibabu	fc4f1e2488	Fix: The dataset description should not be a required field. (#13655 ) ### What problem does this PR solve? Fix: The dataset description should not be a required field. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-03-17 18:51:18 +08:00
Idriss Sbaaoui	ad6bdb5bfe	Fix: left preview containment regression for file previews (#13652 ) ### What problem does this PR solve? Fix left preview containment regression for file previews ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-03-17 17:21:13 +08:00
balibabu	549833b8a4	Fix: Fixed an issue where agent template titles were not displayed in Chinese mode. (#13647 ) ### What problem does this PR solve? Fix: Fixed an issue where agent template titles were not displayed in Chinese mode. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-03-17 15:56:57 +08:00
Jin Hai	986dcf1cc8	Revert "Refactor: dataset / kb API to RESTFul style" (#13646 ) Reverts infiniflow/ragflow#13619	2026-03-17 12:09:48 +08:00
balibabu	fdf2d84ffc	Fix: Fixed an issue where the agent could not publish. (#13644 ) ### What problem does this PR solve? Fix: Fixed an issue where the agent could not publish. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-03-17 11:44:01 +08:00
Lynn	1db5409d82	Refactor: dataset / kb API to RESTFul style (#13619 ) ### What problem does this PR solve? 1. Split dataset api to gateway and service, and modify web UI to use restful http api. 2. Old KB releated APIs are commented. ### Type of change - [x] Refactoring	2026-03-16 22:51:34 +08:00
chanx	5403f142ae	Feat: Add chunk also supports uploading image. (#13628 ) ### What problem does this PR solve? Feat: Add chunk also supports uploading image. ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2026-03-16 20:15:49 +08:00
Idriss Sbaaoui	d5ed179d15	Playwright : add test ids and chat test (#13432 ) ### What problem does this PR solve? ### Type of change - [x] Other	2026-03-16 16:39:05 +08:00

1 2 3 4 5 ...

1803 Commits