ragflow

mirror of https://github.com/infiniflow/ragflow.git synced 2026-03-27 17:29:39 +08:00

Author	SHA1	Message	Date
NeedmeFordev	840cc8fbe9	fix(asana): use project memberships endpoint for project IDs in connector (#13746 ) ### What problem does this PR solve? Fixes a bug in the Asana connector where providing `Project IDs` caused sync to fail with: `project_membership: Not a recognized ID: <PROJECT_GID>` Root cause: the connector called `get_project_membership(project_gid)`, but that API expects a project membership gid, not a project gid. This PR switches to the correct project-scoped API and adds regression tests. Fixes: [#13669](https://github.com/infiniflow/ragflow/issues/13669) ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) ### Changes made - Updated `common/data_source/asana_connector.py`: - Replaced `get_project_membership(pid, ...)` with `get_project_memberships_for_project(pid, ...)` - Trimmed and filtered `asana_project_ids` parsing to avoid empty/whitespace IDs - Normalized `asana_team_id` by trimming whitespace - Used safer access for membership email extraction (`m.get("user")`) - Added `test/unit_test/common/test_asana_connector.py`: - Verifies the correct project-membership API method is called - Verifies empty `project_ids` path returns workspace emails - Verifies project/team input normalization behavior ### Compatibility / risk - Non-breaking bug fix - No API contract changes - Existing behavior for empty `Project IDs` remains unchanged	2026-03-24 20:21:31 +08:00
qinling0210	7c8927c4fb	Implement GetChunk() in Infinity in GO (#13758 ) ### What problem does this PR solve? Implement GetChunk() in Infinity in GO Add cli: GET CHUNK 'XXX'; LIST CHUNKS OF DOCUMENT 'XXX'; ### Type of change - [x] Refactoring	2026-03-24 20:10:21 +08:00
Jin Hai	b308cd3a02	Update go cli (#13717 ) ### What problem does this PR solve? Go cli ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-03-24 20:08:36 +08:00
balibabu	d84b688b91	Fix: This resolves the issue where selecting a knowledge base in chat could not differentiate between different users. (#13764 ) ### What problem does this PR solve? Fix: This resolves the issue where selecting a knowledge base in chat could not differentiate between different users. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-03-24 20:07:06 +08:00
Yongteng Lei	3d10e2075c	Refa: files /file API to RESTFul style (#13741 ) ### What problem does this PR solve? Files /file API to RESTFul style. ### Type of change - [x] Documentation Update - [x] Refactoring --------- Co-authored-by: writinwaters <cai.keith@gmail.com> Co-authored-by: Liu An <asiro@qq.com>	2026-03-24 19:24:41 +08:00
Idriss Sbaaoui	10a36d6443	Tests : add tests for dataset settings (#13747 ) ### What problem does this PR solve? add tests ### Type of change - [x] Other (please describe): test Co-authored-by: Liu An <asiro@qq.com>	2026-03-24 19:04:04 +08:00
Yongteng Lei	1b1f1bc69f	Fix: minor fix of refacotr excel parser use lazy image loader (#13752 ) ### What problem does this PR solve? Minor fix. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) --------- Co-authored-by: Hu Di <812791840@qq.com>	2026-03-24 19:03:54 +08:00
Baki Burak Öğün	8a4da41406	docs: add Turkish README translation (README_tr.md) (#13750 ) ## Summary Add a complete Turkish translation of the README and include a Turkish language badge across all existing README files. ## Changes - New file: `README_tr.md` - Full Turkish translation of README.md, covering all sections (What is RAGFlow, Demo, Latest Updates, Key Features, System Architecture, Get Started, Configurations, Docker Image, Development from Source, Documentation, Roadmap, Community, Contributing) - Updated 9 existing README files (README.md, README_zh.md, README_tzh.md, README_ja.md, README_ko.md, README_id.md, README_pt_br.md, README_fr.md, README_ar.md) to include the Turkish language badge in the language selector ## Impact - 10 files changed, 417 insertions - Follows the same structure and conventions as other language-specific README files (README_ja.md, README_ko.md, etc.) - Turkish badge uses the same styling pattern (highlighted with DBEDFA in README_tr.md, standard DFE0E5 in others) --------- Co-authored-by: bakiburakogun <bakiburakogun@users.noreply.github.com>	2026-03-24 19:00:48 +08:00
Baki Burak Öğün	1319a25416	feat: complete Turkish localization (#13749 ) ## Summary Complete and improve the existing Turkish (tr.ts) localization to fully match the English (en.ts) reference file. ## Changes - Translate 6 English model tips in the setting section (chatModelTip, embeddingModelTip, img2txtModelTip, sequence2txtModelTip, rerankModelTip, ttsModelTip) to Turkish - Expand all 13 truncated parser HTML descriptions (book, laws, manual, naive, paper, presentation, qa, resume, table, picture, one, knowledgeGraph, tag) to match the full en.ts structure - Expand shortened tooltips across knowledgeDetails, knowledgeConfiguration, chat, and setting sections (~40+ tooltips expanded) - Add missing translation details for data source connectors (SeaFile, Jira, Gmail, Moodle, Dropbox, Google Drive, etc.) ## Impact - 182 insertions, 71 deletions in web/src/locales/tr.ts - No structural changes, only translation content improvements - All application terminology maintained consistently Co-authored-by: bakiburakogun <bakiburakogun@users.noreply.github.com> Co-authored-by: Liu An <asiro@qq.com>	2026-03-24 18:58:58 +08:00
Jin Hai	f59d96f879	Remove rust/cargo install in docker (#13739 ) ### What problem does this PR solve? As title ### Type of change - [x] Refactoring Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-03-24 17:04:57 +08:00
balibabu	48c60b8ce5	Fix: Fixed the issue where agent log time could not be selected. (#13756 ) ### What problem does this PR solve? Fix: Fixed the issue where agent log time could not be selected. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-03-24 16:02:26 +08:00
Jin Hai	9eb11bf65d	Fix ping response (#13757 ) ### What problem does this PR solve? As title to be compatible with go server ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-03-24 15:15:21 +08:00
Stephen Hu	d32967eda8	refactor: let excel use lazy image loader (#13558 ) ### What problem does this PR solve? let excel use lazy image loader ### Type of change - [x] Refactoring --------- Co-authored-by: Yingfeng <yingfeng.zhang@gmail.com>	2026-03-23 21:24:40 +08:00
Magicbook1108	f991cd362e	Fix: type check in resume parsing method (#13740 ) ### What problem does this PR solve? Fix: type check in resume parsing method ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-03-23 21:19:09 +08:00
Idriss Sbaaoui	df2cc32f51	Fix: dataset settings save (#13745 ) ### What problem does this PR solve? Saving dataset settings failed with validation error 101 (Extra inputs are not permitted) ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-03-23 17:46:41 +08:00
qinling0210	ac542da505	Fix tokenizer in cpp (#13735 ) ### What problem does this PR solve? Tokenzier in Infinity is modified in https://github.com/infiniflow/infinity/pull/3330, sync the code change to cpp files in ragflow ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-03-23 15:40:35 +08:00
qinling0210	7b86f577be	Implement metadata search in Infinity in GO (#13706 ) ### What problem does this PR solve? Add cli LIST DOCUMENTS OF DATASET quoted_string ";" LIST METADATA OF DATASETS quoted_string ("," quoted_string)* ";" LIST METADATA SUMMARY OF DATASET quoted_string (DOCUMENTS quoted_string ("," quoted_string)*)? ";" ### Type of change - [x] Refactoring	2026-03-21 18:10:00 +08:00
Lynn	db57155b30	Fix: get user_id from variables (#13716 ) ### What problem does this PR solve? Get user_id from canvas variable when input a {} pattern value. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-03-20 23:39:34 +08:00
Yongteng Lei	dd839f30e8	Fix: code supports matplotlib (#13724 ) ### What problem does this PR solve? Code as "final" node: ![img_v3_02vs_aece4caf-8403-4939-9e68-9845a22c2cfg](https://github.com/user-attachments/assets/9d87b8df-da6b-401c-bf6d-8b807fe92c22) Code as "mid" node: ![img_v3_02vv_f74f331f-d755-44ab-a18c-96fff8cbd34g](https://github.com/user-attachments/assets/c94ef3f9-2a6c-47cb-9d2b-19703d2752e4) ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2026-03-20 20:32:00 +08:00
balibabu	0507463f4e	Fix: The retrieval_test interface is continuously requested when the user enters a question. #13719 (#13720 ) ### What problem does this PR solve? Fix: The retrieval_test interface is continuously requested when the user enters a question. #13719 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-03-20 15:46:41 +08:00
Jin Hai	9ce766192f	Init storage engine (#13707 ) ### What problem does this PR solve? 1. Init Minio / S3 / OSS 2. Fix minio / s3 / oss config ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-03-20 13:15:41 +08:00
Jin Hai	04a60a41e0	Allow default admin user login ragflow user of go server (#13715 ) ### What problem does this PR solve? 1. Allow admin@ragflow.io login go ragflow server 2. Fix go server start error. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-03-20 12:02:44 +08:00
tmimmanuel	13d0df1562	feat: add Perplexity contextualized embeddings API as a new model provider (#13709 ) ### What problem does this PR solve? Adds Perplexity contextualized embeddings API as a new model provider, as requested in #13610. - `PerplexityEmbed` provider in `rag/llm/embedding_model.py` supporting both standard (`/v1/embeddings`) and contextualized (`/v1/contextualizedembeddings`) endpoints - All 4 Perplexity embedding models registered in `conf/llm_factories.json`: `pplx-embed-v1-0.6b`, `pplx-embed-v1-4b`, `pplx-embed-context-v1-0.6b`, `pplx-embed-context-v1-4b` - Frontend entries (enum, icon mapping, API key URL) in `web/src/constants/llm.ts` - Updated `docs/guides/models/supported_models.mdx` - 22 unit tests in `test/unit_test/rag/llm/test_perplexity_embed.py` Perplexity's API returns `base64_int8` encoded embeddings (not OpenAI-compatible), so this uses a custom `requests`-based implementation. Contextualized vs standard model is auto-detected from the model name. Closes #13610 ### Type of change - [x] New Feature (non-breaking change which adds functionality) - [x] Documentation Update	2026-03-20 10:47:48 +08:00
Zhicheng Wu	456b1bbf66	fix: row selection leaks across pages in dataset and file list tables (#13668 ) ### What problem does this PR solve? When using pagination in the Dataset file list or File Manager, selecting row N on page 1 would incorrectly cause row N on page 2 (and subsequent pages) to also appear selected. This is a state pollution bug. ### Root Cause TanStack React Table defaults to using array indices (0, 1, 2...) as `rowSelection` keys. With server-side (manual) pagination, each page's rows start from index 0, so a selection like `{2: true}` on page 1 also matches index 2 on every other page. ### Fix - Added `getRowId: (row) => row.id` to `useReactTable` in both `DatasetTable` and `FilesTable`, so selection state is keyed by unique document/file IDs instead of positional indices. - Updated the `useSelectedIds` helper to support ID-based selection keys while maintaining backward compatibility with index-based keys. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) ### Files Changed \| File \| Change \| \|------\|--------\| \| `web/src/pages/dataset/dataset/dataset-table.tsx` \| Added `getRowId` to table config \| \| `web/src/pages/files/files-table.tsx` \| Added `getRowId` to table config \| \| `web/src/hooks/logic-hooks/use-row-selection.ts` \| Updated `useSelectedIds` to handle ID-based selection \|	2026-03-19 21:08:09 +08:00
chanx	e1dbfb8a9c	fix(dao): Remove unnecessary status filter conditions in user queries (#13698 ) ### What problem does this PR solve? Fix: Enhanced the user deletion function to return detailed deletion information. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-03-19 21:05:15 +08:00
Magicbook1108	cfe6ea6f56	Feat: CREATE / DELETE / LIST dataset api in Go (#13695 ) ### What problem does this PR solve? Feat: CREATE / DELETE / LIST dataset api in Go ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Co-authored-by: Lynn <lynn_inf@hotmail.com> Co-authored-by: Yingfeng <yingfeng.zhang@gmail.com>	2026-03-19 20:48:32 +08:00
Lynn	f06e332c44	Fix: allow on (#13704 ) ### What problem does this PR solve? Allow input on/ON as status. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-03-19 20:41:02 +08:00
writinwaters	b5e0b37d69	Refact: Renamed 'Agent flow' to 'Workflow' (#13705 ) ### What problem does this PR solve? 'Agent flow' rebranded. ### Type of change - [x] Refactoring	2026-03-19 20:17:25 +08:00
Jin Hai	8d50ee632d	Add environments reading (#13701 ) ### What problem does this PR solve? environment variable > config file ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-03-19 18:50:28 +08:00
yH	757d8d42dd	Fix: use configured OrderByExpr in _community_retrieval_ (#13683 ) The `odr` variable was configured with `desc("weight_flt")` but a new empty `OrderByExpr()` was passed to `dataStore.search()` instead, causing the descending sort to have no effect. ### What problem does this PR solve? In `_community_retrieval_`, the configured `OrderByExpr` with `desc("weight_flt")` was discarded — a new empty `OrderByExpr()` was passed to `dataStore.search()` instead, so community reports were never sorted by weight. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-03-19 17:55:40 +08:00
Lynn	e12147f5b9	Fix: admin client (#13699 ) ### What problem does this PR solve? Define a crypt function in admin directory, remove import from api.utils. And move requests-toolbelt to dependency. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-03-19 17:06:54 +08:00
Lynn	4bb1acaa5b	Refactor: dataset / kb API to RESTFul style (#13690 ) ### What problem does this PR solve? 1. Split dataset api to gateway and service, and modify web UI to use restful http api. 2. Old KB releated APIs are commented. ### Type of change - [x] Refactoring --------- Co-authored-by: Yingfeng <yingfeng.zhang@gmail.com>	2026-03-19 14:41:36 +08:00
Idriss Sbaaoui	7827f0fce5	fix : empty mind map (#13693 ) ### What problem does this PR solve? Fix graphrag extractor chat response parsing and skip truncated cache values ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-03-19 13:53:06 +08:00
Jin Hai	7ebe1d2722	Fix docker building (#13681 ) ### What problem does this PR solve? 1. Refactor go server log 2. Update docker building, since nginx config should be set according to the deployment. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) --------- Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-03-19 10:25:35 +08:00
NeedmeFordev	c3f79dbcb0	fix(jira): prevent missed incremental updates after issue edits (#13674 ) ### What problem does this PR solve? Fixes [#13505](https://github.com/infiniflow/ragflow/issues/13505): Jira incremental sync could miss updated issues after initial sync, especially near time boundaries. Root cause: - Jira JQL uses minute-level precision for `updated` filters. - Incremental windows had no overlap buffer, so boundary updates could be skipped. - Sync log cursor tracking used a backward-facing update for `poll_range_start`. - Existing-doc updates in `upload_document` lacked a KB ownership guard for doc-id collisions. What changed: - Added Jira incremental overlap buffer (`time_buffer_seconds`, defaulting to `JIRA_SYNC_TIME_BUFFER_SECONDS`) when building JQL lower-bound time. - Preserved second-level post-filtering to avoid duplicate reprocessing while still catching boundary updates. - Improved Jira sync logging to include start/end window and overlap configuration. - Updated sync cursor tracking in `increase_docs` to keep `poll_range_start` moving forward with max update time. - Added KB ID safety check before updating existing document records in `upload_document`. Verification performed: - Python syntax compile checks passed for modified files. - Manual verification flow: 1. Run full Jira sync. 2. Edit an already-indexed Jira issue. 3. Run next incremental sync. 4. Confirm updated content is re-ingested into KB. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) --------- Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com>	2026-03-18 23:31:05 +08:00
Daniil Sivak	dee68c571b	Feat: support variable interpolation in headers (#13680 ) Closes #13277 ### What problem does this PR solve? Adds `{variable_name}` (and `{component@variable}`) interpolation support to HTTP header values in the `Invoke` component, matching the existing URL interpolation behavior. ### Type of change - [x] New Feature (non-breaking change which adds functionality) <img width="1280" height="867" alt="image" src="https://github.com/user-attachments/assets/8ab7b4e9-7cc0-4a7f-8a5f-f838a15a5fda" /> --------- Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com>	2026-03-18 22:38:20 +08:00
Mustafa YILDIZ	e4d8cdaff3	feat: add Turkish language support (#13670 ) ### What problem does this PR solve? RAGFlow had no Turkish language support. This PR adds Turkish (tr) locale translations to the UI. ### Type of change - [x] New Feature (non-breaking change which adds functionality) ### What problem does this PR solve? Co-authored-by: Mustafa YILDIZ <mustafa.yildiz@cilek.com>	2026-03-18 21:09:32 +08:00
writinwaters	bbd0cd80e4	Docs: Updated Add Google Drive as data source (#13684 ) ### What problem does this PR solve? Gave an editorial pass to the Add Google Drive document. ### Type of change - [x] Documentation Update	2026-03-18 21:05:25 +08:00
Octopus	f171554c0a	feat: upgrade MiniMax default model to M2.7 (#13676 ) ## Summary Upgrade MiniMax model configuration to include the latest M2.7 model. ## Changes - Add `MiniMax-M2.7` and `MiniMax-M2.7-highspeed` to the model selection list in `conf/llm_factories.json` - Place M2.7 models at the top of the list as the recommended default - Retain all previous models (M2.5, M2.5-highspeed, M2.1, M2) as available alternatives ## Why MiniMax-M2.7 is the latest flagship model with enhanced reasoning and coding capabilities. This update ensures RAGFlow users can access the newest model while maintaining backward compatibility with existing configurations. ## Testing - JSON config validated (well-formed) - No existing MiniMax-specific unit tests affected - Model entries follow the same structure as existing entries Co-authored-by: PR Bot <pr-bot@minimaxi.com>	2026-03-18 19:20:10 +08:00
Idriss Sbaaoui	9070408b04	Fix : model-specific handling (#13675 ) ### What problem does this PR solve? add a handler for gpt 5 models that do not accept parameters by dropping them, and centralize all models with specific paramter handling function into a single helper. solves issue #13639 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) - [x] Refactoring	2026-03-18 17:28:20 +08:00
Yongteng Lei	53e395ca2e	Fix: cannot debug invoke component (#13649 ) ### What problem does this PR solve? Cannot debug invoke component. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-03-18 14:22:13 +08:00
Jin Hai	74866371ef	Fix compatiblity issue (#13667 ) ### What problem does this PR solve? 1. Change go admin server port from 9385 to 9383 to avoid conflicts 2. Start go server after python servers are started completely, in entrypoint.sh 3. Fix some database migration issue 4. Add more API routes in web to compliant with EE. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-03-18 11:51:03 +08:00
Daniil Sivak	60ad32a0c2	Feat: support epub parsing (#13650 ) Closes #1398 ### What problem does this PR solve? Adds native support for EPUB files. EPUB content is extracted in spine (reading) order and parsed using the existing HTML parser. No new dependencies required. ### Type of change - [x] New Feature (non-breaking change which adds functionality) To check this parser manually: ```python uv run --python 3.12 python -c " from deepdoc.parser import EpubParser with open('$HOME/some_epub_book.epub', 'rb') as f: data = f.read() sections = EpubParser()(None, binary=data, chunk_token_num=512) print(f'Got {len(sections)} sections') for i, s in enumerate(sections[:5]): print(f'\n--- Section {i} ---') print(s[:200]) " ```	2026-03-17 20:14:06 +08:00
Idriss Sbaaoui	1399c60164	fix builtin model fail when parsing (#13657 ) ### What problem does this PR solve? using builtin model when parsing gave an error because it expects fid==builtin. split_model_name_and_factory returns id=None. pr allows the model to be accepted wheter with or without @Builtin ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-03-17 19:38:54 +08:00
balibabu	6cae364ac2	Feat: Export Agent Logs. (#13658 ) ### What problem does this PR solve? Feat: Export Agent Logs. ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Co-authored-by: balibabu <assassin_cike@163.com>	2026-03-17 18:51:26 +08:00
balibabu	fc4f1e2488	Fix: The dataset description should not be a required field. (#13655 ) ### What problem does this PR solve? Fix: The dataset description should not be a required field. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-03-17 18:51:18 +08:00
Idriss Sbaaoui	ad6bdb5bfe	Fix: left preview containment regression for file previews (#13652 ) ### What problem does this PR solve? Fix left preview containment regression for file previews ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-03-17 17:21:13 +08:00
Yongteng Lei	ca6c3218c3	Refa: follow-up expose agent structured outputs in non-stream completions (#13524 ) ### What problem does this PR solve? Follow-up expose agent structured outputs in non-stream completions #13389. ### Type of change - [x] Documentation Update - [x] Refactoring --------- Co-authored-by: writinwaters <cai.keith@gmail.com>	2026-03-17 17:11:27 +08:00
qinling0210	ca182dc188	Implement Search() in Infinity in GO (#13645 ) ### What problem does this PR solve? Implement Search() in Infinity in GO. The function can handle the following request. "search '曹操' on datasets 'infinity'" "search '常胜将军' on datasets 'infinity'" "search '卓越儒雅' on datasets 'infinity'" "search '辅佐刘禅北伐中原' on datasets 'infinity'" The output is exactly the same as request to python Search() ### Type of change - [ ] New Feature (non-breaking change which adds functionality)	2026-03-17 16:45:45 +08:00
balibabu	549833b8a4	Fix: Fixed an issue where agent template titles were not displayed in Chinese mode. (#13647 ) ### What problem does this PR solve? Fix: Fixed an issue where agent template titles were not displayed in Chinese mode. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-03-17 15:56:57 +08:00

1 2 3 4 5 ...

5593 Commits