ragflow

mirror of https://github.com/infiniflow/ragflow.git synced 2026-04-26 21:45:42 +08:00

Author	SHA1	Message	Date
Jim Smith	7029b8ca81	Fix: Make time_utils tests timezone-independent (#13100 ) ## Summary - Replace hardcoded CST (UTC+8) expected values in `test_time_utils.py` with dynamically computed local-time expectations using `time.localtime()` and `time.mktime()` - Tests previously failed in any timezone other than UTC+8; they now pass regardless of the system's local timezone ## Test plan - [x] `uv run pytest test/unit_test/ -v` — 317 passed, 25 skipped 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-authored-by: Jim Smith <jhsmith0@me.com> Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-02-11 10:51:53 +08:00
Ahmad Intisar	99ed8e759d	Fix: Correct Gemini embedding model name in llm_factories.json (#13051 ) ## Problem RAGFlow was using incorrect model names for Google Gemini embeddings: - `embedding-001` (missing `gemini-` prefix) - `text-embedding-004` (OpenAI model name, not Gemini) This caused API errors when users tried to use Gemini embeddings. ## Solution - Updated `conf/llm_factories.json` to use the correct model name: `gemini-embedding-001` - Removed the incorrect `text-embedding-004` entry - Added volume mount in `docker-compose.yml` to ensure config changes persist ## Testing Tested with a valid Gemini API key and confirmed embeddings now work correctly. ## Changes - Modified `conf/llm_factories.json` - Modified `docker/docker-compose.yml` --------- Co-authored-by: Ahmad Intisar <ahmadintisar@Ahmads-MacBook-M4-Pro.local> Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	2026-02-11 09:49:48 +08:00
Magicbook1108	109441628b	Fix: upload image files (#13071 ) ### What problem does this PR solve? Fix: upload image files ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-02-11 09:47:33 +08:00
writinwaters	630f05b8a1	Docs: Added v0.24.0 release notes (#13096 ) ### What problem does this PR solve? Added v0.24.0 release notes. ### Type of change - [x] Documentation Update	2026-02-10 17:38:27 +08:00
Liu An	392ec99651	Docs: Update version references to v0.24.0 in READMEs and docs (#13095 ) ### What problem does this PR solve? - Update version tags in README files (including translations) from v0.23.1 to v0.24.0 - Modify Docker image references and documentation to reflect new version - Update version badges and image descriptions - Maintain consistency across all language variants of README files ### Type of change - [x] Documentation Update v0.24.0	2026-02-10 17:24:03 +08:00
Lynn	d938b47877	Fix: judge table name prefix before migrate (#13094 ) ### What problem does this PR solve? Judge table created with current infinity mapping before migrate db. #13089 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-02-10 17:05:34 +08:00
akie	6f785e06a4	Fix issue #13084 (#13088 ) When match_expressions contains coroutine objects (from GraphRAG's Dealer.get_vector()), the code cannot identify this type because it only checks for MatchTextExpr, MatchDenseExpr, or FusionExpr. As a result: score_func remains initialized as an empty string "" This empty string is appended to the output list The output list is passed to Infinity SDK's table_instance.output() method Infinity's SQL parser (via sqlglot) fails to parse the empty string, throwing a ParseError	2026-02-10 17:04:45 +08:00
writinwaters	4341d81e29	Refact: Updated UI tips. (#13093 ) ### What problem does this PR solve? Updated UI tips. ### Type of change - [x] Refactoring	2026-02-10 16:25:56 +08:00
Yongteng Lei	48591cb1e7	Refa: boost OpenAI-compatible reranker UX (#13087 ) ### What problem does this PR solve? boost OpenAI-compatible reranker UX. ### Type of change - [x] Refactoring	2026-02-10 16:13:21 +08:00
chanx	586a9e05a7	Fix: Memory log style (#13090 ) ### What problem does this PR solve? Fix: Memory log style ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-02-10 16:12:59 +08:00
balibabu	126ec85ef6	Feat: Hide log button (#13085 ) ### What problem does this PR solve? Feat: Hide log button ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2026-02-10 14:05:17 +08:00
chanx	4186821de8	Fix: Bugs fixed (#13086 ) ### What problem does this PR solve? Fix: Bugs fixed - metadata icon error - search page's image not display ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-02-10 14:04:50 +08:00
balibabu	141157f529	Feat: Translation page index. (#13083 ) ### What problem does this PR solve? Feat: Translation page index. ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2026-02-10 10:37:38 +08:00
writinwaters	9c39ac11c6	Docs: Replaced TOC Enhance with Page Index. (#13075 ) ### What problem does this PR solve? Replaced TOC Enhance with Page Index. ### Type of change - [x] Documentation Update	2026-02-09 20:13:34 +08:00
balibabu	db37804f10	Feat: Add Explore page (#13043 ) ### What problem does this PR solve? Feat: Add Explore page ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2026-02-09 19:53:51 +08:00
chanx	8ad7339448	Fix: Add authentication validation to the document API interface for embedded pages. (#13078 ) ### What problem does this PR solve? Fix: Add authentication validation to the document API interface for embedded pages and modify the document display styles. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-02-09 19:53:24 +08:00
Kevin Hu	9bc16d8df2	Fix: agent files issue, (#13067 ) ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-02-09 19:52:52 +08:00
Neel Harsola	a2dda8fb70	Fix: enable chat input resizing (#12998 ) ## Summary - add resizable support to shared textarea component - enable vertical resizing for chat inputs in chat and share surfaces - preserve autosize behavior while honoring manual resize height ## Test plan - not run (not requested) Fixes #12803 --------- Co-authored-by: Cursor <cursoragent@cursor.com>	2026-02-09 19:33:05 +08:00
qinling0210	4bc622b409	Fix parameter of calling self.dataStore.get() and warning info during parser (#13068 ) ### What problem does this PR solve? Fix parameter of calling self.dataStore.get() and warning info during parser https://github.com/infiniflow/ragflow/issues/13036 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-02-09 17:56:59 +08:00
Magicbook1108	25a32c198d	Fix: gemini model names (#13073 ) ### What problem does this PR solve? Fix: gemini model names #13053 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-02-09 17:51:22 +08:00
6ba3i	fabbfcab90	Fix: failing p3 test for SDK/HTTP APIs (#13062 ) ### What problem does this PR solve? Adjust highlight parsing, add row-count SQL override, tweak retrieval thresholding, and update tests with engine-aware skips/utilities. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-02-09 14:56:10 +08:00
Yingfeng	ba95167e13	Clean directories (#13061 ) ### Type of change - [x] Documentation Update	2026-02-09 12:08:12 +08:00
Stephen Hu	2ee39f64fe	Refactor: improve ppt shape order logic (#13054 ) ### What problem does this PR solve? improve ppt shape order logic ### Type of change - [x] Refactoring	2026-02-09 11:59:24 +08:00
eviaaaaa	0b55d1e860	fix: remove 10-item display limit in Agent Canvas configuration tables (#13049 ) ## Description This PR fixes an issue where the input and variable configuration tables in the Agent Canvas (specifically for Begin, UserFillUp, and Invoke nodes) were truncated at 10 items. Root Cause: The tables utilized `@tanstack/react-table` with `getPaginationRowModel()` enabled. Since the default page size is 10 and no pagination UI controls were implemented, users could not access items beyond the 10th row. Solution: Removed `getPaginationRowModel` from the table configurations. These lists (inputs/variables) are typically short, so rendering all items in a single scrollable view is the intended behavior. * Modified `query-table.tsx` * Modified `variable-table.tsx` ## How to verify 1. Create a Begin, UserFillUp, or Invoke node in the Agent Canvas. 2. Add more than 10 input items or variables. 3. Verify that all items are visible in the list and not truncated at the 10th item. ## What kind of change does this PR introduce? * [x] Bugfix	2026-02-09 10:43:50 +08:00
Kevin Hu	e51a40fdfc	Fix: launch an agent. (#13039 ) ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-02-09 10:08:36 +08:00
chanx	8217ccced8	Fix: whyDidYouRender error (#13040 ) ### What problem does this PR solve? Fix: whyDidYouRender error ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-02-09 09:57:52 +08:00
Clint-chan	38289084a8	Chore/upgrade dashscope to 1.25.11 (#13007 ) ## Description Upgrade dashscope package to support text-embedding-v4 model. ## Changes - Update dashscope version from 1.20.11 to 1.25.11 in pyproject.toml ## Reason The text-embedding-v4 model requires dashscope >= 1.25.0 to function properly. This upgrade ensures compatibility with the latest embedding models. Co-authored-by: Clint-chan <Clint-chan@users.noreply.github.com>	2026-02-06 19:06:41 +08:00
Yongteng Lei	279b01a028	Feat: MCP host mode supports STREAMABLE-HTTP endpoint (#13037 ) ### What problem does this PR solve? MCP host mode supports STREAMABLE-HTTP endpoint ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2026-02-06 16:22:43 +08:00
chanx	c130ac0f88	Fix: Lazy loading adds a loading state to the page (#13038 ) ### What problem does this PR solve? Fix: Lazy loading adds a loading state to the page ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-02-06 16:20:52 +08:00
Magicbook1108	301ed76aa4	Fix: task cancel (#13034 ) ### What problem does this PR solve? Fix: task cancel #11745 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-02-06 14:48:24 +08:00
MkDev11	13a6545e48	fix(rdbms): use brackets around field names to preserve distinction after chunking (#13010 ) Fix RDBMS field separation after chunking by wrapping field names in brackets (【field】: value). This ensures fields remain distinguishable even when TxtParser strips newline delimiters during chunk merging. Closes #13001 Co-authored-by: mkdev11 <YOUR_GITHUB_ID+MkDev11@users.noreply.github.com>	2026-02-06 14:44:58 +08:00
yH	5333e764fc	fix: optimize Excel row counting for files with abnormal max_row (#13018 ) ### What problem does this PR solve? Some Excel files have abnormal `max_row` metadata (e.g., `max_row=1,048,534` with only 300 actual data rows). This causes: - `row_number()` returns incorrect count, creating 350+ tasks instead of 1 - `list(ws.rows)` iterates through millions of empty rows, causing system hang This PR uses binary search to find the actual last row with data. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) - [x] Performance Improvement Co-authored-by: Cursor <cursoragent@cursor.com>	2026-02-06 14:43:52 +08:00
chanx	00c392e633	Fix: dataset page enter key to save (#13035 ) ### What problem does this PR solve? Fix dataset page enter key to save Fix the warnings and optimize the code. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-02-06 14:42:16 +08:00
Magicbook1108	4b0d65f089	Fix: correct llm_id for graphrag (#13032 ) ### What problem does this PR solve? Fix: correct llm_id for graphrag #13030 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-02-06 14:05:32 +08:00
Yingfeng	6a17e8cc85	Update basics (#13033 ) ### What problem does this PR solve? ### Type of change - [x] Documentation Update	2026-02-06 13:15:33 +08:00
Clint-chan	a68c56def7	fix: ensure all metadata filters are processed in AND logic (#13019 ) ### What problem does this PR solve? Bug: When a filter key doesn't exist in metas or has no matching values, the filter was skipped entirely, causing AND logic to fail. Example: - Filter 1: meeting_series = '宏观早8点' (matches doc1, doc2, doc3) - Filter 2: date = '2026-03-05' (no matches) - Expected: [] (AND should return empty) - Actual: [doc1, doc2, doc3] (Filter 2 was skipped) Root cause: Old logic iterated metas.items() first, then filters. If a filter's key wasn't in metas, it was never processed. Fix: Iterate filters first, then look up in metas. If key not found, treat as no match (empty result), which correctly applies AND logic. Changes: - Changed loop order from 'for k in metas: for f in filters' to 'for f in filters: if f.key in metas' - Explicitly handle missing keys as empty results ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) Co-authored-by: Clint-chan <Clint-chan@users.noreply.github.com>	2026-02-06 12:57:27 +08:00
LIRUI YU	0586d5148d	fixed vulnerabilities CVE-2025-53859 & CVE-2025-23419 (#13016 ) ### What problem does this PR solve? Fixed vulnerabilities CVE-2025-53859 & CVE-2025-23419 by updating nginx to 1.29.5-1~noble ### Type of change - [X] Bug Fix (non-breaking change which fixes an issue) <img width="709" height="54" alt="image" src="https://github.com/user-attachments/assets/d8c3518f-bca4-4314-a85c-1aed1678f72e" />	2026-02-06 12:55:06 +08:00
Stephen Hu	11703d957d	Refactor: Improve Picture.py resource usage (#13011 ) ### What problem does this PR solve? Improve Picture.py resource usage ### Type of change - [x] Refactoring	2026-02-06 09:50:53 +08:00
Kevin Hu	1262533b74	Feat: support verify to set llm key and boost bigrams. (#12980 ) #12863 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2026-02-05 19:19:09 +08:00
balibabu	bbd8ba64a1	Feat: Control interface documentation directory display and hiding (#13008 ) ### What problem does this PR solve? Feat: Control interface documentation directory display and hiding ### Type of change - [x] New Feature (non-breaking change which adds functionality) Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	2026-02-05 16:59:20 +08:00
Neel Harsola	1a85d2f8de	Fix: prevent streaming message width collapse (#12999 ) ## Summary - keep assistant message containers stretched to available width - avoid width collapse during streaming by allowing flex items to shrink ## Test plan - not run (not requested) Fixes #12985 Made with [Cursor](https://cursor.com) Co-authored-by: Cursor <cursoragent@cursor.com> Co-authored-by: Liu An <asiro@qq.com>	2026-02-05 15:58:55 +08:00
chanx	2a7dca6fc9	Fix: parser bug (#13014 ) …, clicking "Parse" will still ask if you want to clear the chunks of the already parsed files. ### What problem does this PR solve? Fix: After selecting all and then unchecking the already parsed files, clicking "Parse" will still ask if you want to clear the chunks of the already parsed files. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-02-05 15:57:38 +08:00
Magicbook1108	0a08fc7b07	Fix: example code in session.py (#13004 ) ### What problem does this PR solve? Fix: example code in session.py #12950 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) --------- Co-authored-by: Levi <stupse-tipp0j@icloud.com> Co-authored-by: writinwaters <93570324+writinwaters@users.noreply.github.com> Co-authored-by: Liu An <asiro@qq.com>	2026-02-05 15:56:58 +08:00
Magicbook1108	75b2d482e2	Fix: ingestion pipeline (#13012 ) ### What problem does this PR solve? Fix ingestion pipeline Only 1 file is acceptable for ingestion pipeline. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-02-05 15:55:41 +08:00
chanx	89fdb1d498	Feat: Add model verify (#13005 ) ### What problem does this PR solve? Feat: Add model verify ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Co-authored-by: Liu An <asiro@qq.com>	2026-02-05 15:53:20 +08:00
Clint-chan	90b726c988	fix: support date comparison operators (>=, <=, >, <) in metadata filtering (#12982 ) ## Description This PR fixes the issue where date metadata conditions with comparison operators (`>=`, `<=`, `>`, `<`) did not work correctly in the `/api/v1/retrieval` endpoint. ## Problem When using metadata conditions like: ```json { "metadata_condition": { "conditions": [ { "name": "date", "comparison_operator": ">=", "value": "2027-01-13" } ] } } The filtering did not work as expected because: 1. Operators >= and <= were not mapped to internal symbols ≥ and ≤ 2. Date strings like "2027-01-13" failed to parse with ast.literal_eval() 3. Non-standard date formats were incorrectly compared as strings Solution Changes in common/metadata_utils.py: 1. Added operator mapping in convert_conditions(): - >= → ≥ - <= → ≤ - != → ≠ 2. Implemented strict date format detection in meta_filter(): - Only processes dates in YYYY-MM-DD format (10 characters, properly formatted) - When query value is a date, only matches data in the same standard format - Non-standard formats (e.g., "2026年1月13日", "2026-1-22") are skipped 3. Maintained backward compatibility: - Numeric comparisons still work - String comparisons still work - Only affects date-formatted queries Testing All test cases pass (8/8): - ✅ Date >= comparison - ✅ Date > comparison - ✅ Date < comparison - ✅ Date <= comparison - ✅ Date = comparison - ✅ Date range queries - ✅ Non-date string comparison (backward compatibility) - ✅ Numeric comparison (backward compatibility) Example Usage { "dataset_ids": ["xxx"], "question": "test", "metadata_condition": { "conditions": [ { "name": "date", "comparison_operator": ">=", "value": "2027-01-13" } ] } } Notes - Only supports standard YYYY-MM-DD format - Non-standard date formats in data are treated as data quality issues and will not match - Users should ensure their date metadata is in the correct format --------- Co-authored-by: Clint-chan <Clint-chan@users.noreply.github.com>	2026-02-05 13:52:51 +08:00
Magicbook1108	1349e6b7d1	Fix: adressing style without a default value (#13009 ) ### What problem does this PR solve? Fix: adressing style without a default value #12396 #11510 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-02-05 13:52:23 +08:00
Yongteng Lei	6361fc4b33	Feat: update stepfun list (#12991 ) ### What problem does this PR solve? Update stepfun list. Add TTS and Sequence2Text functionalities. ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2026-02-05 12:47:04 +08:00
Levi	803b480f9c	feat: Add optional document metadata in OpenAI-compatible response references (#12950 ) ### What problem does this PR solve? This PR adds an opt‑in way to include document‑level metadata in OpenAI‑compatible reference chunks. Until now, metadata could be used for filtering but wasn’t returned in responses. The change enables clients to show richer citations (author/year/source, etc.) while keeping payload size and privacy under control via an explicit request flag and optional field allowlist. ### Type of change - [ ] Bug Fix (non-breaking change which fixes an issue) - [x] New Feature (non-breaking change which adds functionality) - [x] Documentation Update - [ ] Refactoring - [ ] Performance Improvement - [ ] Other (please describe): Contribution during my time at RAGcon GmbH.	2026-02-05 09:54:33 +08:00
writinwaters	2843570d8e	Refact: Updated Agent template description. (#12995 ) ### What problem does this PR solve? ### Type of change - [x] Refactoring	2026-02-05 09:50:44 +08:00

1 2 3 4 5 ...

5273 Commits