ragflow

mirror of https://github.com/infiniflow/ragflow.git synced 2026-03-12 18:49:00 +08:00

Author	SHA1	Message	Date
Magicbook1108	46dec98f52	Fix: Chat/Agent embedded page (#13199 ) ### What problem does this PR solve? Fix: Chat/Agent embedded page #13190 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-02-24 19:14:24 +08:00
chanx	7db2fb200c	Fix: Metadata mult-selected display error (#13189 ) ### What problem does this PR solve? Fix: Metadata mult-selected display error ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-02-24 13:10:32 +08:00
PandaMan	f462a9d85a	fix(web): prevent LaTeX from being cut at \right] or \big) in agent o… (#13155 ) ### What problem does this PR solve? - Use negative lookbehind (?<![a-zA-Z]) so \] and \) inside commands (e.g. \right], \big)) are not treated as block/inline delimiters - Use greedy matching to capture up to the last valid delimiter, fixing truncated formulas (e.g. C_{seq}(y\|x) = \frac{1}{\|y\|} ...) - Add unit tests for preprocessLaTeX Closes #13134 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-02-24 12:30:06 +08:00
chanx	59e9e77061	fix: Add admin proxy (#13186 ) ### What problem does this PR solve? fix: Add admin proxy ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-02-24 10:29:58 +08:00
Trifon	ce71d87867	Add Bulgarian language support (#13147 ) ### What problem does this PR solve? RAGFlow supports 12 UI languages but does not include Bulgarian. This PR adds Bulgarian (`bg` / `Български`) as the 13th supported language, covering the full UI translation (2001 keys across all 26 sections) and OCR/PDF parser language mapping. ### Type of change - [x] New Feature (non-breaking change which adds functionality) ### Changes - `web/src/constants/common.ts` — Registered Bulgarian in all 5 language data structures (`LanguageList`, `LanguageMap`, `LanguageAbbreviation` enum, `LanguageAbbreviationMap`, `LanguageTranslationMap`) - `web/src/locales/config.ts` — Added lazy-loading dynamic import for the `bg` locale - `web/src/locales/bg.ts` (new) — Full Bulgarian translation file with all 26 sections, matching the English source (`en.ts`). All interpolation placeholders, HTML tags, and technical terms are preserved as-is - `deepdoc/parser/mineru_parser.py` — Mapped `'Bulgarian'` to `'cyrillic'` in `LANGUAGE_TO_MINERU_MAP` for OCR/PDF parser support ### How it works The language selector automatically picks up the new entry. When a user selects "Български", the translation bundle is lazy-loaded on demand. The preference is persisted to the database and localStorage across sessions.	2026-02-14 16:51:29 +08:00
chanx	f2a1d59c36	Refactor: Remove ant design component (#13143 ) ### What problem does this PR solve? _Briefly describe what this PR aims to solve. Include background context that will help reviewers understand the purpose of the PR._ ### Type of change - [x] Refactoring	2026-02-13 18:40:41 +08:00
chanx	b922a5cbdf	Fix: replace session page icons and fix nested list search functionality in filters (#13127 ) ### What problem does this PR solve? Fix: replace session page icons and fix nested list search functionality in filters ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-02-12 19:48:35 +08:00
Magicbook1108	e89fd686e2	Improve: optimize file name (with path) in box container. (#13124 ) ### What problem does this PR solve? Refact: optimize file name (with path) in box container. ### Type of change - [x] Performance Improvement <img width="2357" height="1258" alt="image" src="https://github.com/user-attachments/assets/f4c5c90b-d885-4514-b7bc-f17ab62b045f" />	2026-02-12 15:40:55 +08:00
Lynn	6e7bcf58bc	Refactor: split message apis to gateway and service (#13126 ) ### What problem does this PR solve? Split message apis to gateway and service ### Type of change - [x] Refactoring	2026-02-12 14:43:52 +08:00
chanx	7210178620	Fix: Bugs fixed (#13109 ) (#13122 ) ### What problem does this PR solve? Fix: Bugs fixed (#13109) - chat pdf preview error - data source add box error - change route next-chat -> chat , next-search->search ... ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-02-12 13:42:12 +08:00
Levi	4b50b8c579	Fix: persist SSO auth token on root route loader (#12784 ) ### What problem does this PR solve? This PR fixes SSO/OIDC login persistence after the Vite migration #12568. Because wrappers are ignored by React Router, the OAuth callback never stored the auth token in localStorage, causing auth to only work while ?auth= stayed in the URL. We move that logic into a route loader and remove the Bearer prefix for the signed token so the backend accepts it. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) Contribution during my time at RAGcon GmbH. Co-authored-by: factory-droid[bot] <138933559+factory-droid[bot]@users.noreply.github.com>	2026-02-12 10:09:35 +08:00
writinwaters	4341d81e29	Refact: Updated UI tips. (#13093 ) ### What problem does this PR solve? Updated UI tips. ### Type of change - [x] Refactoring	2026-02-10 16:25:56 +08:00
chanx	586a9e05a7	Fix: Memory log style (#13090 ) ### What problem does this PR solve? Fix: Memory log style ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-02-10 16:12:59 +08:00
balibabu	126ec85ef6	Feat: Hide log button (#13085 ) ### What problem does this PR solve? Feat: Hide log button ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2026-02-10 14:05:17 +08:00
chanx	4186821de8	Fix: Bugs fixed (#13086 ) ### What problem does this PR solve? Fix: Bugs fixed - metadata icon error - search page's image not display ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-02-10 14:04:50 +08:00
balibabu	141157f529	Feat: Translation page index. (#13083 ) ### What problem does this PR solve? Feat: Translation page index. ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2026-02-10 10:37:38 +08:00
balibabu	db37804f10	Feat: Add Explore page (#13043 ) ### What problem does this PR solve? Feat: Add Explore page ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2026-02-09 19:53:51 +08:00
chanx	8ad7339448	Fix: Add authentication validation to the document API interface for embedded pages. (#13078 ) ### What problem does this PR solve? Fix: Add authentication validation to the document API interface for embedded pages and modify the document display styles. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-02-09 19:53:24 +08:00
Neel Harsola	a2dda8fb70	Fix: enable chat input resizing (#12998 ) ## Summary - add resizable support to shared textarea component - enable vertical resizing for chat inputs in chat and share surfaces - preserve autosize behavior while honoring manual resize height ## Test plan - not run (not requested) Fixes #12803 --------- Co-authored-by: Cursor <cursoragent@cursor.com>	2026-02-09 19:33:05 +08:00
eviaaaaa	0b55d1e860	fix: remove 10-item display limit in Agent Canvas configuration tables (#13049 ) ## Description This PR fixes an issue where the input and variable configuration tables in the Agent Canvas (specifically for Begin, UserFillUp, and Invoke nodes) were truncated at 10 items. Root Cause: The tables utilized `@tanstack/react-table` with `getPaginationRowModel()` enabled. Since the default page size is 10 and no pagination UI controls were implemented, users could not access items beyond the 10th row. Solution: Removed `getPaginationRowModel` from the table configurations. These lists (inputs/variables) are typically short, so rendering all items in a single scrollable view is the intended behavior. * Modified `query-table.tsx` * Modified `variable-table.tsx` ## How to verify 1. Create a Begin, UserFillUp, or Invoke node in the Agent Canvas. 2. Add more than 10 input items or variables. 3. Verify that all items are visible in the list and not truncated at the 10th item. ## What kind of change does this PR introduce? * [x] Bugfix	2026-02-09 10:43:50 +08:00
chanx	8217ccced8	Fix: whyDidYouRender error (#13040 ) ### What problem does this PR solve? Fix: whyDidYouRender error ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-02-09 09:57:52 +08:00
chanx	c130ac0f88	Fix: Lazy loading adds a loading state to the page (#13038 ) ### What problem does this PR solve? Fix: Lazy loading adds a loading state to the page ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-02-06 16:20:52 +08:00
chanx	00c392e633	Fix: dataset page enter key to save (#13035 ) ### What problem does this PR solve? Fix dataset page enter key to save Fix the warnings and optimize the code. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-02-06 14:42:16 +08:00
balibabu	bbd8ba64a1	Feat: Control interface documentation directory display and hiding (#13008 ) ### What problem does this PR solve? Feat: Control interface documentation directory display and hiding ### Type of change - [x] New Feature (non-breaking change which adds functionality) Co-authored-by: Kevin Hu <kevinhu.sh@gmail.com>	2026-02-05 16:59:20 +08:00
Neel Harsola	1a85d2f8de	Fix: prevent streaming message width collapse (#12999 ) ## Summary - keep assistant message containers stretched to available width - avoid width collapse during streaming by allowing flex items to shrink ## Test plan - not run (not requested) Fixes #12985 Made with [Cursor](https://cursor.com) Co-authored-by: Cursor <cursoragent@cursor.com> Co-authored-by: Liu An <asiro@qq.com>	2026-02-05 15:58:55 +08:00
chanx	2a7dca6fc9	Fix: parser bug (#13014 ) …, clicking "Parse" will still ask if you want to clear the chunks of the already parsed files. ### What problem does this PR solve? Fix: After selecting all and then unchecking the already parsed files, clicking "Parse" will still ask if you want to clear the chunks of the already parsed files. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-02-05 15:57:38 +08:00
Magicbook1108	75b2d482e2	Fix: ingestion pipeline (#13012 ) ### What problem does this PR solve? Fix ingestion pipeline Only 1 file is acceptable for ingestion pipeline. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-02-05 15:55:41 +08:00
chanx	89fdb1d498	Feat: Add model verify (#13005 ) ### What problem does this PR solve? Feat: Add model verify ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Co-authored-by: Liu An <asiro@qq.com>	2026-02-05 15:53:20 +08:00
Magicbook1108	1349e6b7d1	Fix: adressing style without a default value (#13009 ) ### What problem does this PR solve? Fix: adressing style without a default value #12396 #11510 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-02-05 13:52:23 +08:00
balibabu	2ff2e72488	Fix: Fixed the issue where deleted images in the agent chat box would still be sent to the backend. (#12992 ) ### What problem does this PR solve? Fix: Fixed the issue where deleted images in the agent chat box would still be sent to the backend. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-02-05 09:49:01 +08:00
balibabu	2627a7f5a8	Feat: Move the reasoning field to the root of the payload in the completion interface. (#12990 ) ### What problem does this PR solve? ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2026-02-04 19:21:49 +08:00
BitToby	4d4b5a978d	feat: enable multi-file upload for chat and agent workflows (#12977 ) ### Closes: #12921 ### What problem does this PR solve? Previously, multi-file upload was not working correctly across the application: - Chat: UI displayed "Upload max 5 files" but only the first file was actually uploaded - Agent conversational mode: Frontend sent multiple files but backend only processed one - Agent task-mode file inputs: Explicitly limited to single file only This PR enables proper multi-file upload support for both chat and agent workflows, allowing users to upload and process multiple files (up to 5) as the UI originally suggested. Changes: - `web/src/pages/next-chats/hooks/use-upload-file.ts`: Process all files instead of only `files[0]` - `api/apps/canvas_app.py`: Handle multiple files via `files.getlist("file")` - `web/src/pages/agent/debug-content/uploader.tsx`: Allow up to 5 files with `multiple={true}` - `agent/component/begin.py` & `fillup.py`: Support file arrays while maintaining backward compatibility ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2026-02-04 18:03:21 +08:00
balibabu	ffdf19b27f	Fix: Variables within multiple parentheses cannot be displayed correctly. #12987 (#12988 ) ### What problem does this PR solve? Fix: Variables within multiple parentheses cannot be displayed correctly. #12987 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-02-04 17:54:02 +08:00
Magicbook1108	a37d287fad	Fix: pdf chunking / table rotation (#12981 ) ### What problem does this PR solve? Fix: PDF chunking issue for single-page documents Refactor: Change the default refresh frequency to 5 Fix: Add a 0-degree threshold; require other rotation angles to exceed it by at least 0.2 Fix: Put connector name tips to correct place Fix: incorrect example response in delete datasets. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) - [x] Refactoring	2026-02-04 17:00:25 +08:00
MkDev11	6f31c5fed2	feat/add MySQL and PostgreSQL data source connectors (#12817 ) ### What problem does this PR solve? This PR adds MySQL and PostgreSQL as data source connectors, allowing users to import data directly from relational databases into RAGFlow for RAG workflows. Many users store their knowledge in databases (product catalogs, documentation, FAQs, etc.) and currently have no way to sync this data into RAGFlow without exporting to files first. This feature lets them connect directly to their databases, run SQL queries, and automatically create documents from the results. Closes #763 Closes #11560 ### Type of change - [ ] Bug Fix (non-breaking change which fixes an issue) - [x] New Feature (non-breaking change which adds functionality) - [ ] Documentation Update - [ ] Refactoring - [ ] Performance Improvement - [ ] Other (please describe): ### What this PR does New capabilities: - Connect to MySQL and PostgreSQL databases - Run custom SQL queries to extract data - Map database columns to document content (vectorized) and metadata (searchable) - Support incremental sync using a timestamp column - Full frontend UI with connection form and tooltips Files changed: Backend: - `common/constants.py` - Added MYSQL/POSTGRESQL to FileSource enum - `common/data_source/config.py` - Added to DocumentSource enum - `common/data_source/rdbms_connector.py` - New connector (368 lines) - `common/data_source/__init__.py` - Exported the connector - `rag/svr/sync_data_source.py` - Added MySQL and PostgreSQL sync classes - `pyproject.toml` - Added mysql-connector-python dependency Frontend: - `web/src/pages/user-setting/data-source/constant/index.tsx` - Form fields - `web/src/locales/en.ts` - English translations - `web/src/assets/svg/data-source/mysql.svg` - MySQL icon - `web/src/assets/svg/data-source/postgresql.svg` - PostgreSQL icon ### Testing done Tested with MySQL 8.0 and PostgreSQL 16: - Connection validation works correctly - Full sync imports all query results as documents - Incremental sync only fetches rows updated since last sync - Custom SQL queries filter data as expected - Invalid credentials show clear error messages - Lint checks pass (`ruff check` returns no errors) --------- Co-authored-by: mkdev11 <YOUR_GITHUB_ID+MkDev11@users.noreply.github.com>	2026-02-04 10:14:32 +08:00
writinwaters	0ab02854d9	Refact: Updated UI tips (#12976 ) ### What problem does this PR solve? Updated UI tips. ### Type of change - [x] Refactoring	2026-02-04 09:48:59 +08:00
balibabu	414e261eda	Fix: If the agent debug sheet contains too much content, some of it will not be displayed. #12974 (#12975 ) ### What problem does this PR solve? Fix: If the agent debug sheet contains too much content, some of it will not be displayed. #12974 ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-02-04 09:48:28 +08:00
Yesid Cano Castro	deeae8dba4	feat(connector): add Seafile as data source (#12945 ) ### What problem does this PR solve? This PR adds Seafile as a new data source connector for RAGFlow. [Seafile](https://www.seafile.com/) is an open-source, self-hosted file sync and share platform widely used by enterprises, universities, and organizations that require data sovereignty and privacy. Users who store documents in Seafile currently have no way to index and search their content through RAGFlow. This connector enables RAGFlow users to: - Connect to self-hosted Seafile servers via API token - Index documents from personal and shared libraries - Support incremental polling for updated files - Seamlessly integrate Seafile-stored documents into their RAG pipelines ### Type of change - [x] New Feature (non-breaking change which adds functionality) ### Changes included - `SeaFileConnector` implementing `LoadConnector` and `PollConnector` interfaces - Support for API token - Recursive file traversal across libraries - Time-based filtering for incremental updates - Seafile logo (sourced from Simple Icons, CC0) - Connector configuration and registration ### Testing - Tested against self-hosted Seafile Community Edition - Verified authentication (token) - Verified document ingestion from personal and shared libraries - Verified incremental polling with time filters	2026-02-03 13:42:05 +08:00
chanx	25bb2e1616	Fix:Optimize metadata and optimize the empty state style of the agent page. (#12960 ) ### What problem does this PR solve? Fix:Optimize metadata and optimize the empty state style of the agent page. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-02-03 11:43:44 +08:00
Jimmy Ben Klieve	fafaaa26c3	feat: memory status (#12959 ) ### What problem does this PR solve? Add memory status indicator and detail message dialog ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2026-02-03 11:16:18 +08:00
Philipp Heyken Soares	ad06c042c4	Support operator constraints in semi-automatic metadata filtering (#12956 ) ### What problem does this PR solve? #### Summary This PR enhances the Semi-automatic metadata filtering mode by allowing users to explicitly pre-define operators (e.g., contains, =, >, etc.) for selected metadata keys. While the LLM still dynamically extracts the filter value from the user's query, it is now strictly constrained to use the operator specified in the UI configuration. Using this feature is optional. By default the operator selection is set to "automatic" resulting in the LLM choosing the operator (as presently). #### Rationale & Use Case This enhancement was driven by a concrete challenge I encountered while working with technical documentation. In my specific use case, I was trying to filter for software versions within a technical manual. In this dataset, a single document chunk often applies to multiple software versions. These versions are stored as a combined string within the metadata for each chunk. When using the standard semi-automatic filter, the LLM would inconsistently choose between the contains and equals operators. When it chose equals, it would exclude every chunk that applied to more than one version, even if the version I was searching for was clearly included in that metadata string. This led to incomplete and frustrating retrieval results. By extending the semi-automatic filter to allow pre-defining the operator for a specific key, I was able to force the use of contains for the version field. This change immediately led to significantly improved and more reliable results in my case. I believe this functionality will be equally useful for others dealing with "tagged" or multi-value metadata where the relationship between the query and the field is known, but the specific value needs to remain dynamic. #### Key Changes ##### Backend & Core Logic - `common/metadata_utils.py`: Updated apply_meta_data_filter to support a mixed data structure for semi_auto (handling both legacy string arrays and the new object-based format {"key": "...", "op": "..."}). - `rag/prompts/generator.py`: Extended gen_meta_filter to accept and pass operator constraints to the LLM. - `rag/prompts/meta_filter.md`: Updated the system prompt to instruct the LLM to strictly respect provided operator constraints. ##### Frontend - `web/src/components/metadata-filter/metadata-semi-auto-fields.tsx`: Enhanced the UI to include an operator dropdown for each selected metadata key, utilizing existing operator constants. - `web/src/components/metadata-filter/index.tsx`: Updated the validation schema to accommodate the new state structure. #### Test Plan - Backward Compatibility: Verified that existing semi-auto filters stored as simple strings still function correctly. - Prompt Verification: Confirmed that constraints are correctly rendered in the LLM system prompt when specified. - Added unit tests as `test/unit_test/common/test_apply_semi_auto_meta_data_filter.py` - Manual End-to-End: - Configured a "Semi-automatic" filter for a "Version" key with the "contains" operator. - Asked a version-specific query. - Result <img width="1173" height="704" alt="Screenshot 2026-02-02 145359" src="https://github.com/user-attachments/assets/510a6a61-a231-4dc2-a7fe-cdfc07219132" /> ### Type of change - [ ] Bug Fix (non-breaking change which fixes an issue) - [x] New Feature (non-breaking change which adds functionality) - [ ] Documentation Update - [ ] Refactoring - [ ] Performance Improvement - [ ] Other (please describe): --------- Co-authored-by: Philipp Heyken Soares <philipp.heyken-soares@am.ai>	2026-02-03 11:11:34 +08:00
eviaaaaa	2e5a18602b	refactor: optimize agent list payload and improve multimodal detection logic (#12942 ) ## Description This PR focuses on API performance optimization and refining the model capability detection logic in the Agent/Canvas module. ### 1. Performance Optimization (Backend) - Changes: Removed `cls.model.dsl` from query fields in `UserCanvasService.get_by_tenant_ids`. - Reasoning: The `dsl` object is large and unnecessary for the Agent list view. Excluding it reduces the payload size of the `/v1/canvas/list` API, leading to faster serialization and reduced network latency. - Consistency: Full DSL data remains accessible via the individual `/v1/canvas/get/<id>` endpoint used in the detail view. ### 2. Multimodal Detection Refinement (Frontend) - Changes: Replaced `model_type === LlmModelType.Image2text` with `tags?.includes('IMAGE2TEXT')`. - Reasoning: In RAGFlow, `model_type` defines the primary role of a model (e.g., `chat`). However, many advanced Chat models are also vision-capable. Since `model_type` is a single-value field, it cannot represent these multiple capabilities. - Solution: Utilizing the `tags` field (which supports multiple attributes) to check for `IMAGE2TEXT` ensures that models like `gpt-5.2-pro` correctly display multimodal input options. ## Type of Change - [x] Bug fix (logic correction for multimodal detection) - [x] Optimization (performance improvement for list API) ## Main Changes - `api/db/services/canvas_service.py`: Optimized DB query by excluding heavy DSL fields. - `web/src/pages/agent/form/agent-form/index.tsx`: Enhanced capability detection using the tags system. ## Verification - [x] Verified Agent list loads faster with reduced response payload. - [x] Confirmed that `chat` models with the `IMAGE2TEXT` tag now correctly enable the multimodal input UI.	2026-02-02 17:35:54 +08:00
方程	8fc3986f70	fix(llm): Fix Gitee AI links and update the reranker model configuration (#12916 ) ### What problem does this PR solve? Fix Gitee AI links and update the reranker model configuration ### Type of change - [X] Bug Fix (non-breaking change which fixes an issue) Co-authored-by: franco <1787003204@q.comq>	2026-02-02 13:43:16 +08:00
Carve_	ee23b9eb63	feature:Add OceanBase Support to Text-to-SQL Agent (#12919 ) ### What problem does this PR solve? Close #12768. This PR adds OceanBase support to RAGFlow’s Text-to-SQL (ExeSQL) component. OceanBase is integrated via MySQL compatibility mode, and the UI `db_type` options are updated accordingly. ### Type of change - [ ] Bug Fix (non-breaking change which fixes an issue) - [x] New Feature (non-breaking change which adds functionality) - [ ] Documentation Update - [ ] Refactoring - [ ] Performance Improvement - [ ] Other (please describe): ### Changes Backend - Add `oceanbase` `db_type` validation and connection logic in `exesql.py` and reuse existing MySQL compatibility mode Frontend - Add OceanBase option to the ExeSQL `db_type` selector ### How to test 1. Configure OceanBase connection in ExeSQL node (host/port/user/password/database) 2. Input: “Show 10 rows from test table” 3. Generated SQL: `SELECT * FROM test LIMIT 10;` 4. Query executes successfully and results are returned ### Screenshots - ExeSQL db_type includes OceanBase <img width="649" height="1015" alt="2" src="https://github.com/user-attachments/assets/e0a5f7b9-e282-402a-8639-64c1aef8fce6" /> - ExeSQL test OceanBase connection <img width="2247" height="1140" alt="test_ob" src="https://github.com/user-attachments/assets/f16ebd93-b48e-4d18-b53f-8496581e755d" /> - Query results from OceanBase shown in UI <img width="2550" height="1351" alt="1" src="https://github.com/user-attachments/assets/b44163dc-baab-420d-b31e-b644bdcb77a9" />	2026-01-31 15:03:40 +08:00
BitToby	73645e2f78	fix: preserve line breaks in prompt editor and add auto-save on blur (#12887 ) Closes #12762 ### What problem does this PR solve? Line break issue in Agent prompt editor: - Text with blank lines in `system_prompt` or `user_prompt` would have extra/fewer blank lines after save/reload or paste - Root cause: Mismatch between Lexical editor's paragraph nodes (`\n\n` separator) and line break nodes (`\n` separator) Auto-save issue: - Changes were only saved after 20-second debounce, causing data loss on page refresh before timer completed ### Solution 1. Line break fix: Use `LineBreakNode` consistently for all line breaks (typing Enter, paste, load) 2. Auto-save: Save immediately when prompt editor loses focus [1.webm](https://github.com/user-attachments/assets/eb2c2428-54a3-4d4e-8037-6cc34a859b83) ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-01-30 10:29:51 +08:00
writinwaters	d99f6a611a	Refact: Updated UI tips. (#12898 ) ### What problem does this PR solve? ### Type of change - [ ] Bug Fix (non-breaking change which fixes an issue) - [ ] New Feature (non-breaking change which adds functionality) - [ ] Documentation Update - [x] Refactoring - [ ] Performance Improvement - [ ] Other (please describe):	2026-01-29 17:56:07 +08:00
Jimmy Ben Klieve	ec88e17710	fix: task executor bar chart error (#12894 ) ### What problem does this PR solve? Fix wrong data rendered in task executor bar chart ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-01-29 17:34:05 +08:00
Zhichang Yu	fd11aca8e5	feat: Implement pluggable multi-provider sandbox architecture (#12820 ) ## Summary Implement a flexible sandbox provider system supporting both self-managed (Docker) and SaaS (Aliyun Code Interpreter) backends for secure code execution in agent workflows. Key Changes: - ✅ Aliyun Code Interpreter provider using official `agentrun-sdk>=0.0.16` - ✅ Self-managed provider with gVisor (runsc) security - ✅ Arguments parameter support for dynamic code execution - ✅ Database-only configuration (removed fallback logic) - ✅ Configuration scripts for quick setup Issue #12479 ## Features ### 🔌 Provider Abstraction Layer 1. Self-Managed Provider (`agent/sandbox/providers/self_managed.py`) - Wraps existing executor_manager HTTP API - gVisor (runsc) for secure container isolation - Configurable pool size, timeout, retry logic - Languages: Python, Node.js, JavaScript - ⚠️ Requires: gVisor installation, Docker, base images 2. Aliyun Code Interpreter (`agent/sandbox/providers/aliyun_codeinterpreter.py`) - SaaS integration using official agentrun-sdk - Serverless microVM execution with auto-authentication - Hard timeout: 30 seconds max - Credentials: `AGENTRUN_ACCESS_KEY_ID`, `AGENTRUN_ACCESS_KEY_SECRET`, `AGENTRUN_ACCOUNT_ID`, `AGENTRUN_REGION` - Automatically wraps code to call `main()` function 3. E2B Provider (`agent/sandbox/providers/e2b.py`) - Placeholder for future integration ### ⚙️ Configuration System - `conf/system_settings.json`: Default provider = `aliyun_codeinterpreter` - `agent/sandbox/client.py`: Enforces database-only configuration - Admin UI: `/admin/sandbox-settings` - Configuration validation via `validate_config()` method - Health checks for all providers ### 🎯 Key Capabilities Arguments Parameter Support: All providers support passing arguments to `main()` function: ```python # User code def main(name: str, count: int) -> dict: return {"message": f"Hello {name}!" * count} # Executed with: arguments={"name": "World", "count": 3} # Result: {"message": "Hello World!Hello World!Hello World!"} ``` Self-Describing Providers: Each provider implements `get_config_schema()` returning form configuration for Admin UI Error Handling: Structured `ExecutionResult` with stdout, stderr, exit_code, execution_time ## Configuration Scripts Two scripts for quick Aliyun sandbox setup: Shell Script (requires jq): ```bash source scripts/configure_aliyun_sandbox.sh ``` Python Script (interactive): ```bash python3 scripts/configure_aliyun_sandbox.py ``` ## Testing ```bash # Unit tests uv run pytest agent/sandbox/tests/test_providers.py -v # Aliyun provider tests uv run pytest agent/sandbox/tests/test_aliyun_codeinterpreter.py -v # Integration tests (requires credentials) uv run pytest agent/sandbox/tests/test_aliyun_codeinterpreter_integration.py -v # Quick SDK validation python3 agent/sandbox/tests/verify_sdk.py ``` Test Coverage: - 30 unit tests for provider abstraction - Provider-specific tests for Aliyun - Integration tests with real API - Security tests for executor_manager ## Documentation - `docs/develop/sandbox_spec.md` - Complete architecture specification - `agent/sandbox/tests/MIGRATION_GUIDE.md` - Migration from legacy sandbox - `agent/sandbox/tests/QUICKSTART.md` - Quick start guide - `agent/sandbox/tests/README.md` - Testing documentation ## Breaking Changes ⚠️ Migration Required: 1. Directory Move: `sandbox/` → `agent/sandbox/` - Update imports: `from sandbox.` → `from agent.sandbox.` 2. Mandatory Configuration: - SystemSettings must have `sandbox.provider_type` configured - Removed fallback default values - Configuration must exist in database (from `conf/system_settings.json`) 3. Aliyun Credentials: - Requires `AGENTRUN_` environment variables (not `ALIYUN_`) - `AGENTRUN_ACCOUNT_ID` is now required (Aliyun primary account ID) 4. Self-Managed Provider: - gVisor (runsc) must be installed for security - Install: `go install gvisor.dev/gvisor/runsc@latest` ## Database Schema Changes ```python # SystemSettings.value: CharField → TextField api/db/db_models.py: Changed for unlimited config length # SystemSettingsService.get_by_name(): Fixed query precision api/db/services/system_settings_service.py: startswith → exact match ``` ## Files Changed ### Backend (Python) - `agent/sandbox/providers/base.py` - SandboxProvider ABC interface - `agent/sandbox/providers/manager.py` - ProviderManager - `agent/sandbox/providers/self_managed.py` - Self-managed provider - `agent/sandbox/providers/aliyun_codeinterpreter.py` - Aliyun provider - `agent/sandbox/providers/e2b.py` - E2B provider (placeholder) - `agent/sandbox/client.py` - Unified client (enforces DB-only config) - `agent/tools/code_exec.py` - Updated to use provider system - `admin/server/services.py` - SandboxMgr with registry & validation - `admin/server/routes.py` - 5 sandbox API endpoints - `conf/system_settings.json` - Default: aliyun_codeinterpreter - `api/db/db_models.py` - TextField for SystemSettings.value - `api/db/services/system_settings_service.py` - Exact match query ### Frontend (TypeScript/React) - `web/src/pages/admin/sandbox-settings.tsx` - Settings UI - `web/src/services/admin-service.ts` - Sandbox service functions - `web/src/services/admin.service.d.ts` - Type definitions - `web/src/utils/api.ts` - Sandbox API endpoints ### Documentation - `docs/develop/sandbox_spec.md` - Architecture spec - `agent/sandbox/tests/MIGRATION_GUIDE.md` - Migration guide - `agent/sandbox/tests/QUICKSTART.md` - Quick start - `agent/sandbox/tests/README.md` - Testing guide ### Configuration Scripts - `scripts/configure_aliyun_sandbox.sh` - Shell script (jq) - `scripts/configure_aliyun_sandbox.py` - Python script ### Tests - `agent/sandbox/tests/test_providers.py` - 30 unit tests - `agent/sandbox/tests/test_aliyun_codeinterpreter.py` - Provider tests - `agent/sandbox/tests/test_aliyun_codeinterpreter_integration.py` - Integration tests - `agent/sandbox/tests/verify_sdk.py` - SDK validation ## Architecture ``` Admin UI → Admin API → SandboxMgr → ProviderManager → [SelfManaged\|Aliyun\|E2B] ↓ SystemSettings ``` ## Usage ### 1. Configure Provider Via Admin UI: 1. Navigate to `/admin/sandbox-settings` 2. Select provider (Aliyun Code Interpreter / Self-Managed) 3. Fill in configuration 4. Click "Test Connection" to verify 5. Click "Save" to apply Via Configuration Scripts: ```bash # Aliyun provider export AGENTRUN_ACCESS_KEY_ID="xxx" export AGENTRUN_ACCESS_KEY_SECRET="yyy" export AGENTRUN_ACCOUNT_ID="zzz" export AGENTRUN_REGION="cn-shanghai" source scripts/configure_aliyun_sandbox.sh ``` ### 2. Restart Service ```bash cd docker docker compose restart ragflow-server ``` ### 3. Execute Code in Agent ```python from agent.sandbox.client import execute_code result = execute_code( code='def main(name: str) -> dict: return {"message": f"Hello {name}!"}', language="python", timeout=30, arguments={"name": "World"} ) print(result.stdout) # {"message": "Hello World!"} ``` ## Troubleshooting ### "Container pool is busy" (Self-Managed) - Cause: Pool exhausted (default: 1 container in `.env`) - Fix: Increase `SANDBOX_EXECUTOR_MANAGER_POOL_SIZE` to 5+ ### "Sandbox provider type not configured" - Cause: Database missing configuration - Fix: Run config script or set via Admin UI ### "gVisor not found" - Cause: runsc not installed - Fix: `go install gvisor.dev/gvisor/runsc@latest && sudo cp ~/go/bin/runsc /usr/local/bin/` ### Aliyun authentication errors - Cause: Wrong environment variable names - Fix: Use `AGENTRUN_` prefix (not `ALIYUN_`) ## Checklist - [x] All tests passing (30 unit tests + integration tests) - [x] Documentation updated (spec, migration guide, quickstart) - [x] Type definitions added (TypeScript) - [x] Admin UI implemented - [x] Configuration validation - [x] Health checks implemented - [x] Error handling with structured results - [x] Breaking changes documented - [x] Configuration scripts created - [x] gVisor requirements documented Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com> --------- Co-authored-by: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-01-28 13:28:21 +08:00
balibabu	fe99905a2b	Refactor: Remove the brute-force deduplication method for agent logs. (#12864 ) ### What problem does this PR solve? Refactor: Remove the brute-force deduplication method for agent logs. ### Type of change - [x] Refactoring	2026-01-28 12:04:30 +08:00
BitToby	df3d044f03	fix: enable auto-resize for chat input textarea (#12836 ) Closes #12803 ### What problem does this PR solve? The chat input textarea in the Chat UI (and Embed UI) has a fixed height and cannot be resized, causing poor UX when users type messages longer than 2 sentences. The input becomes cramped and difficult to read/edit. Root cause: The `Textarea` component in [NextMessageInput](cci:1://file:///ragflow/web/src/components/message-input/next.tsx:62:0-290:1) had `resize-none` and `field-sizing-content` CSS classes that prevented resizing, and the existing `autoSize` prop was not being utilized. Solution: - Removed `resize-none` and `field-sizing-content` classes - Added `autoSize={{ minRows: 1, maxRows: 8 }}` to enable auto-expand - Added `max-h-40` class to limit maximum height to 160px The textarea now auto-expands from 1 to 8 rows as users type longer messages. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue)	2026-01-28 09:53:02 +08:00

1 2 3 4 5 ...

1711 Commits