ragflow

mirror of https://github.com/infiniflow/ragflow.git synced 2026-05-31 04:56:00 +08:00

Author	SHA1	Message	Date
qinling0210	12af73f2ca	Support stream for multimodal chat (#14537 ) ### What problem does this PR solve? Support stream for multimodal chat ### Type of change - [x] Refactoring	2026-04-30 19:33:57 +08:00
Haruko386	93f3b90121	Go: implement provider: Vllm (#14532 ) ### What problem does this PR solve? Implement the vLLM model provider for RAGFlow to fully support local and self-hosted open-source models (e.g., Qwen, GLM, Llama) via the vLLM framework, and fix several critical bugs related to model instance management and API requests. Key changes and fixes: 1. Added Standard vLLM Provider (`vllm.go`, `vllm.json`): - Implemented `VllmModel` driver strictly adhering to the OpenAI API specification. - Removed hardcoded and dangerous routing logic (e.g., forcing `AsyncChat` for Qwen/GLM prefixes), ensuring standard `/v1/chat/completions` compatibility. - Refactored `ListModels` to use safe JSON parsing (resolving nil pointer panics) and standard `GET` requests without bodies. - Added `APIConfig.Region` fallback logic to prevent empty `base_url` fetching when checking models. 2. Fixed `ChatToModelStreamWithSender` Bug (`model_service.go`): - Resolved the `model is disabled` error when streaming chat with local database-saved models. - Added the missing `if modelInfo.Status == "active"` block to correctly invoke `NewInstance` and inject the dynamic `base_url` into the provider driver before starting the SSE stream. 3. Fixed `ListSupportedModels` Bug (`model_service.go`): - Added dynamic `NewInstance` injection for `base_url`. Previously, the list models function used the static JSON config without injecting user-configured dynamic URLs from the database, resulting in an `unsupported protocol scheme ""` error. ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) - [x] New Feature (non-breaking change which adds functionality)	2026-04-30 16:30:14 +08:00
qinling0210	265f92c83e	Simplify chat and support multimodal chat (#14523 ) ### What problem does this PR solve? Simplify chat and support multimodal chat ### Type of change - [x] Refactoring	2026-04-30 15:25:01 +08:00
Yingfeng	4ee0702aed	Feat: add skills space to context engine (#13908 ) ### What problem does this PR solve? issue #13714 ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2026-04-30 12:36:03 +08:00
Jin Hai	261be81127	Go: add drop instance models (#14485 ) ### What problem does this PR solve? 1. drop instance model 2. Fix issue of drop instance but not drop models. ### Type of change - [x] New Feature (non-breaking change which adds functionality) Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-04-29 19:18:49 +08:00
Haruko386	0e1477eb23	Go: implement provider: MiniMax (#14478 ) ### What problem does this PR solve? implement MiniMax provider ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) - [x] New Feature (non-breaking change which adds functionality)	2026-04-29 19:06:40 +08:00
Jin Hai	bb05a8bd7e	Update create model instance command (#14441 ) ### What problem does this PR solve? 1. support command: ``` RAGFlow(user)> create provider 'vllm' instance 'test' key 'test-key' url 'base-url' region 'abc'; SUCCESS RAGFlow(user)> list instances from 'vllm'; +----------+----------------------------------------+----------------------------------+--------------+----------------------------------+--------+ \| apiKey \| extra \| id \| instanceName \| providerID \| status \| +----------+----------------------------------------+----------------------------------+--------------+----------------------------------+--------+ \| test-key \| {"base_url":"base-url","region":"abc"} \| 40213c89430311f1a7cf38a74640adcc \| test \| b4d40e6142d311f1a4f938a74640adcc \| enable \| +----------+----------------------------------------+----------------------------------+--------------+----------------------------------+--------+ ``` 2. support add vllm model ``` RAGFlow(user)> add model 'Qwen/Qwen2-0.5B' to provider 'vllm' instance 'test' with tokens 131072 chat; SUCCESS ``` 3. add vllm chat ### Type of change - [x] New Feature (non-breaking change which adds functionality) - [x] Refactoring --------- Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-04-29 17:05:08 +08:00
Haruko386	decf673049	Go: implement provider: volcengine (#14460 ) ### What problem does this PR solve? implement `volcengine` provider ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2026-04-29 15:45:08 +08:00
qinling0210	dcce864d4c	Simplify Encode (#14437 ) ### What problem does this PR solve? Simplify Encode ### Type of change - [x] Refactoring	2026-04-28 18:07:42 +08:00
Haruko386	4e5a093ac5	Go: implement provider: Moonshot (#14433 ) ### What problem does this PR solve? implement `Moonshot` provider ### Type of change - [x] New Feature (non-breaking change which adds functionality)	2026-04-28 18:06:25 +08:00
Jin Hai	f670913bb4	Refactor model type to model class (#14426 ) ### What problem does this PR solve? As title ### Type of change - [x] Refactoring Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-04-28 16:05:15 +08:00
Jin Hai	ae420f6358	Go: fix compilation (#14418 ) ### What problem does this PR solve? Add methods to volcengine ### Type of change - [x] Bug Fix (non-breaking change which fixes an issue) Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-04-28 13:21:05 +08:00
qinling0210	effc84a042	Refactor model in GO (#14398 ) ### What problem does this PR solve? Refactor model in GO ### Type of change - [x] Refactoring	2026-04-28 12:59:01 +08:00
Jin Hai	819257f257	Go: add volcengine (#14409 ) ### What problem does this PR solve? 1. Refactor server_main 2. Add volcengine ### Type of change - [x] Refactoring --------- Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-04-28 12:12:58 +08:00
Jin Hai	965717c4fb	Go: add new provider: google (#14395 ) ### What problem does this PR solve? As title. ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-04-27 20:35:47 +08:00
Jin Hai	c3eac4103a	Go: aliyun model provider (#14379 ) ### What problem does this PR solve? As title. ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-04-27 14:53:33 +08:00
Jin Hai	1c244df90d	Go: add gitee and siliconflow as model provider (#14336 ) ### What problem does this PR solve? As title ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-04-24 20:59:30 +08:00
qinling0210	1473000135	Implement retrieval_test in GO (#14231 ) ### What problem does this PR solve? Implement retrieval_test in GO ### Type of change - [x] Refactoring	2026-04-24 15:30:14 +08:00
Jin Hai	2b029882d7	Go: add new provider minimax (#14296 ) ### What problem does this PR solve? 1. Add new provider minimax 2. Add new command: CHECK INSTANCE 'instance_name' FROM 'provider_name'; ``` RAGFlow(user)> check instance 'test' from 'minimax'; SUCCESS ``` ### Type of change - [x] New Feature (non-breaking change which adds functionality) Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-04-23 10:16:20 +08:00
Jin Hai	74b44e1aa3	Go: add balance command (#14262 ) ### What problem does this PR solve? ``` RAGFlow(user)> list supported models from 'moonshot' 'test'; +---------------------------------+ \| model_name \| +---------------------------------+ \| moonshot-v1-32k-vision-preview \| \| kimi-k2.6 \| \| moonshot-v1-8k \| \| moonshot-v1-auto \| \| moonshot-v1-128k \| \| moonshot-v1-32k \| \| kimi-k2.5 \| \| moonshot-v1-8k-vision-preview \| \| moonshot-v1-128k-vision-preview \| +---------------------------------+ RAGFlow(user)> show balance from 'moonshot' 'test'; +---------+----------+ \| balance \| currency \| +---------+----------+ \| 0 \| CNY \| +---------+----------+ ``` ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-04-21 21:31:50 +08:00
Jin Hai	e48d75987c	Go: add stream / think chat (#14242 ) ### What problem does this PR solve? 1. Supports stream and non-stream chat 2. Supports think and non-think chat 3. List supported models from DeepSeek service. (This command can be used to verify the API validity) ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-04-21 16:52:32 +08:00
Jin Hai	af2ed416a7	Add extra field to model instance (#14203 ) ### What problem does this PR solve? Now each model support region with different URL ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-04-20 15:31:12 +08:00
Jin Hai	6d9430a125	Add think chat to CLI (#13922 ) ### What problem does this PR solve? Now user can use 'think mode' to chat with LLM ### Type of change - [x] New Feature (non-breaking change which adds functionality) --------- Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-04-03 18:11:23 +08:00
Jin Hai	6c29128de1	Refactor model provider and command (#13887 ) ### What problem does this PR solve? Introduce 5 new tables, including model groups and provider instance. ### Type of change - [x] New Feature (non-breaking change which adds functionality) - [x] Refactoring --------- Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2026-04-02 20:20:35 +08:00

24 Commits