ragflow

mirror of https://github.com/infiniflow/ragflow.git synced 2026-06-01 05:17:51 +08:00

Author	SHA1	Message	Date
FuturMix	2548c28d65	feat: add FuturMix as model provider (#14419 ) ## Summary Add [FuturMix](https://futurmix.ai) as a new model provider. FuturMix is an OpenAI-compatible unified AI gateway that provides access to 22+ models (GPT, Claude, Gemini, DeepSeek, and more) through a single API endpoint and key. - API Base: `https://futurmix.ai/v1` (OpenAI-compatible) - Supported capabilities: Chat, Embedding, Image2Text, TTS, Speech2Text, Rerank ### Changes \| File \| Change \| \|------\|--------\| \| `rag/llm/__init__.py` \| Add `FuturMix` to `SupportedLiteLLMProvider` enum, `FACTORY_DEFAULT_BASE_URL`, and `LITELLM_PROVIDER_PREFIX` \| \| `rag/llm/chat_model.py` \| Add `FuturMixChat(Base)` — follows Astraflow/Avian pattern \| \| `rag/llm/embedding_model.py` \| Add `FuturMixEmbed(OpenAIEmbed)` — follows Astraflow pattern \| \| `rag/llm/cv_model.py` \| Add `FuturMixCV(GptV4)` — follows SILICONFLOW/OpenRouter pattern \| \| `rag/llm/tts_model.py` \| Add `FuturMixTTS(OpenAITTS)` — follows CometAPI/DeerAPI pattern \| \| `rag/llm/sequence2txt_model.py` \| Add `FuturMixSeq2txt(GPTSeq2txt)` — follows StepFun pattern \| \| `rag/llm/rerank_model.py` \| Add `FuturMixRerank(OpenAI_APIRerank)` \| \| `conf/llm_factories.json` \| Add factory config with 8 chat, 2 embedding, 1 image2text, 2 TTS, 1 speech2text models \| \| `docs/guides/models/supported_models.mdx` \| Add FuturMix to supported models table \| ### Models included - Chat: claude-sonnet-4-20250514, claude-3.5-haiku, gpt-4o, gpt-4o-mini, gemini-2.5-flash, gemini-2.0-flash, deepseek-chat, deepseek-reasoner - Embedding: text-embedding-3-small, text-embedding-3-large - Image2Text: gpt-4o - TTS: tts-1, tts-1-hd - Speech2Text: whisper-1 ## Test plan - [ ] Verify FuturMix appears in the model provider list in RAGFlow UI - [ ] Configure FuturMix with API key and test chat completion - [ ] Test embedding model with document indexing - [ ] Test image2text with a sample image 🤖 Generated with [Claude Code](https://claude.com/claude-code) --------- Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-04-30 10:59:37 +08:00
ucloudnb666	f853a39b40	feat: Add Astraflow provider support (global + China endpoints) (#14270 ) ## Add Astraflow Provider Support This PR integrates [Astraflow](https://astraflow.ucloud.cn/) (by UCloud / 优刻得) as a new AI model provider in RAGFlow, with support for both global and China endpoints. ### About Astraflow Astraflow is an OpenAI-compatible AI model aggregation platform supporting 200+ models from major providers including DeepSeek, Qwen, GPT, Claude, Gemini, Llama, Mistral, and more. \| Variant \| Factory Name \| Endpoint \| Env Var \| \|---------\|-------------\|----------\|---------\| \| Global \| `Astraflow` \| `https://api-us-ca.umodelverse.ai/v1` \| `ASTRAFLOW_API_KEY` \| \| China \| `Astraflow-CN` \| `https://api.modelverse.cn/v1` \| `ASTRAFLOW_CN_API_KEY` \| - API key signup: https://astraflow.ucloud.cn/ --- ### Files Changed \| File \| Change \| \|------\|--------\| \| `rag/llm/__init__.py` \| Register `Astraflow` and `Astraflow-CN` in `SupportedLiteLLMProvider` enum, `FACTORY_DEFAULT_BASE_URL`, and `LITELLM_PROVIDER_PREFIX` \| \| `rag/llm/chat_model.py` \| Add `AstraflowChat` and `AstraflowCNChat` (OpenAI-compatible `Base` subclass) \| \| `rag/llm/embedding_model.py` \| Add `AstraflowEmbed` and `AstraflowCNEmbed` (subclasses of `OpenAIEmbed`) \| \| `rag/llm/rerank_model.py` \| Add `AstraflowRerank` and `AstraflowCNRerank` (subclasses of `OpenAI_APIRerank`) \| \| `rag/llm/cv_model.py` \| Add `AstraflowCV` and `AstraflowCNCV` (subclasses of `GptV4`) \| \| `rag/llm/tts_model.py` \| Add `AstraflowTTS` and `AstraflowCNTTS` (subclasses of `OpenAITTS`) \| \| `rag/llm/sequence2txt_model.py` \| Add `AstraflowSeq2txt` and `AstraflowCNSeq2txt` (subclasses of `GPTSeq2txt`) \| \| `conf/llm_factories.json` \| Register `Astraflow` and `Astraflow-CN` factories with a curated list of popular models \| --- ### Supported Model Types - ✅ Chat / LLM — DeepSeek-V3/R1, Qwen3, GPT-4o/4.1, Claude 3.5/3.7, Gemini 2.0/2.5 Flash, Llama 3.3/4, Mistral, and 200+ more - ✅ Text Embedding — text-embedding-3-small/large - ✅ Image / Vision (IMAGE2TEXT) — GPT-4o, GPT-4.1, Claude, Gemini, Llama-4, etc. - ✅ Text Re-Rank - ✅ TTS — tts-1 - ✅ Speech-to-Text (SPEECH2TEXT) — whisper-1 ### Implementation Notes - Uses the `openai/` LiteLLM prefix — consistent with other OpenAI-compatible aggregation platforms (SILICONFLOW, DeerAPI, CometAPI, OpenRouter, n1n, Avian, etc.) - `Astraflow` (global, rank 250) and `Astraflow-CN` (China, rank 249) are separate factory entries, allowing users to choose the optimal endpoint based on their region. - All model classes cleanly subclass existing base classes (`Base`, `OpenAIEmbed`, `OpenAI_APIRerank`, `GptV4`, `OpenAITTS`, `GPTSeq2txt`) with no custom logic needed — the provider is fully OpenAI-compatible. --------- Co-authored-by: user <user@xzaaaMacBook-Air.local>	2026-04-22 15:38:34 +08:00
writinwaters	db5ab7bbe8	Docs: Image2text is supported by GPUStack. (#13856 ) ### What problem does this PR solve? Image2text is supported by GPUStack. #9515 ### Type of change - [x] Documentation Update	2026-03-30 20:39:02 +08:00
tmimmanuel	13d0df1562	feat: add Perplexity contextualized embeddings API as a new model provider (#13709 ) ### What problem does this PR solve? Adds Perplexity contextualized embeddings API as a new model provider, as requested in #13610. - `PerplexityEmbed` provider in `rag/llm/embedding_model.py` supporting both standard (`/v1/embeddings`) and contextualized (`/v1/contextualizedembeddings`) endpoints - All 4 Perplexity embedding models registered in `conf/llm_factories.json`: `pplx-embed-v1-0.6b`, `pplx-embed-v1-4b`, `pplx-embed-context-v1-0.6b`, `pplx-embed-context-v1-4b` - Frontend entries (enum, icon mapping, API key URL) in `web/src/constants/llm.ts` - Updated `docs/guides/models/supported_models.mdx` - 22 unit tests in `test/unit_test/rag/llm/test_perplexity_embed.py` Perplexity's API returns `base64_int8` encoded embeddings (not OpenAI-compatible), so this uses a custom `requests`-based implementation. Contextualized vs standard model is auto-detected from the model name. Closes #13610 ### Type of change - [x] New Feature (non-breaking change which adds functionality) - [x] Documentation Update	2026-03-20 10:47:48 +08:00
foyou	f75dc6a452	Docs: Fix normalization of case and some code blocks (#13520 ) ### What problem does this PR solve? Standardize term capitalization in `deploy_local_llm.mdx` and improve code block formatting. ### Type of change - [x] Documentation Update	2026-03-11 17:51:13 +08:00
yiminghub2024	5eb602166c	Enhance local model deployment documentation support gpustack guide (#13339 ) ### Type of change - [X] Documentation Update:Enhance local model deployment documentation support gpustack guide	2026-03-04 13:54:20 +08:00
writinwaters	1c87f97dde	Docs: Minor document structure tweak. (#13346 ) ### What problem does this PR solve? Refactored the document architecture. ### Type of change - [x] Documentation Update	2026-03-03 20:09:34 +08:00
writinwaters	f7c808383f	Docs: Refactored documentation (#13340 ) ### What problem does this PR solve? Refactored documentation. ### Type of change - [x] Documentation Update	2026-03-03 17:48:48 +08:00
Jimmy Ben Klieve	867ec94258	revert white-space changes in docs (#12557 ) ### What problem does this PR solve? Trailing white-spaces in commit `6814ace1aa` got automatically trimmed by code editor may causes documentation typesetting broken. Mostly for double spaces for soft line breaks. ### Type of change - [x] Documentation Update	2026-01-13 09:41:02 +08:00
Jimmy Ben Klieve	6814ace1aa	docs: update docs icons (#12465 ) ### What problem does this PR solve? Update icons for docs. Trailing spaces are auto truncated by the editor, does not affect real content. ### Type of change - [x] Documentation Update	2026-01-07 10:00:09 +08:00
Yingfeng	2114b9e3ad	Update deploy_local_llm.mdx (#12276 ) ### Type of change - [x] Documentation Update	2025-12-28 19:46:50 +08:00
yiminghub2024	45b96acf6b	Update deploy_local_llm.mdx vllm guide picture (#12275 ) ### Type of change - [x] Documentation Update	2025-12-28 19:29:33 +08:00
writinwaters	3fe94d3386	Docs: Fixed a display issue (#12259 ) ### Type of change - [x] Documentation Update	2025-12-26 21:33:55 +08:00
yiminghub2024	3ad147d349	Update deploy_local_llm.mdx with vllm guide support (#12222 ) ### What problem does this PR solve? vllm guide support ### Type of change - [x] Documentation Update	2025-12-26 15:14:25 +08:00
Jin Hai	6546f86b4e	Fix errors (#11795 ) ### What problem does this PR solve? - typos - IDE warnings ### Type of change - [x] Refactoring --------- Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2025-12-08 09:42:10 +08:00
writinwaters	0ebbb60102	Docs: deploying a local model using Jina not supported (#11624 ) ### What problem does this PR solve? ### Type of change - [x] Documentation Update	2025-12-01 11:24:29 +08:00
Jin Hai	20b6dafbd8	Update docs (#11204 ) ### What problem does this PR solve? as title ### Type of change - [x] Documentation Update Signed-off-by: Jin Hai <haijin.chn@gmail.com>	2025-11-12 14:01:47 +08:00
Zhichang Yu	73144e278b	Don't release full image (#10654 ) ### What problem does this PR solve? Introduced gpu profile in .env Added Dockerfile_tei fix datrie Removed LIGHTEN flag ### Type of change - [x] Documentation Update - [x] Refactoring	2025-10-23 23:02:27 +08:00
writinwaters	4058715df7	Docs: Knowledge base renamed to dataset. (#10269 ) ### What problem does this PR solve? ### Type of change - [x] Documentation Update	2025-09-25 09:45:27 +08:00
writinwaters	5a8bc88147	Docs: Removed `/v1` from Ollama base URLs (#10067 ) ### What problem does this PR solve? ### Type of change - [x] Documentation Update	2025-09-17 13:48:29 +08:00
writinwaters	7eb25e0de6	UI updates (#9836 ) ### What problem does this PR solve? ### Type of change - [x] Documentation Update	2025-08-30 21:44:58 +08:00
writinwaters	1fd92e6bee	Docs: RAGFlow does not suppport batch metadata setting (#7795 ) ### What problem does this PR solve? _Briefly describe what this PR aims to solve. Include background context that will help reviewers understand the purpose of the PR._ ### Type of change - [x] Documentation Update	2025-05-22 17:02:23 +08:00
Raffaele Mancuso	60787f8d5d	Fix Ollama instructions (#7478 ) Fix instructions for Ollama ### What problem does this PR solve? _Briefly describe what this PR aims to solve. Include background context that will help reviewers understand the purpose of the PR._ ### Type of change - [ ] Bug Fix (non-breaking change which fixes an issue) - [ ] New Feature (non-breaking change which adds functionality) - [ ] Documentation Update - [ ] Refactoring - [ ] Performance Improvement - [ ] Other (please describe):	2025-05-06 13:57:39 +08:00
Raffaele Mancuso	c4b3d3af95	Fix instructions for Ollama (#7468 ) 1. Use `host.docker.internal` as base URL 2. Fix numbers in list 3. Make clear what is the console input and what is the output ### What problem does this PR solve? _Briefly describe what this PR aims to solve. Include background context that will help reviewers understand the purpose of the PR._ ### Type of change - [ ] Bug Fix (non-breaking change which fixes an issue) - [ ] New Feature (non-breaking change which adds functionality) - [ ] Documentation Update - [ ] Refactoring - [ ] Performance Improvement - [ ] Other (please describe):	2025-05-06 09:47:19 +08:00
writinwaters	6051abb4a3	Miscellaneous UI updates (#6947 ) ### What problem does this PR solve? ### Type of change - [x] Documentation Update	2025-04-10 20:09:46 +08:00
writinwaters	5a8c479ff3	Miscellaneous editorial updates (#6805 ) ### What problem does this PR solve? ### Type of change - [x] Documentation Update	2025-04-07 09:33:55 +08:00
writinwaters	979cdc3626	UI updates. (#6398 ) ### What problem does this PR solve? Updated UI descriptions for delimiters and recommended chunk size ### Type of change - [x] Documentation Update	2025-03-21 16:50:20 +08:00
writinwaters	fb4b5b0a06	Added 0.17.0 release notes (#5608 ) ### What problem does this PR solve? ### Type of change - [x] Documentation Update	2025-03-04 19:21:28 +08:00
writinwaters	d9bbaf5d6c	Minor: Fixed broken links (#5565 ) ### What problem does this PR solve? ### Type of change - [x] Documentation Update	2025-03-03 19:24:28 +08:00
writinwaters	b67697b6f2	Restructured guides (#5555 ) ### What problem does this PR solve? ### Type of change - [x] Documentation Update	2025-03-03 17:13:37 +08:00
writinwaters	03d1265cfd	Restructured guides (#5549 ) ### What problem does this PR solve? ### Type of change - [x] Documentation Update	2025-03-03 15:42:39 +08:00

31 Commits