mirror of
https://github.com/infiniflow/ragflow.git
synced 2026-05-28 19:53:06 +08:00
## Summary Add [FuturMix](https://futurmix.ai) as a new model provider. FuturMix is an OpenAI-compatible unified AI gateway that provides access to 22+ models (GPT, Claude, Gemini, DeepSeek, and more) through a single API endpoint and key. - **API Base**: `https://futurmix.ai/v1` (OpenAI-compatible) - **Supported capabilities**: Chat, Embedding, Image2Text, TTS, Speech2Text, Rerank ### Changes | File | Change | |------|--------| | `rag/llm/__init__.py` | Add `FuturMix` to `SupportedLiteLLMProvider` enum, `FACTORY_DEFAULT_BASE_URL`, and `LITELLM_PROVIDER_PREFIX` | | `rag/llm/chat_model.py` | Add `FuturMixChat(Base)` — follows Astraflow/Avian pattern | | `rag/llm/embedding_model.py` | Add `FuturMixEmbed(OpenAIEmbed)` — follows Astraflow pattern | | `rag/llm/cv_model.py` | Add `FuturMixCV(GptV4)` — follows SILICONFLOW/OpenRouter pattern | | `rag/llm/tts_model.py` | Add `FuturMixTTS(OpenAITTS)` — follows CometAPI/DeerAPI pattern | | `rag/llm/sequence2txt_model.py` | Add `FuturMixSeq2txt(GPTSeq2txt)` — follows StepFun pattern | | `rag/llm/rerank_model.py` | Add `FuturMixRerank(OpenAI_APIRerank)` | | `conf/llm_factories.json` | Add factory config with 8 chat, 2 embedding, 1 image2text, 2 TTS, 1 speech2text models | | `docs/guides/models/supported_models.mdx` | Add FuturMix to supported models table | ### Models included - **Chat**: claude-sonnet-4-20250514, claude-3.5-haiku, gpt-4o, gpt-4o-mini, gemini-2.5-flash, gemini-2.0-flash, deepseek-chat, deepseek-reasoner - **Embedding**: text-embedding-3-small, text-embedding-3-large - **Image2Text**: gpt-4o - **TTS**: tts-1, tts-1-hd - **Speech2Text**: whisper-1 ## Test plan - [ ] Verify FuturMix appears in the model provider list in RAGFlow UI - [ ] Configure FuturMix with API key and test chat completion - [ ] Test embedding model with document indexing - [ ] Test image2text with a sample image 🤖 Generated with [Claude Code](https://claude.com/claude-code) --------- Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
100 lines
11 KiB
Plaintext
100 lines
11 KiB
Plaintext
---
|
|
sidebar_position: 3
|
|
slug: /supported_models
|
|
sidebar_custom_props: {
|
|
categoryIcon: LucideBox
|
|
}
|
|
---
|
|
# Supported models
|
|
|
|
import APITable from '@site/src/components/APITable';
|
|
|
|
A complete list of models supported by RAGFlow, which will continue to expand.
|
|
|
|
```mdx-code-block
|
|
<APITable>
|
|
```
|
|
|
|
| Provider | LLM | Image2Text | Speech2text | TTS | Embedding | Rerank | OCR |
|
|
| --------------------- | ------------------ | ------------------ | ------------------ | ------------------ | ------------------ | ------------------ | ------------------ |
|
|
| Anthropic | :heavy_check_mark: | | | | | | |
|
|
| Astraflow | :heavy_check_mark: | | | | :heavy_check_mark: | | |
|
|
| Astraflow-CN | :heavy_check_mark: | | | | :heavy_check_mark: | | |
|
|
| Avian | :heavy_check_mark: | | | | | | |
|
|
| Azure-OpenAI | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: | | :heavy_check_mark: | | |
|
|
| BaiChuan | :heavy_check_mark: | | | | :heavy_check_mark: | | |
|
|
| BaiduYiyan | :heavy_check_mark: | :heavy_check_mark: | | | :heavy_check_mark: | :heavy_check_mark: | |
|
|
| Bedrock | :heavy_check_mark: | | | | :heavy_check_mark: | | |
|
|
| Cohere | :heavy_check_mark: | :heavy_check_mark: | | | :heavy_check_mark: | :heavy_check_mark: | |
|
|
| DeepSeek | :heavy_check_mark: | | | | | | |
|
|
| Fish Audio | | | | :heavy_check_mark: | | | |
|
|
| FuturMix | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: | |
|
|
| Gemini | :heavy_check_mark: | :heavy_check_mark: | | | :heavy_check_mark: | | |
|
|
| Google Cloud | :heavy_check_mark: | | | | | | |
|
|
| GPUStack | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: | |
|
|
| Groq | :heavy_check_mark: | | | | | | |
|
|
| HuggingFace | :heavy_check_mark: | | | | :heavy_check_mark: | | |
|
|
| Jina | | | | | :heavy_check_mark: | :heavy_check_mark: | |
|
|
| LocalAI | :heavy_check_mark: | :heavy_check_mark: | | | :heavy_check_mark: | | |
|
|
| LongCat | :heavy_check_mark: | | | | | | |
|
|
| LM-Studio | :heavy_check_mark: | :heavy_check_mark: | | | :heavy_check_mark: | | |
|
|
| MiniMax | :heavy_check_mark: | | | | | | |
|
|
| MinerU | | | | | | | :heavy_check_mark: |
|
|
| Mistral | :heavy_check_mark: | | | | :heavy_check_mark: | | |
|
|
| ModelScope | :heavy_check_mark: | | | | | | |
|
|
| Moonshot | :heavy_check_mark: | :heavy_check_mark: | | | | | |
|
|
| NovitaAI | :heavy_check_mark: | | | | :heavy_check_mark: | | |
|
|
| NVIDIA | :heavy_check_mark: | :heavy_check_mark: | | | :heavy_check_mark: | :heavy_check_mark: | |
|
|
| Ollama | :heavy_check_mark: | :heavy_check_mark: | | | :heavy_check_mark: | | |
|
|
| OpenAI | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: | | |
|
|
| OpenAI-API-Compatible | :heavy_check_mark: | :heavy_check_mark: | | | :heavy_check_mark: | :heavy_check_mark: | |
|
|
| OpenRouter | :heavy_check_mark: | :heavy_check_mark: | | | | | |
|
|
| Perplexity | | :heavy_check_mark: | | | | | |
|
|
| Replicate | :heavy_check_mark: | | | | :heavy_check_mark: | | |
|
|
| PPIO | :heavy_check_mark: | | | | | | |
|
|
| SILICONFLOW | :heavy_check_mark: | :heavy_check_mark: | | | :heavy_check_mark: | :heavy_check_mark: | |
|
|
| StepFun | :heavy_check_mark: | | | | | | |
|
|
| Tencent Hunyuan | :heavy_check_mark: | | | | | | |
|
|
| Tencent Cloud | | | :heavy_check_mark: | | | | |
|
|
| TogetherAI | :heavy_check_mark: | :heavy_check_mark: | | | :heavy_check_mark: | :heavy_check_mark: | |
|
|
| TokenPony | :heavy_check_mark: | | | | | | |
|
|
| Tongyi-Qianwen | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: | |
|
|
| Upstage | :heavy_check_mark: | | | | :heavy_check_mark: | | |
|
|
| VLLM | :heavy_check_mark: | :heavy_check_mark: | | | :heavy_check_mark: | :heavy_check_mark: | |
|
|
| VolcEngine | :heavy_check_mark: | | | | | | |
|
|
| Voyage AI | | :heavy_check_mark: | | | :heavy_check_mark: | :heavy_check_mark: | |
|
|
| Xinference | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: | |
|
|
| XunFei Spark | :heavy_check_mark: | | | :heavy_check_mark: | | | |
|
|
| xAI | :heavy_check_mark: | :heavy_check_mark: | | | | | |
|
|
| ZHIPU-AI | :heavy_check_mark: | :heavy_check_mark: | | | :heavy_check_mark: | | |
|
|
| DeepInfra | :heavy_check_mark: | | :heavy_check_mark: | :heavy_check_mark: | :heavy_check_mark: | | |
|
|
| 302.AI | :heavy_check_mark: | :heavy_check_mark: | | | :heavy_check_mark: | :heavy_check_mark: | |
|
|
| CometAPI | :heavy_check_mark: | | | | :heavy_check_mark: | | |
|
|
| DeerAPI | :heavy_check_mark: | :heavy_check_mark: | | :heavy_check_mark: | :heavy_check_mark: | | |
|
|
| Jiekou.AI | :heavy_check_mark: | | | | :heavy_check_mark: | :heavy_check_mark: | |
|
|
|
|
```mdx-code-block
|
|
</APITable>
|
|
```
|
|
|
|
:::danger IMPORTANT
|
|
If your model is not listed here but has APIs compatible with those of OpenAI, click **OpenAI-API-Compatible** on the **Model providers** page to configure your model.
|
|
:::
|
|
|
|
## Example: AI Badgr (OpenAI-compatible)
|
|
|
|
You can use **AI Badgr** with RAGFlow via the existing OpenAI-API-Compatible provider.
|
|
|
|
To configure AI Badgr:
|
|
|
|
- **Provider**: `OpenAI-API-Compatible`
|
|
- **Base URL**: `https://aibadgr.com/api/v1`
|
|
- **API Key**: your AI Badgr API key (from the AI Badgr dashboard)
|
|
- **Model**: any AI Badgr chat or embedding model ID, as exposed by AI Badgr's OpenAI-compatible APIs
|
|
|
|
AI Badgr implements OpenAI-compatible endpoints for `/v1/chat/completions`, `/v1/embeddings`, and `/v1/models`, so no additional code changes in RAGFlow are required.
|
|
|
|
:::note
|
|
The list of supported models is extracted from [this source](https://github.com/infiniflow/ragflow/blob/main/rag/llm/__init__.py) and may not be the most current. For the latest supported model list, please refer to the Python file.
|
|
:::
|