ragflow/vllm.json at 59bb184e63e47b7cb706a104f8cc3096bca02821 - ragflow - Gitea: Git with a cup of tea

youngkingdom/ragflow

mirror of https://github.com/infiniflow/ragflow.git synced 2026-06-01 13:27:54 +08:00

Files

Jin Hai bb05a8bd7e Update create model instance command (#14441 )

### What problem does this PR solve?

1. support command:

```
RAGFlow(user)> create provider 'vllm' instance 'test' key 'test-key' url 'base-url' region 'abc';
SUCCESS
RAGFlow(user)> list instances from 'vllm';
+----------+----------------------------------------+----------------------------------+--------------+----------------------------------+--------+
| apiKey   | extra                                  | id                               | instanceName | providerID                       | status |
+----------+----------------------------------------+----------------------------------+--------------+----------------------------------+--------+
| test-key | {"base_url":"base-url","region":"abc"} | 40213c89430311f1a7cf38a74640adcc | test         | b4d40e6142d311f1a4f938a74640adcc | enable |
+----------+----------------------------------------+----------------------------------+--------------+----------------------------------+--------+
```
2. support add vllm model
```
RAGFlow(user)> add model 'Qwen/Qwen2-0.5B' to provider 'vllm' instance 'test' with tokens 131072 chat;
SUCCESS
```
3. add vllm chat

### Type of change

- [x] New Feature (non-breaking change which adds functionality)
- [x] Refactoring

---------

Signed-off-by: Jin Hai <haijin.chn@gmail.com>

2026-04-29 17:05:08 +08:00

8 lines

118 B

JSON

Raw Blame History

 {
   "name": "vllm",
   "url_suffix": {
     "chat": "chat/completions",
     "models": "models"
   },
   "class": "local"
 }