[doc] add Anything LLM integration (#17216)
Signed-off-by: reidliu41 <reid201711@gmail.com> Co-authored-by: reidliu41 <reid201711@gmail.com>
This commit is contained in:
BIN
docs/source/assets/deployment/anything-llm-chat-with-doc.png
Normal file
BIN
docs/source/assets/deployment/anything-llm-chat-with-doc.png
Normal file
Binary file not shown.
|
After Width: | Height: | Size: 118 KiB |
BIN
docs/source/assets/deployment/anything-llm-chat-without-doc.png
Normal file
BIN
docs/source/assets/deployment/anything-llm-chat-without-doc.png
Normal file
Binary file not shown.
|
After Width: | Height: | Size: 136 KiB |
BIN
docs/source/assets/deployment/anything-llm-provider.png
Normal file
BIN
docs/source/assets/deployment/anything-llm-provider.png
Normal file
Binary file not shown.
|
After Width: | Height: | Size: 110 KiB |
BIN
docs/source/assets/deployment/anything-llm-upload-doc.png
Normal file
BIN
docs/source/assets/deployment/anything-llm-upload-doc.png
Normal file
Binary file not shown.
|
After Width: | Height: | Size: 111 KiB |
47
docs/source/deployment/frameworks/anything-llm.md
Normal file
47
docs/source/deployment/frameworks/anything-llm.md
Normal file
@ -0,0 +1,47 @@
|
||||
(deployment-anything-llm)=
|
||||
|
||||
# Anything LLM
|
||||
|
||||
[Anything LLM](https://github.com/Mintplex-Labs/anything-llm) is a full-stack application that enables you to turn any document, resource, or piece of content into context that any LLM can use as references during chatting.
|
||||
|
||||
It allows you to deploy a large language model (LLM) server with vLLM as the backend, which exposes OpenAI-compatible endpoints.
|
||||
|
||||
## Prerequisites
|
||||
|
||||
- Setup vLLM environment
|
||||
|
||||
## Deploy
|
||||
|
||||
- Start the vLLM server with the supported chat completion model, e.g.
|
||||
|
||||
```console
|
||||
vllm serve Qwen/Qwen1.5-32B-Chat-AWQ --max-model-len 4096
|
||||
```
|
||||
|
||||
- Download and install [Anything LLM desktop](https://anythingllm.com/desktop).
|
||||
|
||||
- On the bottom left of open settings, AI Prooviders --> LLM:
|
||||
- LLM Provider: Generic OpenAI
|
||||
- Base URL: http://{vllm server host}:{vllm server port}/v1
|
||||
- Chat Model Name: `Qwen/Qwen1.5-32B-Chat-AWQ`
|
||||
|
||||
:::{image} /assets/deployment/anything-llm-provider.png
|
||||
:::
|
||||
|
||||
- Back to home page, New Workspace --> create `vllm` workspace, and start to chat:
|
||||
|
||||
:::{image} /assets/deployment/anything-llm-chat-without-doc.png
|
||||
:::
|
||||
|
||||
- Click the upload button:
|
||||
- upload the doc
|
||||
- select the doc and move to the workspace
|
||||
- save and embed
|
||||
|
||||
:::{image} /assets/deployment/anything-llm-upload-doc.png
|
||||
:::
|
||||
|
||||
- Chat again:
|
||||
|
||||
:::{image} /assets/deployment/anything-llm-chat-with-doc.png
|
||||
:::
|
||||
@ -3,6 +3,7 @@
|
||||
:::{toctree}
|
||||
:maxdepth: 1
|
||||
|
||||
anything-llm
|
||||
bentoml
|
||||
cerebrium
|
||||
dstack
|
||||
|
||||
Reference in New Issue
Block a user