vllm/triton.md at 26b4fa45bead5d65d4e15bfaffaa52ac71bea270

Files

Harry Mellor a1fe24d961 Migrate docs from Sphinx to MkDocs (#18145 )

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

2025-05-23 02:09:53 -07:00

title

title
NVIDIA Triton

{ #deployment-triton }

The Triton Inference Server hosts a tutorial demonstrating how to quickly deploy a simple facebook/opt-125m model using vLLM. Please see Deploying a vLLM model in Triton for more details.