Files
vllm/docs/deployment/frameworks/anyscale.md

655 B

Anyscale

{ #deployment-anyscale }

Anyscale is a managed, multi-cloud platform developed by the creators of Ray. It hosts Ray clusters inside your own AWS, GCP, or Azure account, delivering the flexibility of open-source Ray without the operational overhead of maintaining Kubernetes control planes, configuring autoscalers, or managing observability stacks. When serving large language models with vLLM, Anyscale can rapidly provision production-ready HTTPS endpoints or fault-tolerant batch inference jobs.