youngkingdom/vllm

Files

History

Ricardo Decal e290594072 [Docs] Rename “Distributed inference and serving” to “Parallelism & Scaling” (#22466 )

Signed-off-by: Ricardo Decal <rdecal@anyscale.com>

2025-08-08 19:26:21 +00:00

..

faq.md

Stop using title frontmatter and fix doc that can only be reached by search (#20623 )

2025-07-08 03:27:40 -07:00

metrics.md

Remove unnecessary explicit title anchors and use relative links instead (#20620 )

2025-07-08 02:49:13 -07:00

README.md

[Doc] Reorganize user guide (#18661 )

2025-05-24 07:25:33 -07:00

reproducibility.md

[Doc] Update reproducibility doc and example (#18741 )

2025-05-27 07:03:13 +00:00

security.md

[Docs] Switch to better markdown linting pre-commit hook (#21851 )

2025-07-29 19:45:08 -07:00

troubleshooting.md

[Docs] Rename “Distributed inference and serving” to “Parallelism & Scaling” (#22466 )

2025-08-08 19:26:21 +00:00

usage_stats.md

Make distinct code and console admonitions so readers are less likely to miss them (#20585 )

2025-07-07 19:55:28 -07:00

v1_guide.md

[v1] - Mamba1 Attention Metadata (#21249 )

2025-08-06 17:03:42 -07:00

README.md

Using vLLM

vLLM supports the following usage patterns:

Inference and Serving: Run a single instance of a model.
Deployment: Scale up model instances for production.
Training: Train or fine-tune a model.