vllm/README.md at cda10fa3e2bb69ea276d663e5369ba16ec42cebb

Harry Mellor a1fe24d961 Migrate docs from Sphinx to MkDocs (#18145)

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

2025-05-23 02:09:53 -07:00

Quantization

{ #quantization-index }

Quantization trades model precision for a smaller memory footprint, allowing large models to run on a wider range of devices.
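
To give a rough sense of the memory savings, the sketch below estimates the storage needed for a model's weights at different bit widths. It is back-of-the-envelope arithmetic, not a vLLM API: the helper `weight_memory_gib` and the 7-billion-parameter model size are hypothetical, chosen only for illustration. The estimate also ignores per-group scales and zero points, activations, and the KV cache, so real footprints will be somewhat larger.

```python
def weight_memory_gib(num_params: int, bits_per_weight: int) -> float:
    """Approximate memory for the weights alone, in GiB.

    Hypothetical helper for illustration; it ignores quantization
    metadata (scales, zero points), activations, and the KV cache.
    """
    total_bytes = num_params * bits_per_weight / 8
    return total_bytes / 2**30


# Assumed model size, used only as an example.
NUM_PARAMS = 7_000_000_000

for label, bits in [("16-bit", 16), ("8-bit", 8), ("4-bit", 4)]:
    print(f"{label}: {weight_memory_gib(NUM_PARAMS, bits):.1f} GiB")
```

Under these assumptions, halving the bits per weight halves the weight storage, which is why lower-precision formats let the same model fit on devices with less memory.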

Contents: