vllm/features at 182f40ea8b5981864b23e08bb2a5aafc5800e976 - vllm - Gitea: Git with a cup of tea

youngkingdom/vllm

Files

History

Zhiyu 182f40ea8b Add NVIDIA TensorRT Model Optimizer in vLLM documentation (#17561 )

2025-05-02 11:36:46 -07:00

..

Add NVIDIA TensorRT Model Optimizer in vLLM documentation (#17561 )

2025-05-02 11:36:46 -07:00

automatic_prefix_caching.md

[Doc] Convert docs to use colon fences (#12471 )

2025-01-29 11:38:29 +08:00

compatibility_matrix.md

[Doc]: Improve feature tables (#13224 )

2025-02-18 18:52:39 +08:00

disagg_prefill.md

[Doc] Add two links to disagg_prefill.md (#17168 )

2025-04-25 10:23:57 +00:00

lora.md

[Misc] Enable vLLM to Dynamically Load LoRA from a Remote Server (#10546 )

2025-04-15 22:31:38 +00:00

reasoning_outputs.md

[Feature][Frontend]: Deprecate --enable-reasoning (#17452 )

2025-05-01 06:46:16 -07:00

spec_decode.md

[V1][Spec Decode] Remove deprecated spec decode config params (#15466 )

2025-03-31 09:19:35 -07:00

structured_outputs.md

[Docs] Update structured output doc for V1 (#17135 )

2025-04-26 15:12:18 +00:00

tool_calling.md

Add chat template for Llama 4 models (#16428 )

2025-04-24 20:19:36 +00:00