vllm/serving at ed2e464653ab552c54c6da5c2b69d31bd61ba765 - vllm

Files

Nicolò Lucchesi 2ef0dc53b8 [Frontend] Add sampling params to v1/audio/transcriptions endpoint (#16591 )

Signed-off-by: Jannis Schönleber <joennlae@gmail.com>
Signed-off-by: NickLucche <nlucches@redhat.com>
Co-authored-by: Jannis Schönleber <joennlae@gmail.com>

2025-04-19 07:03:54 +00:00

integrations

[Doc] Convert docs to use colon fences (#12471 )

2025-01-29 11:38:29 +08:00

distributed_serving.md

[Doc] Clarify run vllm only on one node in distributed inference (#15148 )

2025-03-20 09:55:59 +08:00

engine_args.md

[Doc] Update docs on handling OOM (#15357 )

2025-03-24 14:29:34 -07:00

env_vars.md

[Doc] Convert docs to use colon fences (#12471 )

2025-01-29 11:38:29 +08:00

metrics.md

[V1][Metrics] Updated list of deprecated metrics in v0.8 (#14695 )

2025-03-15 00:45:25 +08:00

multimodal_inputs.md

Improve-mm-and-pooler-and-decoding-configs (#16789 )

2025-04-17 22:13:32 -07:00

offline_inference.md

[Doc] Add more tips to avoid OOM (#16765 )

2025-04-17 09:54:34 +00:00

openai_compatible_server.md

[Frontend] Add sampling params to v1/audio/transcriptions endpoint (#16591 )

2025-04-19 07:03:54 +00:00

usage_stats.md

[Docs] update usage stats language (#15898 )

2025-04-01 12:54:13 -07:00