This website requires JavaScript.
Explore
Help
Sign In
youngkingdom
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
Code
Issues
Pull Requests
Actions
Packages
Projects
Releases
Wiki
Activity
Files
93abf23a648051fe6dc053ba0b74499d119920bf
vllm
/
examples
History
…
..
chart-helm
…
fp8
…
production_monitoring
…
api_client.py
…
aqlm_example.py
…
cpu_offload.py
…
disaggregated_prefill.sh
…
florence2_inference.py
…
gguf_inference.py
…
gradio_openai_chatbot_webserver.py
…
gradio_webserver.py
…
llm_engine_example.py
…
logging_configuration.md
…
lora_with_quantization_inference.py
…
multilora_inference.py
…
offline_chat_with_tools.py
…
offline_inference_arctic.py
…
offline_inference_audio_language.py
…
offline_inference_chat.py
…
offline_inference_classification.py
…
offline_inference_cli.py
…
offline_inference_distributed.py
…
offline_inference_embedding.py
…
offline_inference_encoder_decoder.py
…
offline_inference_mlpspeculator.py
…
offline_inference_neuron_int8_quantization.py
…
offline_inference_neuron.py
…
offline_inference_openai.md
…
offline_inference_pixtral.py
…
offline_inference_scoring.py
…
offline_inference_structured_outputs.py
…
offline_inference_tpu.py
…
offline_inference_vision_language_embedding.py
…
offline_inference_vision_language_multi_image.py
…
offline_inference_vision_language.py
…
offline_inference_with_prefix.py
…
offline_inference_with_profiler.py
…
offline_inference.py
…
offline_profile.py
…
openai_chat_completion_client_for_multimodal.py
…
openai_chat_completion_client_with_tools.py
…
openai_chat_completion_client.py
…
openai_chat_completion_structured_outputs.py
…
openai_chat_embedding_client_for_multimodal.py
…
openai_completion_client.py
…
openai_cross_encoder_score.py
…
openai_embedding_client.py
…
openai_example_batch.jsonl
…
run_cluster.sh
…
save_sharded_state.py
…
template_alpaca.jinja
…
template_baichuan.jinja
…
template_blip2.jinja
…
template_chatglm2.jinja
…
template_chatglm.jinja
…
template_chatml.jinja
…
template_dse_qwen2_vl.jinja
…
template_falcon_180b.jinja
…
template_falcon.jinja
…
template_inkbot.jinja
…
template_llava.jinja
…
template_vlm2vec.jinja
…
tensorize_vllm_model.py
…
tool_chat_template_granite_20b_fc.jinja
…
tool_chat_template_granite.jinja
…
tool_chat_template_hermes.jinja
…
tool_chat_template_internlm2_tool.jinja
…
tool_chat_template_llama3.1_json.jinja
…
tool_chat_template_llama3.2_json.jinja
…
tool_chat_template_llama3.2_pythonic.jinja
…
tool_chat_template_mistral_parallel.jinja
…
tool_chat_template_mistral.jinja
…
tool_chat_template_toolace.jinja
…