vllm/.buildkite at f09edd8a25d54c48eb804abe391e98d0b85b9ea2 - vllm - Gitea: Git with a cup of tea

youngkingdom/vllm

Files

History

Simon Mo f09edd8a25 Add JSON output support for benchmark_latency and benchmark_throughput (#4848 )

2024-05-16 10:02:56 -07:00

..

check-wheel-size.py

[Kernel] Refactor FP8 kv-cache with NVIDIA float8_e4m3 support (#4535 )

2024-05-09 18:04:17 -06:00

download-images.sh

[Feature] Add vision language model support. (#3042 )

2024-03-25 14:16:30 -07:00

run-amd-test.sh

[Build/CI] Fixing 'docker run' to re-enable AMD CI tests. (#4642 )

2024-05-07 09:23:17 -07:00

run-benchmarks.sh

Add JSON output support for benchmark_latency and benchmark_throughput (#4848 )

2024-05-16 10:02:56 -07:00

run-cpu-test.sh

[HotFix] [CI/Build] Minor fix for CPU backend CI (#3787 )

2024-04-01 22:50:53 -07:00

run-neuron-test.sh

[CI] clean docker cache for neuron (#4441 )

2024-04-28 23:32:07 +00:00

test-pipeline.yaml

[Speculative decoding][Re-take] Enable TP>1 speculative decoding (#4840 )

2024-05-16 00:53:51 -07:00

test-template.j2

[Build/CI] Fixing 'docker run' to re-enable AMD CI tests. (#4642 )

2024-05-07 09:23:17 -07:00