youngkingdom/vllm
4172235ab78b09989fb56edaf734dbee283dda3e
vllm/.buildkite
Woosuk Kwon 4172235ab7 [V0 deprecation] Deprecate V0 Neuron backend (#21159)
Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
2025-09-06 16:15:18 -07:00
Name | Last commit | Date
lm-eval-harness | [Deprecation] Remove prompt_token_ids arg fallback in LLM.generate and LLM.embed (#18800) | 2025-08-22 10:56:57 +08:00
nightly-benchmarks | Adding int4 and int8 models for CPU benchmarking (#23709) | 2025-09-05 20:08:50 +08:00
scripts | [V0 deprecation] Deprecate V0 Neuron backend (#21159) | 2025-09-06 16:15:18 -07:00
check-wheel-size.py | [Attention] FlashAttn MLA (#14258) | 2025-09-04 02:47:59 -07:00
generate_index.py | [ci/build] Fix abi tag for aarch64 (#23329) | 2025-08-21 23:32:55 +08:00
pyproject.toml | [Doc] Move examples and further reorganize user guide (#18666) | 2025-05-26 07:38:04 -07:00
release-pipeline.yaml | [V0 deprecation] Deprecate V0 Neuron backend (#21159) | 2025-09-06 16:15:18 -07:00
test-pipeline.yaml | [Feature] Support Decode Context Parallel (DCP) for MLA (#23734) | 2025-09-06 13:24:05 +08:00