This website requires JavaScript.
Explore
Help
Sign In
youngkingdom
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
Code
Issues
Pull Requests
Actions
Packages
Projects
Releases
Wiki
Activity
Files
770a2cf7ae2de8bbdab35f0e1aeeebac4cf2ece9
vllm
/
tests
/
v1
/
spec_decode
History
qizixi
1356ae0aa8
[spec decode] Consolidate speculative decode method name for MTP (
#25232
)
...
Signed-off-by: zixi-qi <
qizixi@meta.com
> Signed-off-by: yewentao256 <
zhyanwentao@126.com
>
2025-10-03 13:35:56 -07:00
..
test_eagle.py
[V0 deprecation] Remove _VLLM_V1 suffixes from attention backend names (
#25489
)
2025-10-03 13:35:55 -07:00
test_max_len.py
[V0 deprecation] Remove _VLLM_V1 suffixes from attention backend names (
#25489
)
2025-10-03 13:35:55 -07:00
test_mtp.py
[spec decode] Consolidate speculative decode method name for MTP (
#25232
)
2025-10-03 13:35:56 -07:00
test_ngram.py
[Spec Decode] Add Batch Parallel Ngram. Upto 8x lower overhead. (
#24986
)
2025-10-03 13:35:55 -07:00
test_tree_attention.py
[V0 deprecation] Remove _VLLM_V1 suffixes from attention backend names (
#25489
)
2025-10-03 13:35:55 -07:00