This website requires JavaScript.
Explore
Help
Sign In
youngkingdom
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
Code
Issues
Pull Requests
Actions
Packages
Projects
Releases
Wiki
Activity
Files
b2c8ce57c68db0764a49d66f048b8a7a5cef9d13
vllm
/
tests
/
v1
/
spec_decode
History
TJian
1ee5ead5f8
[ROCm] [V1] [SpecDec] Enable Speculative Decoding on ROCm V1 Engine (
#21496
)
...
Signed-off-by: tjtanaa <
tunjian.tan@embeddedllm.com
>
2025-08-07 19:13:17 -07:00
..
test_eagle.py
[ROCm] [V1] [SpecDec] Enable Speculative Decoding on ROCm V1 Engine (
#21496
)
2025-08-07 19:13:17 -07:00
test_max_len.py
[ROCm] [V1] [SpecDec] Enable Speculative Decoding on ROCm V1 Engine (
#21496
)
2025-08-07 19:13:17 -07:00
test_ngram.py
Remove
from_dict
from
SpeculativeConfig
(
#22451
)
2025-08-07 10:13:04 -07:00
test_tree_attention.py
[V1] reduce block size for tree attention correctness test to fix 'ou… (
#22207
)
2025-08-04 19:11:06 -07:00