This website requires JavaScript.
Explore
Help
Sign In
youngkingdom
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
Code
Issues
Pull Requests
Actions
Packages
Projects
Releases
Wiki
Activity
Files
b2c8ce57c68db0764a49d66f048b8a7a5cef9d13
vllm
/
tests
/
v1
/
e2e
History
TJian
1ee5ead5f8
[ROCm] [V1] [SpecDec] Enable Speculative Decoding on ROCm V1 Engine (
#21496
)
...
Signed-off-by: tjtanaa <
tunjian.tan@embeddedllm.com
>
2025-08-07 19:13:17 -07:00
..
__init__.py
[V1] Implement Cascade Attention (
#11635
)
2025-01-01 21:56:46 +09:00
test_cascade_attention.py
[XPU] Use spawn with XPU multiprocessing (
#20649
)
2025-07-09 00:34:28 -07:00
test_correctness_sliding_window.py
[KVCache] Make KVCacheSpec hashable (
#21791
)
2025-07-29 19:58:29 +08:00
test_kv_sharing_fast_prefill.py
Fix test_kv_sharing_fast_prefill flakiness (
#22038
)
2025-08-01 23:55:34 -07:00
test_spec_decode.py
[ROCm] [V1] [SpecDec] Enable Speculative Decoding on ROCm V1 Engine (
#21496
)
2025-08-07 19:13:17 -07:00