This website requires JavaScript.
Explore
Help
Sign In
youngkingdom
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
Code
Issues
Pull Requests
Actions
Packages
Projects
Releases
Wiki
Activity
Files
a59cd9d9f7fd89e19beeffb7e7f89437d413eafb
vllm
/
tests
/
v1
/
e2e
History
zhiweiz
9e0726e5bf
[Meta] Official Eagle mm support, first enablement on llama4 (
#20788
)
...
Signed-off-by: morgendave <
morgendave@gmail.com
> Co-authored-by: Roger Wang <
hey@rogerw.me
>
2025-07-31 10:35:07 -07:00
..
__init__.py
[V1] Implement Cascade Attention (
#11635
)
2025-01-01 21:56:46 +09:00
test_cascade_attention.py
[XPU] Use spawn with XPU multiprocessing (
#20649
)
2025-07-09 00:34:28 -07:00
test_correctness_sliding_window.py
[KVCache] Make KVCacheSpec hashable (
#21791
)
2025-07-29 19:58:29 +08:00
test_kv_sharing_fast_prefill.py
Override attention metadata for fast prefill in some KV sharing setups (
#21590
)
2025-07-30 08:54:15 -07:00
test_spec_decode.py
[Meta] Official Eagle mm support, first enablement on llama4 (
#20788
)
2025-07-31 10:35:07 -07:00