vllm/e2e at aeff0604bbc8e87625b7fcfebf4dff3ad83f37d2 - vllm

Files

WeiQing Chen 38c2df831a [Multimodal][Speculative Decoding]Eagle Eagle3 mm support, enablement on qwen2.5vl (#22872)

Signed-off-by: Junhong <liujunhong11@huawei.com>
Signed-off-by: Junhong Liu <98734602+LJH-LBJ@users.noreply.github.com>
Co-authored-by: Junhong <liujunhong11@huawei.com>
Co-authored-by: LJH-LBJ <98734602+LJH-LBJ@users.noreply.github.com>
Signed-off-by: yewentao256 <zhyanwentao@126.com>

2025-10-03 13:35:56 -07:00

__init__.py

[V1] Implement Cascade Attention (#11635)

2025-01-01 21:56:46 +09:00

test_cascade_attention.py

[V0 deprecation] Remove _VLLM_V1 suffixes from attention backend names (#25489)

2025-10-03 13:35:55 -07:00

test_correctness_sliding_window.py

[CI] Revert back prepare_prompts and check_answers (#25087)

2025-09-17 11:03:16 -07:00

test_kv_sharing_fast_prefill.py

Revert gemma3n fast prefill changes (#23897)