vllm/e2e at use-uv-python-for-docker - vllm - Gitea: Git with a cup of tea

Files

WeiQing Chen f1d53d150c [Multimodal][Speculative Decoding]Eagle Eagle3 mm support, enablement on qwen2.5vl (#22872 )

Signed-off-by: Junhong <liujunhong11@huawei.com>
Signed-off-by: Junhong Liu <98734602+LJH-LBJ@users.noreply.github.com>
Co-authored-by: Junhong <liujunhong11@huawei.com>
Co-authored-by: LJH-LBJ <98734602+LJH-LBJ@users.noreply.github.com>

2025-09-27 03:35:47 +00:00

__init__.py

[V1] Implement Cascade Attention (#11635 )

2025-01-01 21:56:46 +09:00

test_cascade_attention.py

[V0 deprecation] Remove _VLLM_V1 suffixes from attention backend names (#25489 )

2025-09-25 17:37:50 +00:00

test_correctness_sliding_window.py

[CI] Revert back prepare_prompts and check_answers (#25087 )

2025-09-17 11:03:16 -07:00

test_kv_sharing_fast_prefill.py

Revert gemma3n fast prefill changes (#23897 )

2025-08-29 12:16:57 -07:00

test_min_tokens.py

[CI] Add end-to-end V1 min_tokens test coverage (#22495 )

2025-08-21 22:04:07 -06:00

test_spec_decode.py

[Multimodal][Speculative Decoding]Eagle Eagle3 mm support, enablement on qwen2.5vl (#22872 )

2025-09-27 03:35:47 +00:00