vllm/core at e348e1027c021fc7fbe9e97fe9cdca4f5a542e11 - vllm

Files

Chen Zhang 267b4421b7 [Hybrid Allocator] Support full attention with different hidden size (#25101 )

Signed-off-by: Chen Zhang <zhangch99@outlook.com>
Signed-off-by: yewentao256 <zhyanwentao@126.com>

2025-10-03 13:35:53 -07:00

__init__.py

2025-07-14 23:01:46 -07:00

test_async_scheduler.py

2025-08-18 17:20:38 -07:00

test_encoder_cache_manager.py

2025-09-12 21:42:23 +08:00

test_kv_cache_utils.py

2025-10-03 13:35:53 -07:00

test_prefix_caching.py

2025-09-15 21:48:27 +00:00

test_scheduler_e2e.py

2025-07-21 12:18:33 +01:00

test_scheduler.py

2025-09-18 09:20:27 +00:00

test_single_type_kv_cache_manager.py

2025-09-08 21:34:37 -07:00

utils.py

2025-09-08 21:34:37 -07:00