vllm/core at 035fd2bd2cd2fb70f5834f5ca6c2ea30cdae9187 - vllm

Files

Chen Zhang 9607d5eb44 [Hybrid Allocator] Support full attention with different hidden size (#25101 )

Signed-off-by: Chen Zhang <zhangch99@outlook.com>

2025-09-19 23:43:59 -07:00

__init__.py

2025-07-14 23:01:46 -07:00

test_async_scheduler.py

2025-08-18 17:20:38 -07:00

test_encoder_cache_manager.py

2025-09-12 21:42:23 +08:00

test_kv_cache_utils.py

2025-09-19 23:43:59 -07:00

test_prefix_caching.py

2025-09-15 21:48:27 +00:00

test_scheduler_e2e.py

2025-07-21 12:18:33 +01:00

test_scheduler.py

2025-09-18 09:20:27 +00:00

test_single_type_kv_cache_manager.py

2025-09-08 21:34:37 -07:00

utils.py

2025-09-08 21:34:37 -07:00