This website requires JavaScript.
Explore
Help
Sign In
youngkingdom
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
Code
Issues
Pull Requests
Actions
Packages
Projects
Releases
Wiki
Activity
Files
36fb68f94792a8cec8df5b58bab7ab4d4d6158b4
vllm
/
tests
/
core
/
block
History
leiwen83
24750f4cad
[Core] Enable prefix caching with block manager v2 enabled (
#4142
)
...
Co-authored-by: Lei Wen <
wenlei03@qiyi.com
> Co-authored-by: Sage Moore <
sagemoore@utexas.edu
>
2024-05-01 11:20:32 -07:00
..
e2e
[Core] Enable prefix caching with block manager v2 enabled (
#4142
)
2024-05-01 11:20:32 -07:00
__init__.py
[Core][Bugfix]Refactor block manager for better testability (
#3492
)
2024-03-27 23:59:28 -07:00
conftest.py
[Misc] [CI/Build] Speed up block manager CPU-only unit tests ~10x by opting-out of GPU cleanup (
#3783
)
2024-04-02 00:49:51 +00:00
test_block_manager_v2.py
[Speculative decoding 4/9] Lookahead scheduling for speculative decoding (
#3250
)
2024-04-01 22:55:24 +00:00
test_block_table.py
[Speculative decoding 4/9] Lookahead scheduling for speculative decoding (
#3250
)
2024-04-01 22:55:24 +00:00
test_common.py
[Core][Bugfix]Refactor block manager for better testability (
#3492
)
2024-03-27 23:59:28 -07:00
test_cpu_gpu_block_allocator.py
[Core][Bugfix]Refactor block manager for better testability (
#3492
)
2024-03-27 23:59:28 -07:00
test_naive_block.py
[Core][Bugfix]Refactor block manager for better testability (
#3492
)
2024-03-27 23:59:28 -07:00
test_prefix_caching_block.py
[Core] Enable prefix caching with block manager v2 enabled (
#4142
)
2024-05-01 11:20:32 -07:00