This website requires JavaScript.
Explore
Help
Sign In
youngkingdom
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
Code
Issues
Pull Requests
Actions
Packages
Projects
Releases
Wiki
Activity
Files
3e36fcbee642f41278a4881c9e2bfbbd7c28e607
vllm
/
tests
/
compile
/
piecewise
History
Yong Hoon Shin
4ac7713e32
Add test case for compiling multiple graphs (
#21044
)
...
Signed-off-by: Yong Hoon Shin <
yhshin@meta.com
>
2025-07-23 11:00:47 -07:00
..
__init__.py
[torch.compile] rework compile control with piecewise cudagraph (
#9715
)
2024-10-29 23:03:49 -07:00
test_full_cudagraph.py
[Feature][ROCm] Add full graph capture support for TritonAttentionBackend (
#19158
)
2025-06-17 17:03:06 -04:00
test_multiple_graphs.py
Add test case for compiling multiple graphs (
#21044
)
2025-07-23 11:00:47 -07:00
test_simple.py
[CUDA] Enable full cudagraph for FlashMLA (
#18581
)
2025-06-13 18:12:26 +00:00
test_toy_llama.py
[CUDA] Enable full cudagraph for FlashMLA (
#18581
)
2025-06-13 18:12:26 +00:00