vllm/worker at a490aafa3671da1b6b2be6cff4568913fcb1732c - vllm - Gitea: Git with a cup of tea

youngkingdom/vllm

Files

History

Woosuk Kwon 12659a0bd7 Add CUDA graph-based all reduce launcher (#26 )

2023-04-05 11:16:57 -07:00

..

cache_engine.py

Support tensor parallel (#2 )

2023-03-21 13:45:42 -07:00

controller.py

Add CUDA graph-based all reduce launcher (#26 )

2023-04-05 11:16:57 -07:00

worker.py

Add CUDA graph-based all reduce launcher (#26 )

2023-04-05 11:16:57 -07:00