Logo
Explore Help
Sign In
youngkingdom/vllm
1
0
Fork 0
You've already forked vllm
Code Issues Pull Requests Actions Packages Projects Releases Wiki Activity
Files
54814fd85b5182fc140febfebbb2560420d2ed2a
vllm/tests/distributed
History
Lily Liu 7041de4384 [Kernel] Flashinfer for prefill & decode, with Cudagraph support for decode (#4628)
Co-authored-by: LiuXiaoxuanPKU <llilyliupku@gmail.com>, bong-furiosa <bongwon.jang@furiosa.ai>
2024-06-28 15:28:49 -07:00
..
__init__.py
[CI/Build] Move test_utils.py to tests/utils.py (#4425)
2024-05-13 23:50:09 +09:00
test_basic_distributed_correctness.py
[Kernel] Flashinfer for prefill & decode, with Cudagraph support for decode (#4628)
2024-06-28 15:28:49 -07:00
test_chunked_prefill_distributed.py
[CI/Test] improve robustness of test (vllm_runner) (#5357)
2024-06-08 08:59:20 +00:00
test_comm_ops.py
[Distributed] Add send and recv helpers (#5719)
2024-06-23 14:42:28 -07:00
test_custom_all_reduce.py
[Distributed] Add send and recv helpers (#5719)
2024-06-23 14:42:28 -07:00
test_parallel_state.py
[Distributed] Make it clear that % should not be in tensor dict keys. (#5927)
2024-06-28 15:20:22 +00:00
test_pynccl.py
[Distributed] Add send and recv helpers (#5719)
2024-06-23 14:42:28 -07:00
test_same_node.py
[Core][Distributed] add same-node detection (#5369)
2024-06-11 10:53:59 -07:00
test_shm_broadcast.py
[bugfix][distributed] fix shm broadcast when the queue size is full (#5801)
2024-06-25 21:56:02 -07:00
test_utils.py
[Hardware][AMD][CI/Build][Doc] Upgrade to ROCm 6.1, Dockerfile improvements, test fixes (#5422)
2024-06-25 15:56:15 -07:00
Powered by Gitea Version: 1.24.2 Page: 111ms Template: 5ms
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API