vllm/moe at 26b4fa45bead5d65d4e15bfaffaa52ac71bea270 - vllm

Files

Tyler Michael Smith 6e588da0f4 [Build/CI] Fix CUDA 11.8 build (#17679 )

Signed-off-by: Tyler Michael Smith <tyler@neuralmagic.com>
Signed-off-by: Lucas Wilkinson <lwilkinson@neuralmagic.com>
Signed-off-by: Tyler Michael Smith <tysmith@redhat.com>
Co-authored-by: Lucas Wilkinson <lwilkinson@neuralmagic.com>

2025-05-22 12:13:54 -07:00

test_batched_moe.py

Modularize fused experts and integrate PPLX kernels (#15956 )

2025-05-14 13:11:54 -07:00

test_cutlass_moe.py

Modularize fused experts and integrate PPLX kernels (#15956 )

2025-05-14 13:11:54 -07:00

test_moe_permute_unpermute.py

[Build/CI] Fix CUDA 11.8 build (#17679 )

2025-05-22 12:13:54 -07:00

test_moe.py

[Bugfix] Reduce moe_sum test size to avoid OOM (#18484 )

2025-05-21 06:46:39 -07:00

test_nvfp4_moe.py

[Hardware/NVIDIA/Kernel] Enable nvidia/DeepSeek-R1-FP4 Model (#16362 )

2025-05-09 16:24:41 -07:00

test_pplx_moe.py

Modularize fused experts and integrate PPLX kernels (#15956 )

2025-05-14 13:11:54 -07:00

test_rocm_aiter_topk.py

[FEAT] [ROCm] [V1]: Add AITER biased group topk for DeepSeekV3 (#17955 )

2025-05-13 22:03:47 -07:00

test_triton_moe_ptpc_fp8.py

Modularize fused experts and integrate PPLX kernels (#15956 )

2025-05-14 13:11:54 -07:00