vllm/modular_kernel_tools at use-uv-python-for-docker - vllm

Files

Shu Wang 54e42b72db Support mnnvl all2allv from Flashinfer (#21003 )

Signed-off-by: Shu Wang <shuw@nvidia.com>
Signed-off-by: Shu Wang. <shuw@nvidia.com>
Signed-off-by: Tyler Michael Smith <tyler@neuralmagic.com>
Signed-off-by: Tyler Michael Smith <tlrmchlsmth@gmail.com>
Co-authored-by: Tyler Michael Smith <tyler@neuralmagic.com>
Co-authored-by: Tyler Michael Smith <tlrmchlsmth@gmail.com>

2025-09-24 14:38:16 -04:00

__init__.py

[Misc] Add unit tests for MoE ModularKernel combinations + Profiling utility (#20449 )

2025-07-11 07:51:46 -07:00

cli_args.py

[Kernel] DeepGemm MoE : Integrate triton permute / unpermute kernels (#20903 )

2025-07-17 08:10:37 +00:00

common.py

[Kernel] Delegate construction of FusedMoEQuantConfig to FusedMoEMethodBase subclasses (#22537 )

2025-09-17 17:43:31 -06:00

make_feature_matrix.py

[Kernel] Delegate construction of FusedMoEQuantConfig to FusedMoEMethodBase subclasses (#22537 )