youngkingdom/vllm
2edb533af26d2cdf7e4b7bdd3da0df11c009f654
vllm/csrc/moe
Tyler Michael Smith 6e588da0f4 [Build/CI] Fix CUDA 11.8 build (#17679)
Signed-off-by: Tyler Michael Smith <tyler@neuralmagic.com>
Signed-off-by: Lucas Wilkinson <lwilkinson@neuralmagic.com>
Signed-off-by: Tyler Michael Smith <tysmith@redhat.com>
Co-authored-by: Lucas Wilkinson <lwilkinson@neuralmagic.com>
2025-05-22 12:13:54 -07:00
Name                         Last updated                Last commit
marlin_moe_wna16             2025-05-16 16:02:44 -07:00  [Bugfix] fix an illegal memory access was encountered of marlin kernel + act_order (#18245)
permute_unpermute_kernels    2025-05-22 12:13:54 -07:00  [Build/CI] Fix CUDA 11.8 build (#17679)
moe_align_sum_kernels.cu     2025-05-14 13:11:54 -07:00  Modularize fused experts and integrate PPLX kernels (#15956)
moe_ops.h                    2025-05-22 12:13:54 -07:00  [Build/CI] Fix CUDA 11.8 build (#17679)
moe_permute_unpermute_op.cu  2025-05-22 12:13:54 -07:00  [Build/CI] Fix CUDA 11.8 build (#17679)
moe_wna16_utils.h            2025-04-29 06:46:55 -07:00  pre-commit autoupdate (#17380)
moe_wna16.cu                 2025-04-17 22:13:29 -07:00  [BugFix] Accuracy fix for llama4 int4 - improperly casted scales (#16801)
topk_softmax_kernels.cu      2025-05-14 13:11:54 -07:00  Modularize fused experts and integrate PPLX kernels (#15956)
torch_bindings.cpp           2025-05-22 12:13:54 -07:00  [Build/CI] Fix CUDA 11.8 build (#17679)