Logo
Explore Help
Sign In
youngkingdom/vllm
1
0
Fork 0
You've already forked vllm
Code Issues Pull Requests Actions Packages Projects Releases Wiki Activity
Files
6881107948c00a8564bc2fa85308f6fc2f065d64
vllm/csrc/moe
History
Tyler Michael Smith 6e588da0f4 [Build/CI] Fix CUDA 11.8 build (#17679)
Signed-off-by: Tyler Michael Smith <tyler@neuralmagic.com>
Signed-off-by: Lucas Wilkinson <lwilkinson@neuralmagic.com>
Signed-off-by: Tyler Michael Smith <tysmith@redhat.com>
Co-authored-by: Lucas Wilkinson <lwilkinson@neuralmagic.com>
2025-05-22 12:13:54 -07:00
..
marlin_moe_wna16
[Bugfix] fix an illegal memory access was encountered of marlin kernel + act_order (#18245)
2025-05-16 16:02:44 -07:00
permute_unpermute_kernels
[Build/CI] Fix CUDA 11.8 build (#17679)
2025-05-22 12:13:54 -07:00
moe_align_sum_kernels.cu
Modularize fused experts and integrate PPLX kernels (#15956)
2025-05-14 13:11:54 -07:00
moe_ops.h
[Build/CI] Fix CUDA 11.8 build (#17679)
2025-05-22 12:13:54 -07:00
moe_permute_unpermute_op.cu
[Build/CI] Fix CUDA 11.8 build (#17679)
2025-05-22 12:13:54 -07:00
moe_wna16_utils.h
pre-commit autoupdate (#17380)
2025-04-29 06:46:55 -07:00
moe_wna16.cu
[BugFix] Accuracy fix for llama4 int4 - improperly casted scales (#16801)
2025-04-17 22:13:29 -07:00
topk_softmax_kernels.cu
Modularize fused experts and integrate PPLX kernels (#15956)
2025-05-14 13:11:54 -07:00
torch_bindings.cpp
[Build/CI] Fix CUDA 11.8 build (#17679)
2025-05-22 12:13:54 -07:00
Powered by Gitea Version: 1.24.2 Page: 125ms Template: 5ms
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API