This website requires JavaScript.
Explore
Help
Sign In
youngkingdom
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
Code
Issues
Pull Requests
Actions
Packages
Projects
Releases
Wiki
Activity
Files
85fee74b337522f7e0807fc100b9e00682ff45e1
vllm
/
csrc
/
quantization
/
gptq
History
Xiangyu Li
5cc6bddb6e
[Kernel] Add GPTQv2 format support for low-bit or asymmetric quantization, by adapting gptq_gemm (
#26092
)
2025-10-23 23:26:13 -04:00
..
compat.cuh
[CI/Build] Enforce style for C++ and CUDA code with
clang-format
(
#4722
)
2024-05-22 07:18:41 +00:00
matrix_view.cuh
[CI/Build] Enforce style for C++ and CUDA code with
clang-format
(
#4722
)
2024-05-22 07:18:41 +00:00
q_gemm.cu
[Kernel] Add GPTQv2 format support for low-bit or asymmetric quantization, by adapting gptq_gemm (
#26092
)
2025-10-23 23:26:13 -04:00
qdq_2.cuh
[CI/Build] Enforce style for C++ and CUDA code with
clang-format
(
#4722
)
2024-05-22 07:18:41 +00:00
qdq_3.cuh
[CI/Build] Enforce style for C++ and CUDA code with
clang-format
(
#4722
)
2024-05-22 07:18:41 +00:00
qdq_4.cuh
[CI/Build] Enforce style for C++ and CUDA code with
clang-format
(
#4722
)
2024-05-22 07:18:41 +00:00
qdq_8.cuh
[CI/Build] Enforce style for C++ and CUDA code with
clang-format
(
#4722
)
2024-05-22 07:18:41 +00:00
qdq_util.cuh
[CI/Build] Enforce style for C++ and CUDA code with
clang-format
(
#4722
)
2024-05-22 07:18:41 +00:00