youngkingdom/vllm
26b4fa45bead5d65d4e15bfaffaa52ac71bea270
vllm/tests/models/quantization
Isotr0py 1f1b1bc03b [V1][Quantization] Add CUDA graph compatible v1 GGUF support (#18646)
Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn>
Signed-off-by: Isotr0py <2037008807@qq.com>
2025-05-27 04:40:28 +00:00
File | Last commit | Date
__init__.py | [CI/Build] Reorganize models tests (#17459) | 2025-04-30 23:03:08 -07:00
test_aqlm.py | [ROCm] Skip tests for quantizations incompatible with ROCm (#17905) | 2025-05-12 18:39:28 -06:00
test_awq.py | [Misc] Rename assets for testing (#17575) | 2025-05-02 03:29:25 -07:00
test_bitblas.py | [Misc] Clean up test docstrings and names (#17521) | 2025-05-01 05:19:32 -07:00
test_fp8.py | [ROCm] Skip tests for quantizations incompatible with ROCm (#17905) | 2025-05-12 18:39:28 -06:00
test_gguf.py | [V1][Quantization] Add CUDA graph compatible v1 GGUF support (#18646) | 2025-05-27 04:40:28 +00:00
test_gptq_bitblas.py | [Misc] Clean up test docstrings and names (#17521) | 2025-05-01 05:19:32 -07:00
test_gptq_marlin_24.py | [ROCm] Skip tests for quantizations incompatible with ROCm (#17905) | 2025-05-12 18:39:28 -06:00
test_gptq_marlin.py | [ROCm] Skip tests for quantizations incompatible with ROCm (#17905) | 2025-05-12 18:39:28 -06:00
test_modelopt.py | [CI/Build] Reorganize models tests (#17459) | 2025-04-30 23:03:08 -07:00
test_mxfp4.py | [Quantization] Quark MXFP4 format loading (#16943) | 2025-05-07 15:05:05 -04:00
test_nvfp4.py | [Minor] Rename quantization nvfp4 to modelopt_fp4 (#18356) | 2025-05-20 09:08:37 -07:00