youngkingdom/vllm
4bb53e2dde809ea5727b8cac95a080893733a1ef
vllm/tests/models
Robert Shaw  73c8d677e5  [Kernel] Marlin Expansion: Support AutoGPTQ Models with Marlin (#3922)
Co-authored-by: alexm <alexm@neuralmagic.com>
Co-authored-by: mgoin <michael@neuralmagic.com>
2024-04-29 09:35:34 -07:00
test_aqlm.py              AQLM CUDA support (#3287)  2024-04-23 13:59:33 -04:00
test_big_models.py        [Test] Make model tests run again and remove --forked from pytest (#3631)  2024-03-28 21:06:40 -07:00
test_gptq_marlin.py       [Kernel] Marlin Expansion: Support AutoGPTQ Models with Marlin (#3922)  2024-04-29 09:35:34 -07:00
test_llava.py             [Test] Make model tests run again and remove --forked from pytest (#3631)  2024-03-28 21:06:40 -07:00
test_marlin.py            [Kernel] Marlin Expansion: Support AutoGPTQ Models with Marlin (#3922)  2024-04-29 09:35:34 -07:00
test_mistral.py           [Test] Make model tests run again and remove --forked from pytest (#3631)  2024-03-28 21:06:40 -07:00
test_models.py            [Core][5/N] Fully working chunked prefill e2e (#3884)  2024-04-10 17:56:48 -07:00
test_oot_registration.py  [Core] enable out-of-tree model register (#3871)  2024-04-06 17:11:41 -07:00
utils.py                  [Kernel] Marlin Expansion: Support AutoGPTQ Models with Marlin (#3922)  2024-04-29 09:35:34 -07:00