This website requires JavaScript.
Explore
Help
Sign In
youngkingdom
/
vllm
Watch
1
Star
0
Fork
0
You've already forked vllm
Code
Issues
Pull Requests
Actions
Packages
Projects
Releases
Wiki
Activity
Files
4c33d6732148fdaeb9780fa86fca1f87f2a93c19
vllm
/
docs
/
source
/
features
/
quantization
History
Reid
3a500cd0b6
[doc] miss result (
#17589
)
...
Signed-off-by: reidliu41 <
reid201711@gmail.com
> Co-authored-by: reidliu41 <
reid201711@gmail.com
>
2025-05-02 07:04:49 -07:00
..
auto_awq.md
[doc] update wrong hf model links (
#17184
)
2025-04-25 16:40:54 +00:00
bitblas.md
[doc] update wrong hf model links (
#17184
)
2025-04-25 16:40:54 +00:00
bnb.md
[doc] update wrong hf model links (
#17184
)
2025-04-25 16:40:54 +00:00
fp8.md
[doc] miss result (
#17589
)
2025-05-02 07:04:49 -07:00
gguf.md
doc: fix some typos in doc (
#16154
)
2025-04-07 05:32:06 +00:00
gptqmodel.md
[doc] update wrong model id (
#17287
)
2025-04-28 04:20:51 -07:00
index.md
[Kernel] Support Microsoft Runtime Kernel Lib for our Low Precision Computation - BitBLAS (
#6036
)
2025-04-22 09:01:36 +01:00
int4.md
[doc] add install tips (
#17373
)
2025-04-30 17:02:41 +00:00
int8.md
[doc] add install tips (
#17373
)
2025-04-30 17:02:41 +00:00
quantized_kvcache.md
[doc] add install tips (
#17373
)
2025-04-30 17:02:41 +00:00
quark.md
[doc] add install tips (
#17373
)
2025-04-30 17:02:41 +00:00
supported_hardware.md
[Bugfix] Temporarily disable gptq_bitblas on ROCm (
#17411
)
2025-04-30 19:51:45 -07:00
torchao.md
[doc] update wrong hf model links (
#17184
)
2025-04-25 16:40:54 +00:00