youngkingdom/vllm
05e1fbfc52ca575e6539de63dbb5fab929683162
vllm/docs/source/features/quantization
Reid 9c1244de57 [doc] update to hyperlink (#17096)
Signed-off-by: reidliu41 <reid201711@gmail.com>
Co-authored-by: reidliu41 <reid201711@gmail.com>
2025-04-24 00:58:08 -07:00
auto_awq.md | [doc] update to hyperlink (#17096) | 2025-04-24 00:58:08 -07:00
bitblas.md | [Doc] Add top anchor and a note to quantization/bitblas.md (#17042) | 2025-04-23 07:32:16 -07:00
bnb.md | [Misc] Auto detect bitsandbytes pre-quantized models (#16027) | 2025-04-04 23:30:45 -07:00
fp8.md | [Doc] Convert docs to use colon fences (#12471) | 2025-01-29 11:38:29 +08:00
gguf.md | doc: fix some typos in doc (#16154) | 2025-04-07 05:32:06 +00:00
gptqmodel.md | [Docs] Add GPTQModel (#14056) | 2025-03-03 21:59:09 +00:00
index.md | [Kernel] Support Microsoft Runtime Kernel Lib for our Low Precision Computation - BitBLAS (#6036) | 2025-04-22 09:01:36 +01:00
int4.md | [Doc] int4 w4a16 example (#12585) | 2025-01-31 15:38:48 -08:00
int8.md | [Doc] int4 w4a16 example (#12585) | 2025-01-31 15:38:48 -08:00
quantized_kvcache.md | [FP8][Kernel] Dynamic kv cache scaling factors computation (#11906) | 2025-01-23 18:04:03 +00:00
quark.md | [Doc] Quark quantization documentation (#15861) | 2025-04-01 08:32:45 -07:00
supported_hardware.md | [Kernel] Support Microsoft Runtime Kernel Lib for our Low Precision Computation - BitBLAS (#6036) | 2025-04-22 09:01:36 +01:00
torchao.md | Torchao (#14231) | 2025-04-07 19:39:28 -04:00