[Doc] Add documentation for GGUF quantization (#8618)

This commit is contained in:
Isotr0py
2024-09-20 03:15:55 +08:00
committed by GitHub
parent e42c634acb
commit ea4647b7d7
2 changed files with 74 additions and 0 deletions

View File

@ -107,6 +107,7 @@ Documentation
quantization/supported_hardware
quantization/auto_awq
quantization/bnb
quantization/gguf
quantization/int8
quantization/fp8
quantization/fp8_e5m2_kvcache