vllm/fp8 at 7728dd77bb802e1876012eb264df4d2fa2fc6f3c - vllm

Files

Wentao Ye 75d29cf4e1 [Perf] Cuda Kernel for Int8 Per Token Group Quant (#21476 )

Signed-off-by: yewentao256 <zhyanwentao@126.com>

2025-07-25 17:07:07 -07:00

2025-06-15 20:05:28 -07:00

2024-08-05 16:00:01 -04:00

common.cu

2025-07-22 07:07:44 -07:00

common.cuh

2025-06-03 13:48:25 -07:00

per_token_group_quant.cu

2025-07-25 17:07:07 -07:00