vllm/fp8 at 0f46a780d4f53b8564a37370f9f068cdf4e69604 - vllm

Files

Wentao Ye 1b0a155534 [Perf] Using __nv_fp8_e4m3 instead of c10::e4m3 for per_token_group_quant (#21867 )

Signed-off-by: yewentao256 <zhyanwentao@126.com>

2025-07-29 21:50:46 -06:00

2025-06-15 20:05:28 -07:00

2024-08-05 16:00:01 -04:00

common.cu

2025-07-22 07:07:44 -07:00

common.cuh

2025-06-03 13:48:25 -07:00

per_token_group_quant.cu

2025-07-29 21:50:46 -06:00