vllm/worker at e97f802b2d74861af77997691a7d1c36498f6dca - vllm

Files

Gregory Shtrasberg e97f802b2d [FP8][Kernel] Dynamic kv cache scaling factors computation (#11906 )

Signed-off-by: Gregory Shtrasberg <Gregory.Shtrasberg@amd.com>
Co-authored-by: Micah Williamson <micah.williamson@amd.com>

2025-01-23 18:04:03 +00:00

__init__.py

2024-01-21 16:31:47 -08:00

test_encoder_decoder_model_runner.py

2024-12-13 06:57:50 +00:00

test_model_input.py

2025-01-23 18:04:03 +00:00

test_model_runner.py

2024-12-13 06:57:50 +00:00

test_profile.py

2024-12-16 13:32:25 -08:00

test_swap.py

2024-11-02 07:35:05 -07:00