vllm/tpu at 760e3ecc8fa0cee06eff55fe08f91f621d4e2221 - vllm

Files

Akshat Tripathi c20ef40fd0 [Hardware][TPU][V1] Multi-LoRA implementation for the V1 TPU backend (#14238 )

Signed-off-by: Akshat Tripathi <akshat@krai.ai>
Signed-off-by: Chengji Yao <chengjiyao@google.com>
Co-authored-by: Chengji Yao <chengjiyao@google.com>

2025-05-07 16:28:47 -04:00

lora

[Hardware][TPU][V1] Multi-LoRA implementation for the V1 TPU backend (#14238 )

2025-05-07 16:28:47 -04:00

__init__.py

[torch.compile] avoid Dynamo guard evaluation overhead (#7898 )

2024-08-28 16:10:12 -07:00

test_compilation.py

[TPU][V1] Refine tpu_model_runner to mitigate future recompilation issues (#16275 )

2025-04-09 18:51:51 -06:00

test_custom_dispatcher.py

[V1] TPU - Fix CI/CD runner (#14974 )

2025-03-17 21:07:07 +00:00

test_moe_pallas.py

[TPU] Add kernel test for moe_pallas (#17496 )

2025-05-06 17:59:57 -07:00

test_quantization_accuracy.py

Correct capitalisation: VLLM -> vLLM (#14562 )

2025-03-10 16:36:21 +00:00