vllm/tpu at 5989f4684d62d5cb1852624ce0fd04fc08dd239b - vllm

Files

Nicolò Lucchesi 5989f4684d [TPU][V1] Fix padding recompilation when max-num-batched-tokens is not even (#16726 )

Signed-off-by: NickLucche <nlucches@redhat.com>

2025-04-17 18:09:57 +00:00

2025-04-17 18:09:57 +00:00

__init__.py

2025-03-08 08:19:38 -05:00

test_basic.py

2025-03-31 13:25:20 -04:00

test_mha_attn.py

2025-03-21 08:50:39 -07:00

test_pallas.py

2025-04-09 14:46:32 +08:00

test_perf.py

2025-03-31 13:25:20 -04:00

test_sampler.py

2025-04-10 17:05:44 -04:00

test_topk_topp_sampler.py

2025-04-02 17:18:08 -07:00