[DOC] Fix path of v1 related figures (#21868)

Signed-off-by: Chen Zhang <zhangch99@outlook.com>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Chen Zhang
2025-07-29 19:45:18 -07:00
committed by GitHub
parent ba5c5e5404
commit 76080cff79
15 changed files with 12 additions and 12 deletions

@@ -47,7 +47,7 @@ This initial compilation time ranges significantly and is impacted by many of th
 #### max model len vs. most model len
-![most_model_len](../assets/design/v1/tpu/most_model_len.png)
+![most_model_len](../assets/design/tpu/most_model_len.png)
 If most of your requests are shorter than the maximum model length but you still need to accommodate occasional longer requests, setting a high maximum model length can negatively impact performance. In these cases, you can try introducing most model len by specifying the `VLLM_TPU_MOST_MODEL_LEN` environment variable.