fix typo (#2136)

Co-authored-by: XiaoDong <xiaod@nvidia.com>
2025-03-18 10:19:43 +08:00
parent 6c6b78550e
commit bd03b22f64
1 changed files with 1 additions and 1 deletions
--- a/media/docs/grouped_scheduler.md
+++ b/media/docs/grouped_scheduler.md
@ -340,7 +340,7 @@ Thus, there are 216 tiles across the group.
 Suppose this grouped GEMM is run on GA100, which has 108 SMs. Suppose that
 the occupancy given the parameters of the grouped GEMM is one -- one threadblock
 can be active at a time on an SM. The grouped GEMM will, thus, run with 108
-persistent threadblocks, each of which computes (256 / 108) = 2 tiles.
+persistent threadblocks, each of which computes (216 / 108) = 2 tiles.

 Under the round-robin assignment of tiles to threadblocks employed by
 the grouped GEMM scheduler, the assignment of tiles to threadblocks