Update 0x_gemm_tutorial.md (#2090)

This commit is contained in:
botbw
2025-02-11 05:46:43 +08:00
committed by GitHub
parent 833f6990e0
commit 0642d46dd4

View File

@ -202,7 +202,7 @@ Use of static layouts has a few advantages.
As stated, the shared memory layouts can be anything that satisfy those conditions. Optimizing kernels like these is often performed by finding a good shared memory layout that provides good access patterns for both the writes to and the reads from shared memory. This includes the ability to vectorize reads and writes as well as avoid shared memory bank conflicts.
Wih the static smem layouts, the `gemm_device` kernel can allocate the required shared memory and create the smem `Tensor`s.
With the static smem layouts, the `gemm_device` kernel can allocate the required shared memory and create the smem `Tensor`s.
```cpp
// Shared memory buffers