CUTLASS 3.2 (#1024)

* CUTLASS 3.2
This commit is contained in:
ANIKET SHIVAM
2023-08-07 14:50:32 -10:00
committed by GitHub
parent a0d787b746
commit 4575443d44
392 changed files with 47559 additions and 7940 deletions

View File

@ -12,3 +12,7 @@
Shows how to declare, compile, and run a grouped GEMM operation via the Python interface,
along with how the emitted kernel can be easily exported to a PyTorch CUDA extension.
* [03_basic_conv2d](/examples/python/03_basic_conv2d.ipynb)
Shows how to declare, configure, compile, and run a CUTLASS Conv2d using the Python interface