CUTLASS 3.2 (#1024)

* CUTLASS 3.2
2023-08-07 14:50:32 -10:00
parent a0d787b746
commit 4575443d44
392 changed files with 47559 additions and 7940 deletions
--- a/examples/python/README.md
+++ b/examples/python/README.md
@ -12,3 +12,7 @@

    Shows how to declare, compile, and run a grouped GEMM operation via the Python interface,
    along with how the emitted kernel can be easily exported to a PyTorch CUDA extension.
+
+* [03_basic_conv2d](/examples/python/03_basic_conv2d.ipynb)
+
+    Shows how to declare, configure, compile, and run a CUTLASS Conv2d using the Python interface