|
|
331a1f5b3f
|
cutlass 3.9 update (#2255)
* cutlass 3.9 update
* rebase
* fixes out of shared memory for blockwise Blackwell
* doc format
* fix issue 2253
* disable host ref by default
* fix sm120 smem capacity
---------
Co-authored-by: yuzhai <yuzhai@nvidia.com>
Co-authored-by: Haicheng Wu <haichengw@nvidia.com>
|
2025-04-24 15:42:40 -04:00 |
|
|
|
b84e9802d8
|
update 3.8 v2 (#2112)
* update 3.8 v2
* update 3.8
---------
Co-authored-by: yuzhai <yuzhai@nvidia.com>
|
2025-02-19 22:03:14 -05:00 |
|
|
|
b78588d163
|
CUTLASS 3.7 (#2045)
* CUTLASS 3.7
* clean up changelog
---------
Co-authored-by: yuzhai <yuzhai@nvidia.com>
Co-authored-by: Haicheng Wu <haichengw@nvidia.com>
|
2025-01-18 09:53:07 -05:00 |
|
|
|
751eb9a885
|
Update license year (#1306)
|
2024-01-16 14:37:22 -05:00 |
|
|
|
277bd6e537
|
CUTLASS 3.0.0 (#786)
* CUTLASS 3.0.0
|
2023-01-23 20:55:28 -05:00 |
|