|
|
8bdbfca682
|
v4.0 update. (#2371)
|
2025-06-06 02:39:20 -04:00 |
|
|
|
2e2af190bd
|
Revert "[ex77] fix mla split; add fwd lse; add bwd varlen (#2366)" (#2370)
This reverts commit f12b1d75c9.
|
2025-06-05 23:14:57 -04:00 |
|
|
|
f12b1d75c9
|
[ex77] fix mla split; add fwd lse; add bwd varlen (#2366)
|
2025-06-05 18:39:46 -04:00 |
|
|
|
331a1f5b3f
|
cutlass 3.9 update (#2255)
* cutlass 3.9 update
* rebase
* fixes out of shared memory for blockwise Blackwell
* doc format
* fix issue 2253
* disable host ref by default
* fix sm120 smem capacity
---------
Co-authored-by: yuzhai <yuzhai@nvidia.com>
Co-authored-by: Haicheng Wu <haichengw@nvidia.com>
|
2025-04-24 15:42:40 -04:00 |
|
|
|
79fc51f4b8
|
v3.9 update (#2213)
Co-authored-by: yuzhai <yuzhai@nvidia.com>
|
2025-04-03 02:10:16 -04:00 |
|
|
|
b84e9802d8
|
update 3.8 v2 (#2112)
* update 3.8 v2
* update 3.8
---------
Co-authored-by: yuzhai <yuzhai@nvidia.com>
|
2025-02-19 22:03:14 -05:00 |
|
|
|
833f6990e0
|
v3.8.0 update (#2082)
* 3.8 update
* fix Markus' name
---------
Co-authored-by: yuzhai <yuzhai@nvidia.com>
|
2025-02-06 21:33:40 -05:00 |
|
|
|
389e493055
|
CUTLASS 3.8 Release (#2059)
* CUTLASS 3.8 Release
* update
* Update README.md
* Revert "Update README.md"
This reverts commit b353e36fe8.
* update
* update
---------
Co-authored-by: Haicheng Wu <57973641+hwu36@users.noreply.github.com>
Co-authored-by: Haicheng Wu <haichengw@nvidia.com>
|
2025-01-25 02:44:06 -05:00 |
|