|
|
fd6cfe1ed0
|
v4.1 release update v2. (#2481)
|
2025-07-21 22:03:55 -04:00 |
|
|
|
b78588d163
|
CUTLASS 3.7 (#2045)
* CUTLASS 3.7
* clean up changelog
---------
Co-authored-by: yuzhai <yuzhai@nvidia.com>
Co-authored-by: Haicheng Wu <haichengw@nvidia.com>
|
2025-01-18 09:53:07 -05:00 |
|
|
|
3d261a5974
|
3.6.0 update (#2005)
* 3.6.0 update
* doc and swap stuff
---------
Co-authored-by: yuzhai <yuzhai@nvidia.com>
Co-authored-by: Haicheng Wu <haichengw@nvidia.com>
|
2024-12-25 01:34:40 -05:00 |
|
|
|
751eb9a885
|
Update license year (#1306)
|
2024-01-16 14:37:22 -05:00 |
|
|
|
557be3ab0e
|
Fix several typos (#1169)
Co-authored-by: isaacw <isaacw@nvidia.com>
|
2023-11-02 23:54:46 -04:00 |
|
|
|
90d3b0fb18
|
CUTLASS 3.2.1 (#1113)
* Updates for 3.2.1 release.
* Minor fix in gemm op profiler for raster order.
* Add scheduler mapping for raster order in the kernels.
|
2023-09-26 17:24:26 -04:00 |
|
|
|
8783c41851
|
Replace 0x1f with 0xffffffff in __shfl_sync (#1097)
This fixes compatibility with H100 and resolves #1094
|
2023-09-18 19:58:19 -04:00 |
|
|
|
7e370c9637
|
Fix typos 2 (#842)
Co-authored-by: Haicheng Wu <57973641+hwu36@users.noreply.github.com>
|
2023-03-09 23:22:56 -05:00 |
|
|
|
66d9cddc83
|
New updates for 2.11 (#775)
* New updates.
* Minor profiler updates
Co-authored-by: Aniket Shivam <ashivam@nvidia.com>
|
2023-01-20 16:32:57 -05:00 |
|
|
|
c975e2ccbb
|
releaase 2.11 (#703)
|
2022-11-19 09:02:15 -05:00 |
|