|
|
8bdbfca682
|
v4.0 update. (#2371)
|
2025-06-06 02:39:20 -04:00 |
|
|
|
9354bfd7c1
|
Keep the documentation consistent with the sgemm_1.cu code. (#2285)
* Keep the documentation consistent with the sgemm_1.cu code.
* fix typo
---------
Co-authored-by: zky <zky@126.com>
|
2025-05-19 22:53:15 -04:00 |
|
|
|
5e9b8e2a25
|
fix docx (#2290)
Co-authored-by: xiayongqiang <xiayq1@chinatelecom.cn>
|
2025-05-19 22:52:37 -04:00 |
|
|
|
f115c3f854
|
Release v4.0.0 (#2294)
|
2025-05-13 15:55:29 -04:00 |
|
|
|
331a1f5b3f
|
cutlass 3.9 update (#2255)
* cutlass 3.9 update
* rebase
* fixes out of shared memory for blockwise Blackwell
* doc format
* fix issue 2253
* disable host ref by default
* fix sm120 smem capacity
---------
Co-authored-by: yuzhai <yuzhai@nvidia.com>
Co-authored-by: Haicheng Wu <haichengw@nvidia.com>
|
2025-04-24 15:42:40 -04:00 |
|
|
|
bb4dd682dd
|
Fix broken links and alt text in cluster launch control docs (#2234)
* Fix broken links in cluster launch control docs
* Improve titles and alt text
|
2025-04-21 00:01:12 -04:00 |
|
|
|
5e497243f7
|
fix: fig link in cute docs (#2216)
|
2025-04-10 14:51:41 -04:00 |
|
|
|
dd76dec4ef
|
[Doc] Make C++ code more plausible (#2156)
Co-authored-by: Haicheng Wu <haichengw@nvidia.com>
|
2025-04-10 14:35:46 -04:00 |
|
|
|
09df6ac464
|
[Doc]fix typo (#2174)
Co-authored-by: wenju.li <wenju.li@deepctr.cn>
Co-authored-by: Haicheng Wu <haichengw@nvidia.com>
|
2025-04-10 12:46:53 -04:00 |
|
|
|
79fc51f4b8
|
v3.9 update (#2213)
Co-authored-by: yuzhai <yuzhai@nvidia.com>
|
2025-04-03 02:10:16 -04:00 |
|
|
|
6f4921858b
|
v3.9 update (#2203)
* v3.9 update
* voidD
---------
Co-authored-by: yuzhai <yuzhai@nvidia.com>
|
2025-04-02 15:11:18 -04:00 |
|