Commit Graph
-
8afb19d904
update CITATION.cff
main
Haicheng Wu
2025-10-28 23:42:37 -04:00 -
b2ca083d2b
Fixed compilation error when using StreamK scheduler + PDL. (#2686)
Qi Yuhang
2025-10-22 11:11:14 +08:00 -
b1d6e2c9b3
v4.3 update. (#2709)
Junkai-Wu
2025-10-22 02:26:30 +08:00 -
e6e2cc29f5
fix (#2684)
Lain
2025-10-15 11:46:38 -07:00 -
6aa1894093
Enable mxfp8-mxfp4 group gemm on cutlass
feature/enable-mxfp-group-gemm-sm120
Faraz Khoubsirat
2025-09-25 00:32:11 +00:00 -
f3fde58372
Update pyproject.toml
v4.2.1
release/4.2
Haicheng Wu
2025-09-24 01:19:30 -04:00 -
c6aeb9179c
Update pyproject.toml
Haicheng Wu
2025-09-24 01:18:51 -04:00 -
a8749e67ba
Update CHANGELOG.md
Haicheng Wu
2025-09-23 17:33:42 -04:00 -
95a5ff14c0
Update CHANGELOG.md
Haicheng Wu
2025-09-23 17:33:00 -04:00 -
c609b86db2
Feature/add bottom causal mask (#2480)
Aya Z. Ibrahim
2025-09-18 14:11:23 -07:00 -
177a82e251
Rename python/cutlass to python/cutlass_cppgen (#2652)
Jack Kosaian
2025-09-18 13:26:57 -05:00 -
4260d4aef9
4.2.1 update
Haicheng Wu
2025-09-23 13:45:13 -07:00 -
fb8b43ef05
Merge pull request #2669 from NVIDIA/421_update
ANIKET SHIVAM
2025-09-23 14:02:29 -07:00 -
f874df19ac
4.2.1 update
Haicheng Wu
2025-09-23 13:45:13 -07:00 -
ee914c3cec
v4.2.1 update. (#2667)
Junkai-Wu
2025-09-24 02:25:14 +08:00 -
7a6d4ee099
v4.2.1 update. (#2666)
Junkai-Wu
2025-09-24 01:25:43 +08:00 -
2b8dff1f90
Fix bfloat16 epsilon (#2607)
GTO
2025-09-22 06:43:59 +03:00 -
fd0312ddf6
Remove duplicate function calls (#1584)
103yiran
2025-09-22 11:16:59 +08:00 -
64579189ec
Feature/add bottom causal mask (#2480)
Aya Z. Ibrahim
2025-09-18 14:11:23 -07:00 -
b234a8c024
Rename python/cutlass to python/cutlass_cppgen (#2652)
Jack Kosaian
2025-09-18 13:26:57 -05:00 -
59b61c606f
add support matrix
v4.2.0
Haicheng Wu
2025-09-17 20:20:50 -07:00 -
6b73aedb11
Fxied a typo in pipeline descript docs. (#2623)
wbn
2025-09-16 10:32:27 +08:00 -
ebf5e5effd
Fix: a calculation error in the example of dividing out in the 02_layout_algebra doc (#2635)
Asuka
2025-09-16 10:31:33 +08:00 -
df3923b0bb
Fix doc cute 03_tensor.md link typo (#2627)
Wanshe
2025-09-16 10:26:43 +08:00 -
74825181f2
Remove old-version dsl examples. (#2644)
Junkai-Wu
2025-09-18 10:23:30 +08:00 -
a49f8062e3
Remove old-version dsl examples (#2645)
Junkai-Wu
2025-09-18 10:23:07 +08:00 -
8825e8be4f
Add required changes for github pipeline. (#2648)
Junkai-Wu
2025-09-18 10:22:45 +08:00 -
7817e47154
Fxied a typo in pipeline descript docs. (#2623)
wbn
2025-09-16 10:32:27 +08:00 -
25ccb875b8
Fix: a calculation error in the example of dividing out in the 02_layout_algebra doc (#2635)
Asuka
2025-09-16 10:31:33 +08:00 -
29c1ad704a
Fix doc cute 03_tensor.md link typo (#2627)
Wanshe
2025-09-16 10:26:43 +08:00 -
57e3cfb47a
doc change for 4.2 (#2639)
v4
Haicheng Wu
2025-09-15 22:02:45 -04:00 -
e7e0adddac
Update version.h
Haicheng Wu
2025-09-15 12:40:58 -04:00 -
6a35b4d22f
v4.2 tag release. (#2638)
Junkai-Wu
2025-09-16 00:21:53 +08:00 -
56f0718a97
ex77 backwards GQA (#2556)
Richard Cai
2025-09-09 09:53:28 -07:00 -
76c96b0be3
Fix incorrect shapes in copy_atom doc comments. (#2575)
Lifu Huang
2025-09-04 16:57:24 -07:00 -
d98e7bf7ce
Fix comment in mma_atom.hpp (#2579)
ao jia
2025-09-05 07:56:39 +08:00 -
b6ccf34aef
Fix Copy_Atom type mismatch in sgemm_sm80.cu (#2582)
Lifu Huang
2025-09-04 16:56:17 -07:00 -
2288c0c901
Fix bugs in matrix.h (#2598)
Andrei Alexandrescu
2025-09-04 19:55:11 -04:00 -
b2dd65dc86
more robust imports in heuristics.py and heuristics_provider.py (#2596)
Harrison Barclay
2025-08-28 22:32:55 -04:00 -
496654bf2c
Fix sm100 gemm wrong static constexpr that breaks compilation on Windows (#2167)
Javier
2025-08-28 21:13:00 -05:00 -
9ca7e877b2
fix gqa issue for blackwell fmha.py (#2599)
Linfeng Zheng
2025-08-28 23:15:20 +08:00 -
a49a78ffef
v4.2 release. (#2587)
Junkai-Wu
2025-08-23 06:11:24 +08:00 -
11cad1f67b
fix a typo. (#2561)
qqwqqw689
2025-08-20 10:23:09 +08:00 -
931359cec1
Fix typo in functional.h (#2571)
zkyue
2025-08-20 10:22:31 +08:00 -
42e7c546c4
Add movmatrix support (movmatrix.sync.aligned.m8n8.trans.b16) (#2562)
Inoday Yadav
2025-08-19 22:22:02 -04:00 -
ec18e8043b
Make swizzle in pycute work (#2553)
melonedo
2025-08-20 10:21:00 +08:00 -
5b76420d6a
[DOC] Add more exposition to composition example (#2536)
Srinath Kailasa
2025-08-12 03:20:36 +01:00 -
19772cd63e
Fix typo in smem_allocator.py (#2517)
Horace He
2025-08-10 19:44:22 -07:00 -
052afcd314
fix typo (#2529)
zkyue
2025-08-11 10:44:02 +08:00 -
86cf63e2d4
NIT: Grammar (#2537)
Srinath Kailasa
2025-08-11 03:42:45 +01:00 -
a267d47f9b
Update batched_gemm.cu (#2538)
Tarun Paparaju
2025-08-10 19:42:21 -07:00 -
9e6ab77d27
Fix a copy error in the SM70 main loop when loading data from smem to rmem (#2540)
starwang1024
2025-08-11 10:42:01 +08:00 -
d0eada85a3
Support both CUDA 12 and 13 cccl header locations (#2543)
Robert Maynard
2025-08-10 22:41:25 -04:00 -
23139309e9
Fix incorrect K dim in CuTe MMA Atom doc. (#2544)
Lifu Huang
2025-08-10 19:40:56 -07:00 -
6dd13d4278
Facebook:This commit makes its files safe for use with -Wimplicit-fallthrough. (#2324)
Wenxin Cheng
2025-07-31 17:55:19 -07:00 -
3b054767b3
Fix typo (#2514)
Srinath Kailasa
2025-07-31 03:14:54 +01:00 -
6fb5e667c1
[Doc fix] incorrect compute cap. for Blackwell RTX (#2511)
Ali Hassani
2025-07-30 22:14:13 -04:00 -
6c891db9f6
Fix epilogue::thread::Convert cannot be used with cute::collective::DefaultEpilogue. (#2333)
Wenbo Yang
2025-07-31 10:12:53 +08:00 -
da47886e34
Fix example bug (#2351)
botbw
2025-07-31 10:12:33 +08:00 -
26b7450023
support fp16 accmulator for sm89 fp8 mma (#2378)
kf-zhang
2025-07-31 10:12:08 +08:00 -
a39cf6b511
Fix example in CuTe tutorials (#2416)
Luca Wehrstedt
2025-07-31 04:11:47 +02:00 -
f09045d660
Corrected minor nit in mma_traits.hpp (#2447)
Aditya Kane
2025-07-30 19:11:23 -07:00 -
84a27b3926
fix: examples/cute/tutorial/blackwell/04_mma_tma_2sm_sm100.cu GridDim miscalculated (#2492)
xiangjiaojun
2025-07-31 10:11:04 +08:00 -
e093b4f691
Fix tutorial comment in sgemm_1.cu: use tCrC instead of tCsA in axpby explanation (#2448)
kernyan
2025-07-30 22:09:55 -04:00 -
664c4f7b3e
Update CUTLASS version to 4.1
Haicheng Wu
2025-07-26 20:11:04 -04:00 -
0e026982ce
Example 77 add blackwell fmha bwd for MLA shape (#2466)
Zeyu WANG
2025-07-25 06:41:11 +08:00 -
9a9a579714
Merge pull request #2489 from NVIDIA/update_workflow_script
Larry Wu
2025-07-23 15:33:43 +08:00 -
51d730b8be
Support "CuTe DSL" auto-labeling in workflow
Larry Wu
2025-07-23 00:28:01 -07:00 -
6c0c8b7484
1. Update bug/feature report template to add component selection. (#2485)
Larry Wu
2025-07-23 00:38:03 +08:00 -
e51efbfe18
Update CHANGELOG.md
v4.1.0
Haicheng Wu
2025-07-21 22:09:56 -04:00 -
fd6cfe1ed0
v4.1 release update v2. (#2481)
Junkai-Wu
2025-07-22 10:03:55 +08:00 -
9baa06dd57
Add Blackwell MLA forward (shape: d=192, dv=128) implementation in example_77 (#2472)
zhang
2025-07-18 13:27:48 +08:00 -
ebe98c549a
cache procedural_name in GemmOperation (#2317)
Colin Peppler
2025-07-16 19:25:02 -07:00 -
9892624b66
Fix typos in the text (#2417)
Oleksandr Pavlyk
2025-07-16 20:51:12 -05:00 -
dc9876eeb2
Delete all docs files except index.html
redirect
Larry Wu
2025-07-06 05:49:10 -07:00 -
6c584d0e47
Redirect Github pages to NVIDIA Doc hub
Larry Wu
2025-07-06 05:46:58 -07:00 -
a1aaf2300a
v4.1 release
Junkai-Wu
2025-07-03 20:07:53 +08:00 -
b995f93317
4.0 doc change (#2425)
v4.0.0
Haicheng Wu
2025-06-27 21:35:06 +08:00 -
f2a17553d5
Added CODEOWNERS
oss_ci
Zekun Fan
2025-06-03 16:52:29 -07:00 -
889ff20648
v4.0 update v2. (#2420)
Junkai-Wu
2025-06-26 00:56:25 +08:00 -
dc4817921e
v4.0 update. (#2398)
Junkai-Wu
2025-06-12 21:10:29 +08:00 -
5c6bca0441
Update requirements.txt (#2390)
brandonsun
2025-06-10 14:31:49 +08:00 -
c2ad7c5b20
fix link in readme (#2379)
drazi
2025-06-07 19:38:38 +08:00 -
cc23f6d1e9
fix link (#2377)
drazi
2025-06-07 18:00:39 +08:00 -
5a287538c2
"Update CHANGELOG for 4.0 tagging" (#2374)
Vijay Thakkar
2025-06-06 10:07:36 -04:00 -
58a5197b9d
"Update CHANGELOG for 4.0 tagging"
thakkarv/4.0-changelog
Vijay Thakkar
2025-06-06 09:43:11 -04:00 -
8bdbfca682
v4.0 update. (#2371)
Junkai-Wu
2025-06-06 14:39:20 +08:00 -
2e2af190bd
Revert "[ex77] fix mla split; add fwd lse; add bwd varlen (#2366)" (#2370)
Manish Gupta
2025-06-05 20:14:57 -07:00 -
f12b1d75c9
[ex77] fix mla split; add fwd lse; add bwd varlen (#2366)
Markus Hoehnerbach
2025-06-05 15:39:46 -07:00 -
b244379d9b
Merge pull request #2359 from NVIDIA/oss_ci
zekunf-nv
2025-06-03 14:04:35 -07:00 -
9d165a3b8e
Handle get_masked_trip_count for small length in fmha example (#2292)
Taebum Kim
2025-05-31 11:51:18 +09:00 -
b9b110a9ea
Correct divmod order in example 77 (blackwell fmha) (#2291)
Taebum Kim
2025-05-31 11:50:40 +09:00 -
8206e7a0f5
Pre-compile in CuteDsl/ampere/elementwise_apply.py (#2340)
Gabriel Wu
2025-05-28 22:24:39 +08:00 -
6316b6f867
Fix typos (#2311)
co63oc
2025-05-23 20:30:10 +08:00 -
9354bfd7c1
Keep the documentation consistent with the sgemm_1.cu code. (#2285)
zkyue
2025-05-20 10:53:15 +08:00 -
5e9b8e2a25
fix docx (#2290)
1096125073
2025-05-20 10:52:37 +08:00 -
1ec230c4bf
Fix typo (#2299)
Ruyman
2025-05-15 14:38:42 +01:00 -
f89cd95b16
Update elementwise_add.ipynb (#2298)
Driss Guessous
2025-05-15 06:38:27 -07:00 -
f115c3f854
Release v4.0.0 (#2294)
Kihiro Bando
2025-05-13 15:55:29 -04:00 -
ad7b2f5e84
3.9.2 doc/version (#2279)
v3.9.2
Haicheng Wu
2025-05-04 00:00:15 -04:00