3.9.1 doc/version change (#2273)

This commit is contained in:
Haicheng Wu
2025-05-01 00:27:00 -04:00
committed by GitHub
parent e3cb8a773a
commit f535c33634
6 changed files with 10 additions and 6 deletions

View File

@ -1,5 +1,9 @@
# NVIDIA CUTLASS Changelog
## [3.9.1](https://github.com/NVIDIA/cutlass/releases/tag/v3.9.1) (2025-04-30)
* Fixed Group Gemm hang issue in CUTLASS 3.x
* Improved Hopper [Blockwise](./examples/67_hopper_fp8_warp_specialized_gemm_with_blockwise_scaling/67_hopper_fp8_warp_specialized_gemm_with_blockwise_scaling.cu) and [Groupwise](./examples/67_hopper_fp8_warp_specialized_gemm_with_blockwise_scaling/67_hopper_fp8_warp_specialized_gemm_with_groupwise_scaling.cu) GEMM performance.
## [3.9.0](https://github.com/NVIDIA/cutlass/releases/tag/v3.9.0) (2025-04-24)

View File

@ -1,8 +1,8 @@
![ALT](./media/images/gemm-hierarchy-with-epilogue-no-labels.png "Complete CUDA GEMM decomposition")
# CUTLASS 3.9.0
# CUTLASS 3.9.1
_CUTLASS 3.9.0 - April 2025_
_CUTLASS 3.9.1 - April 2025_
CUTLASS is a collection of CUDA C++ template abstractions for implementing
high-performance matrix-matrix multiplication (GEMM) and related computations at all levels

View File

@ -36,7 +36,7 @@
#define CUTLASS_MAJOR 3
#define CUTLASS_MINOR 9
#define CUTLASS_PATCH 0
#define CUTLASS_PATCH 1
#ifdef CUTLASS_VERSIONS_GENERATED
#include "cutlass/version_extended.h"

View File

@ -133,7 +133,7 @@ def get_option_registry():
this._option_registry = OptionRegistry(device_cc())
return this._option_registry
this.__version__ = '3.9.0'
this.__version__ = '3.9.1'
from cutlass.backend import create_memory_pool
from cutlass.emit.pytorch import pytorch

View File

@ -36,7 +36,7 @@ from setuptools import setup
def perform_setup():
setup(
name='cutlass_library',
version='3.9.0',
version='3.9.1',
description='CUTLASS library generation scripts',
packages=['cutlass_library']
)

View File

@ -36,7 +36,7 @@ from setuptools import setup
def perform_setup():
setup(
name='pycute',
version='3.9.0',
version='3.9.1',
description='Python implementation of CuTe',
packages=['pycute'],
)