Quick Start Guide#

The CUTLASS DSL 4.0 release currently supports Linux and Python 3.12 only. To install CUTLASS DSLs (limited to CuTe DSL for now), use the following command

Installation#

To ensure compatibility with the examples and code on GitHub, use the requirements.txt file from the corresponding commit in the repository.

git clone https://github.com/NVIDIA/cutlass.git
pip install -r cutlass/python/CuTeDSL/requirements.txt

If you just want to try out the last known stable release of the CUTLASS DSL (may not compatible with the latest examples and code), run:

pip install nvidia-cutlass-dsl

The nvidia-cutlass-dsl wheel includes everything needed to generate GPU kernels. It requires the same NVIDIA driver version as the CUDA Toolkit 12.9.