Quick Start Guide#

The CUTLASS DSL 4.0 release currently supports Linux and Python 3.12 only. To install CUTLASS DSLs (limited to CuTe DSL for now), use the following command

Installation#

To install the CUTLASS DSL, run:

pip install nvidia-cutlass-dsl

The nvidia-cutlass-dsl wheel includes everything needed to generate GPU kernels. It requires the same NVIDIA driver version as the CUDA Toolkit 12.9.

To ensure compatibility with the examples and code on GitHub, use the requirements.txt file from the corresponding commit in the repository.