Quick Start Guide#
The CUTLASS DSL 4.0 release currently supports Linux and Python 3.12 only. To install CUTLASS DSLs (limited to CuTe DSL for now), use the following command
Installation#
To install the CUTLASS DSL, run:
pip install nvidia-cutlass-dsl
The nvidia-cutlass-dsl
wheel includes everything needed to generate GPU kernels. It requires
the same NVIDIA driver version as the
CUDA Toolkit 12.9.
To ensure compatibility with the examples and code on GitHub,
use the requirements.txt
file from the corresponding commit in the repository.
Recommended Dependencies#
To run examples and begin development, we recommend installing:
pip install torch jupyter