Quick Start Guide#
The CUTLASS DSL 4.0 release currently supports Linux and Python 3.12 only. To install CUTLASS DSLs (limited to CuTe DSL for now), use the following command
Installation#
To install the CUTLASS DSL, run:
pip install nvidia-cutlass-dsl
The nvidia-cutlass-dsl wheel includes everything needed to generate GPU kernels. It requires
the same NVIDIA driver version as the
CUDA Toolkit 12.9.
To ensure compatibility with the examples and code on GitHub,
use the requirements.txt file from the corresponding commit in the repository.
Recommended Dependencies#
To run examples and begin development, we recommend installing:
pip install torch jupyter
Recommended Python environment variables for jupyter notebooks#
We recommend setting the following environment variable when running jupyter notebooks.
export PYTHONUNBUFFERED=1