Quick Start Guide
The CUTLASS DSL 4.0 release currently supports Linux and Python 3.12 only. To install the CUTLASS DSLs (limited to the CuTe DSL for now), use one of the commands in the Installation section below.
Installation
To ensure compatibility with the examples and code on GitHub, use the requirements.txt file from the corresponding commit in the repository.
git clone https://github.com/NVIDIA/cutlass.git
pip install -r cutlass/python/CuTeDSL/requirements.txt
If you just want to try out the last known stable release of the CUTLASS DSL (which may not be compatible with the latest examples and code), run:
pip install nvidia-cutlass-dsl
The nvidia-cutlass-dsl wheel includes everything needed to generate GPU kernels. It requires the same NVIDIA driver version as the CUDA Toolkit 12.9.
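After installing, it can help to confirm that the environment matches the requirements stated above (Linux, Python 3.12, and the nvidia-cutlass-dsl wheel). The following is a minimal sketch using only the Python standard library; the helper name check_environment is hypothetical and not part of the CUTLASS DSL. It does not check the NVIDIA driver version.

```python
import importlib.metadata
import platform
import sys


def check_environment(dist_name="nvidia-cutlass-dsl"):
    """Report whether this interpreter matches the stated requirements.

    check_environment is an illustrative helper, not a CUTLASS API.
    """
    checks = {
        # The 4.0 release is documented as Linux-only.
        "linux": platform.system() == "Linux",
        # The 4.0 release is documented as Python 3.12 only.
        "python_3_12": sys.version_info[:2] == (3, 12),
    }
    try:
        # Look up the installed wheel by its pip distribution name.
        checks["wheel_version"] = importlib.metadata.version(dist_name)
    except importlib.metadata.PackageNotFoundError:
        # None means pip has not installed the wheel in this environment.
        checks["wheel_version"] = None
    return checks


if __name__ == "__main__":
    for name, result in check_environment().items():
        print(f"{name}: {result}")
```

A wheel_version of None suggests the install command above was run in a different Python environment than the one executing this script.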
Recommended Dependencies
To run examples and begin development, we recommend installing:
pip install torch jupyter
Recommended Python environment variables for Jupyter notebooks
We recommend setting the following environment variable, which forces Python's stdout and stderr to be unbuffered, before launching Jupyter notebooks:
export PYTHONUNBUFFERED=1
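The variable is read when the Python interpreter starts, so it must be exported in the shell that launches Jupyter, not set from inside a notebook that is already running. As a rough sketch, a notebook cell could confirm that the kernel inherited the setting; the helper name unbuffered_enabled is hypothetical.

```python
import os


def unbuffered_enabled(env=None):
    """Return True if PYTHONUNBUFFERED is set to a non-empty string.

    Python treats any non-empty value (even "0") as enabling unbuffered mode.
    unbuffered_enabled is an illustrative helper, not a CUTLASS API.
    """
    env = os.environ if env is None else env
    return bool(env.get("PYTHONUNBUFFERED", ""))


if __name__ == "__main__":
    print("PYTHONUNBUFFERED active:", unbuffered_enabled())
```

If this reports False inside a notebook, restart Jupyter from a shell where the export has already been applied.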