CUTLASS 2.x# Layouts and Tensors CUTLASS Layout Concept Accessing elements within a tensor Summary: GEMM API CUTLASS GEMM Model CUTLASS GEMM Components Tile Iterator Concepts Definitions Frequently Used Tile Iterator Concepts Utilities Tensor Allocation and I/O Device Allocations Tensor Initialization Reference Implementations Debugging Asynchronous Kernels with CUTLASS’s Built-in synclog Tool