Linear Algebra#

Overview#

The Linear Algebra module nvmath.linalg in nvmath-python leverages various NVIDIA math libraries to support dense [1] linear algebra computations. As of version 0.7.0, we offer both a generic matrix multiplication API based on the cuBLAS and NVPL libraries and a specialized matrix multiplication API (nvmath.linalg.advanced) based on the cuBLASLt library. See Generic and Specialized APIs for motivation.

At a high-level, if your use case is predominantly GEMM and requires particular flexibility in matrix data layouts, input and/or compute types, and also in choosing the algorithmic implementation, look at the specialized APIs. Otherwise, look at the generic APIs.

API Reference#

Generic Linear Algebra APIs (`nvmath.linalg`)#

The generic linear algebra module includes matrix multiplication APIs which accept structured matrices as input, but do not allow for control over computational precision or algorithm selection and planning.

`matmul`(a, b, /[, c, alpha, beta, ...])	Perform the specified matrix multiplication computation \(\alpha a @ b + \beta c\).
`Matmul`(a, b, /[, c, alpha, beta, ...])	Create a stateful object encapsulating the specified matrix multiplication computation \(\alpha a @ b + \beta c\) and the required resources to perform the operation.
`matrix_qualifiers_dtype`	A NumPy custom dtype which describes a structured matrix.
`DiagonalMatrixQualifier`()	A class which constructs and validates `matrix_qualifiers_dtype` for a diagonal matrix.
`GeneralMatrixQualifier`()	A class which constructs and validates `matrix_qualifiers_dtype` for a general rectangular matrix.
`HermitianMatrixQualifier`()	A class which constructs and validates `matrix_qualifiers_dtype` for a hermitian matrix.
`InvalidMatmulState`
`SymmetricMatrixQualifier`()	A class which constructs and validates `matrix_qualifiers_dtype` for a symmetric matrix.
`TriangularMatrixQualifier`()	A class which constructs and validates `matrix_qualifiers_dtype` for a triangular matrix.
`ExecutionCPU`(*[, num_threads])	A data class for providing CPU execution options.
`ExecutionCUDA`(*[, device_id])	A data class for providing GPU execution options.
`MatmulOptions`(*, allocator, blocking, ] =, ...)	A dataclass for providing options to a `Matmul` object.

Specialized Linear Algebra APIs (`nvmath.linalg.advanced`)#

The specialized linear algebra module includes a matrix multiplication API which only accepts general matrices, but provides extra functionality such as epilog functions, more options and controls over computational precision, and control over algorithm selection and planning.

`matmul`(a, b, /[, c, alpha, beta, epilog, ...])	Perform the specified matrix multiplication computation \(F(\alpha a @ b + \beta c)\), where \(F\) is the epilog.
`matrix_qualifiers_dtype`	NumPy dtype object that encapsulates the matrix qualifiers in linalg.advanced.
`Algorithm`(algorithm)	An interface class to query algorithm capabilities and configure the algorithm.
`Matmul`(a, b, /[, c, alpha, beta, ...])	Create a stateful object encapsulating the specified matrix multiplication computation \(\alpha a @ b + \beta c\) and the required resources to perform the operation.
`MatmulComputeType`	alias of `ComputeType`
`MatmulEpilog`	alias of `Epilogue`
`MatmulInnerShape`(value[, names, module, ...])	See `cublasLtMatmulInnerShape_t`.
`MatmulNumericalImplFlags`(value[, names, ...])	These flags can be combined with the \| operator: OP_TYPE_FMA \| OP_TYPE_TENSOR_HMMA ...
`MatmulReductionScheme`	alias of `ReductionScheme`
`MatmulEpilogPreferences`([aux_type, aux_amax])	A data class for providing epilog options as part of `preferences` to the `Matmul.plan()` method and the wrapper function `matmul()`.
`MatmulOptions`([compute_type, scale_type, ...])	A data class for providing options to the `Matmul` object and the wrapper function `matmul()`.
`MatmulPlanPreferences`([...])	A data class for providing options to the `Matmul.plan()` method and the wrapper function `matmul()`.
`MatmulQuantizationScales`([a, b, c, d])	A data class for providing quantization_scales to `Matmul` constructor and the wrapper function `matmul()`.

Helpers#

The Specialized Linear Algebra helpers module nvmath.linalg.advanced.helpers provides helper functions to facilitate working with some of the complex features of nvmath.linalg.advanced module.

Matmul helpers (`nvmath.linalg.advanced.helpers.matmul`)#

`create_mxfp8_scale`(x, exponent[, stream])	Create MXFP8 block scale with the same value for the whole tensor `x`.
`invert_mxfp8_scale`(scale)	Compute a reciprocal of MXFP8 block scale.
`apply_mxfp8_scale`(x, scale)	Apply MXFP8 block scale factors to tensor `x`.
`get_mxfp8_scale_offset`(x, index)	Computes the offset of MXFP8 scale used for element `x[index]`.

Footnotes

Linear Algebra#

Overview#

API Reference#

Generic Linear Algebra APIs (nvmath.linalg)#

Specialized Linear Algebra APIs (nvmath.linalg.advanced)#

Helpers#

Matmul helpers (nvmath.linalg.advanced.helpers.matmul)#

Generic Linear Algebra APIs (`nvmath.linalg`)#

Specialized Linear Algebra APIs (`nvmath.linalg.advanced`)#

Matmul helpers (`nvmath.linalg.advanced.helpers.matmul`)#