Release Notes

cuSOLVERMp v0.5.0

  • Improved the performance of cusolverMpStedc().

  • Introduced a new option to force NCCL usage by setting the CUSOLVERMP_FORCE_NCCL=1 environment flag. This is only applicable in parts of the eigensolver for now.

cuSOLVERMp v0.4.3

  • Supported CUDA 12.1.1.

  • Fixed a bug that processors are hanging when a problem is tiny and fits in a single processor.

Known Issues

  • CUDA 12.1.1 is compatible with NCCL up to v2.16.x; higher NCCL version may hang intermittently for certain processor grids.

cuSOLVERMp v0.4.2

  • Fixed a bug in cusolverMpSyevd() that the code returns an internal error for a matrix filled with zero entires; the correct behavior is to return zero eigenvalues and unit eigenvectors.

  • Supported CUDA 12.1.1

  • Note that the code is compatible with NCCL up to v2.16.x

cuSOLVERMp v0.4.1

  • Added support for row major grid in SYEVD.

cuSOLVERMp v0.4.0

  • Released with HPC-SDK 23.5.

  • Added routines for symmetric (Hermitian) generalized eigen solver

    • cusolverMpSygst() reduces the symmetric (Hermitian) generalized eigen problem to standard form.

    • cusolverMpSygvd() computes all eigenvalues and eigenvectors of symmetric (Hermitian) generalized eigen problem.

cuSOLVERMp v0.3.1

  • Released with HPC-SDK 23.3.

  • Minor bugfixes are included

cuSOLVERMp v0.3.0

cuSOLVERMp v0.2.0

cuSOLVERMp v0.1.0