Release Notes

This section provides detailed information for releases and upgrades available for the DGX Software Stack for Red Hat Enterprise Linux 9 and Rocky Linux 9.

Current Contents of the Repositories

The following table provides the current version information for the software included in the DGX Software Stack.

Component

Latest versions in the repositories

GPU Driver

535.104.05

CUDA Toolkit

12.2.0

NCCL

2.18.3

CuDNN

8.9.2.26

DCGM

3.1.8

GPU Direct Storage

1.7.2.

NVIDIA System Management (NVSM)

23.06.04

Docker Engine

23.0.4

NVIDIA Container Runtime

  • nvidia-docker2: 2.13.1-1

  • nvidia-container-toolkit (and base): 1.13.1-1

  • libnvidia-container-tools: 1.13.1-1

  • libnvidia-container1: 1.13.1-1

MIG Configuration Tool

0.5.1

NGC CLI

3.17.0

DLFW (Deep Learning Frameworks)

23.07

The following table provides information about the supported OS and matching firmware versions for Mellanox OFED.

OS

DGX-1, DGX-2

ConnectX-4 or ConnectX-5

DGX A100

ConnectX-6 (CX-6)

DGX A100

ConnectX-7 (CX-7)

DGX H100

ConnectX-7 (CX-7)

RHEL 8

5.8-3.0.7.0

  • CX-5: 16.35.3006

  • CX-4: 12.28.2006

  • RHEL 8.8

5.8-3.0.7.0

  • CX-6: 20.35.3006

  • RHEL 8.8

5.4-3.7.5.0

  • CX-7: 28.34.4000

  • RHEL 8.8

5.9-0.5.6.0.127

  • CX-7: 28.36.2050

  • RHEL 8.7

RHEL 9

5.8-3.0.7.0

  • CX-5: 16.35.3006

  • CX-4: 12.28.2006

  • RHEL 9.2

5.8-3.0.7.0

  • CX-6: 20.35.3006

  • RHEL 9.2

5.4-3.7.5.0

  • CX-7: 28.34.4000

  • RHEL 9.2

5.9-0.5.6.0.127

  • CX-7: 28.36.2050

  • RHEL 9.1

Note

For information about LTS software versions for related networking components, refer to the Networking Long-Term Support Releases page.

Releases Information

This section provides details of each DGX Software for Red Hat Enterprise Linux release. These include mostly new NVIDIA features and accumulated bug fixes and security updates.

  • To check the latest Red Hat Enterprise Linux 9 version, Refer to Red Hat Knowledgebase article 3078.

  • To check the MLNX_OFED package OS support, visit Mellanox and click the latest NVIDIA MLNX_OFED software version and use the side menu to navigate to Release Notes - General Support in MLNX_OFED and view Supported Operating Systems.

Important

Installing or updating to the DGX Software also updates the installed Red Hat Enterprise Linux 9 distribution to the latest version.

If you use NVIDIA MLNX_OFED, then before installing or updating to EL9-23.08, be sure that there is a MLNX_OFED package version available that supports the latest Red Hat Enterprise Linux 9 version.

EL9-23.08 Release

Release Highlights

  • Add support for NVIDIA DGX H100 System. Support is limited to the Red Hat Enterprise Linux 9.1 release.

  • Add support for Red Hat Enterprise Linux 9.2 and Rocky Linux 9.2.

Qualified Software Stack

The following table provides version information for EL9-23.08 and the software it has been qualified:

Component

Latest versions in the repositories

Linux Distribution

Red Hat Enterprise Linux 9.2 and Rocky Linux 9.2

For NVIDIA DGX H100 Systems, only Red Hat Enterprise Linux 9.1 is supported.

GPU Driver

535.86.10

CUDA Toolkit

12.2.0

NCCL

2.18.3

CuDNN

8.9.2.26

DCGM

3.1.8

MLNX OFED

  • ConnectX-7 with DGX H100: 5.9-0.5.6.0.125

  • ConnectX-7 with DGX A100: 5.4-3.7.5.0

  • ConnectX-6 with DGX A100: 5.8-3.0.7.0

  • ConnectX-5 and ConnectX-4: 5.8-3.0.7.0

MLNX FW

  • ConnectX-7 and DGX H100: 28.36.2050

  • ConnectX-7 and DGX A100: 28.34.4000

  • ConnectX-6 and DGX A100: 20.35.4000

  • ConnectX-5: 16.35.3006

  • ConnectX-4: 12.28.2006

GPU Direct Storage

1.7.2

NVIDIA System Management (NVSM)

23.06.04

Docker Engine

23.0.4

NVIDIA Container Runtime

  • nvidia-docker2: 2.13.1-1

  • nvidia-container-toolkit (and base): 1.13.1-1

  • libnvidia-container-tools: 1.13.1-1

  • libnvidia-container1: 1.13.1-1

MIG Configuration Tool

0.5.1

NGC CLI

3.17.0

DLFW (Deep Learning Frameworks)

23.07

Hardware Compatibility

NVIDIA has validated and tested EL9-23.08 with the following DGX systems:

  • NVIDIA DGX H100

  • NVIDIA DGX A100

  • NVIDIA DGX Station A100

  • NVIDIA DGX Station

  • NVIDIA DGX-2

  • NVIDIA DGX-1

EL9-23.01 Release

Initial release of the DGX Software Stack for Red Hat Enterprise Linux 9.

Qualified Software Stack

The following table provides version information for EL9-23.01 and the software it has been qualified:

Component

Versions in this release

Linux Distribution

Red Hat Enterprise Linux 9.1 and Rocky Linux 9.1

GPU Driver

525.105.17

CUDA Toolkit

12.0

NCCL

2.18.1

CuDNN

8.9.1.23

DCGM

3.1.8

NVIDIA MLNX_OFED

5.8-2.0.3.0

NVIDIA ConnectX Firmware

  • CX-4: 12.28.2006

  • CX-5: 16.35.2000

  • CX-6: 20.35.2000

NVIDIA System Management (NVSM)

22.12.04

Docker Engine

23.0.4

NVIDIA Container Runtime

  • nvidia-docker2: 2.13.0-1

  • nvidia-container-toolkit (and base): 1.13.1-1

  • libnvidia-container-tools: 1.13.1-1

  • libnvidia-container1: 1.13.1-1

MIG Configuration Tool

0.5.1

NGC CLI

3.17.0

DLFW (Deep Learning Frameworks)

23.03

Hardware Compatibility

NVIDIA has validated and tested EL9-23.01 with the following DGX systems:

  • DGX-1

  • DGX-2

  • DGX Station

  • DGX A100

  • DGX Station A100