Release Notes#
This section provides detailed information for releases and upgrades available for the NVIDIA DGX™ Software Stack for Red Hat Enterprise Linux 9 and Rocky Linux 9.
Current Software Versions#
The following table shows the current version information of the software packages provided in the NVIDIA repositories for the NVIDIA DGX Software Stack.
Component |
Version |
Additional Information |
---|---|---|
GPU Driver |
EL8: RPM installer
EL9: RPM installer
|
|
GPU Driver |
EL8: RPM installer
EL9: RPM installer
|
|
GPU Driver |
EL8: RPM installer
EL9: RPM installer
|
|
CUDA Toolkit |
R570: 12.8 Update 1 download |
|
CUDA Toolkit |
R550: 12.4 Update 1 download |
|
CUDA Toolkit |
R535: 12.2 Update 2 download |
|
DOCA OFED |
||
Inbox OFED |
39.0-1 |
For DGX OS 6 only. |
NCCL |
||
cuDNN |
||
DCGM |
||
GPUDirect Storage (GDS) |
|
|
NVIDIA Container Toolkit |
NVIDIA Container Toolkit includes the following packages:
|
|
nvidia-peer-memory |
1.3 |
Note
For all DGX Stations installed with DGX OS, the CUDA Toolkit is installed by default. For all DGX servers installed with DGX OS and all systems installed with NVIDIA DGX Software for Red Hat Enterprise, the CUDA Toolkit is not installed by default; however, you can manually install a qualified CUDA Toolkit release. Refer to the CUDA Release Notes for driver compatibility information.
For CUDA Toolkit minor version compatibility and the minimum required driver version, refer to CUDA Compatibility.
For information about the MLNX_OFED release transition, refer to the MLNX_OFED section in Adapter Software.
The following table provides information about the matching firmware versions for the NVIDIA DOCA™ Host package with the doca-ofed installation profile v2.9.1.
OS
|
DGX A100
ConnectX-6
|
DGX A100
ConnectX-7
|
DGX H100/H200
ConnectX-7
|
---|---|---|---|
EL 9 |
Firmware for the NVIDIA® BlueField® DPU in NIC mode:
OS
|
DGX H100/H200
BlueField-3
|
DGX B200
BlueField-3
|
---|---|---|
EL 9 |
For installation instructions, refer to
NVIDIA DOCA-OFED: Installing NVIDIA DOCA-OFED
NVIDIA MLNX_OFED: Installing NVIDIA MLNX_OFED
ConnectX®-7 adapter cards: Installing ConnectX-7 Firmware
ConnectX®-6 adapter cards: Firmware Downloads
Note
For information about LTS software versions for related networking components, refer to the Networking Long-Term Support Releases page.
Latest Release#
Important
Installing or updating to the DGX Software also updates the installed Red Hat Enterprise Linux 9 distribution to the latest version.
If you use NVIDIA MLNX_OFED, before installing or updating to EL9-25.04, refer to the MLNX_OFED section in Adapter Software about transitioning to NVIDIA DOCA-OFED and consider any effect the updates might have on MLNX_OFED-dependent applications.
To check the latest Red Hat Enterprise Linux 9 version, refer to Red Hat Knowledgebase article 3078.
To check the MLNX_OFED package OS support, visit Mellanox and click the latest NVIDIA MLNX_OFED software version. Use the side menu to navigate to Release Notes > General Support and view Supported Operating Systems.
Release EL9-25.04#
Release Date: April 10, 2025
Release Highlights#
Supports Red Hat Enterprise Linux 9.5 and Rocky 9.5.
Introduces support for the NVIDIA DGX™ B200 system.
Refer to DGX B200 System Firmware Update Guide Version 25.04.1 for supported firmware.
No support for the NVIDIA DGX™ H800 system.
Updates GPU drivers:
Release 570.124.06 (default) with CUDA Toolkit 12.8 Update 1
Release 550.144.03 with CUDA Toolkit 12.4 Update 1
Release 535.230.02 with CUDA Toolkit 12.2 Update 2
Adds support for the NVIDIA® BlueField®-3 DPU in NIC mode on DGX H100, DGX H200, and DGX B200.
Includes support for the NVIDIA DOCA™ Host package with the doca-ofed installation profile v2.9.1 (providing the MLNX_OFED functionality).
Updates the DGX Software Stack.
Qualified Software Stack#
The following table shows the current version information of the software packages provided in the NVIDIA repositories for the NVIDIA DGX Software Stack.
Component |
Latest versions in the repositories |
---|---|
DGX Base OS |
EL9-25.04 |
OS |
Red Hat Enterprise Linux 9.5 and Rocky Linux 9.5 |
Kernel |
5.14.0-503.35.1.el9_5.x86_64 |
GPU Driver |
|
CUDA Toolkit |
|
NCCL |
|
cuDNN |
|
DCGM |
|
GPU Direct Storage |
|
NVIDIA System Management (NVSM) |
|
Docker CE |
|
NVIDIA Container Runtime |
|
MIG Configuration Tool |
|
GDRCopy |
|
DLFW (Deep Learning Frameworks) |
Supported DGX Systems#
The EL9-25.04 release supports the following DGX systems:
DGX B200 1,440 GB
DGX H200 1,128 GB
DGX H100 640 GB
DGX A100 640 GB
DGX A100 320 GB
DGX A800 640 GB
DGX Station A100 320 GB
DGX Station A100 160 GB
DGX Station A800 320 GB
DGX Station 32 GB
The EL9-25.04 release does not support the following DGX systems:
DGX H800
DGX-2
DGX-1 32 GB
Previous Releases#
Release EL9-24.12#
Release Date: December 18, 2024
Release Highlights#
Adds support for Red Hat Enterprise Linux 9.5 and Rocky 9.5.
Introduces support for the NVIDIA DGX™ H200 system.
No support for the NVIDIA DGX™ H800 system.
Updates GPU drivers:
Release 550.127.08 with CUDA Toolkit 12.4 Update 1
Release 535.216.03 with CUDA Toolkit 12.2 Update 2
Adds support for the NVIDIA® BlueField®-3 DPU in NIC mode v32.43.2026 LTS on DGX H100 and H200 systems.
Includes support for the NVIDIA DOCA™ Host package with the doca-ofed installation profile v2.9.1 (providing the MLNX_OFED functionality).
Updates the DGX Software Stack.
Qualified Software Stack#
The following table shows the current version information of the software packages provided in the NVIDIA repositories for the NVIDIA DGX Software Stack.
Component |
Latest versions in the repositories |
---|---|
DGX Base OS |
EL9-24.12 |
OS |
Red Hat Enterprise Linux 9.5 and Rocky Linux 9.5 |
Kernel |
5.14.0-503.15.1.el9_5.x86_64 |
GPU Driver |
|
CUDA Toolkit |
|
NCCL |
|
cuDNN |
|
DCGM |
|
GPU Direct Storage |
|
NVIDIA System Management (NVSM) |
|
Docker CE |
|
NVIDIA Container Runtime |
|
MIG Configuration Tool |
|
GDRCopy |
|
DLFW (Deep Learning Frameworks) |
The following table provides information about the supported OS and matching firmware versions for NVIDIA DOCA™ Host package with the doca-ofed installation profile v2.9.1 and the NVIDIA® OpenFabrics Enterprise Distribution for Linux (MLNX_OFED) v24.10-1.1.4.0.
OS
|
DGX-1, DGX-2
ConnectX-4 (CX-4) or
ConnectX-5 (CX-5)
|
DGX A100
ConnectX-6
|
DGX A100
ConnectX-7
|
DGX H100/H200
ConnectX-7
|
---|---|---|---|---|
RHEL 9
|
CX-5: 16.35.4030 CX-4: 12.28.2006 |
Supported DGX Systems#
The EL9-24.12 release supports the following DGX systems:
DGX H200 1,128 GB
DGX H100 640 GB
DGX A100 640 GB
DGX A100 320 GB
DGX A800 640 GB
DGX-2
DGX-1 32 GB
DGX Station A100 320 GB
DGX Station A100 160 GB
DGX Station A800 320 GB
DGX Station 32 GB
Release EL9-24.06#
Release Date: July 11, 2024
Release Highlights#
Added support for Red Hat Enterprise Linux 9.4 and Rocky 9.4.
Introduced support for the NVIDIA DOCA™ Host package with the doca-ofed installation profile v2.7.0.
Included support for the NVIDIA® OpenFabrics Enterprise Distribution for Linux (MLNX_OFED) v24.04-0.6.6.0
Continued support for single-port ConnectX-7 VPI adapter card for DGX A100 System.
Updated the DGX Software Stack.
Qualified Software Stack#
The following table shows the current version information of the software packages provided in the NVIDIA repositories for the NVIDIA DGX Software Stack.
Component |
Latest versions in the repositories |
---|---|
DGX Base OS |
EL9-24.06 |
OS |
Red Hat Enterprise Linux 9.4 and Rocky Linux 9.4 |
Kernel |
5.14.0-427.18.1.el9_4.x86_64 |
GPU Driver |
|
CUDA Toolkit |
|
NCCL |
|
cuDNN |
|
DCGM |
|
GPU Direct Storage |
|
NVIDIA System Management (NVSM) |
|
Docker CE |
|
NVIDIA Container Runtime |
|
MIG Configuration Tool |
|
GDRCopy |
|
DLFW (Deep Learning Frameworks) |
The following table provides information about the supported OS and matching firmware versions for the NVIDIA® OpenFabrics Enterprise Distribution for Linux (MLNX_OFED) v24.04-0.6.6.0 and the NVIDIA DOCA™ Host package with the doca-ofed installation profile v2.7.0.
OS
|
DGX-1, DGX-2
ConnectX-4 (CX-4) or
ConnectX-5 (CX-5)
|
DGX A100
ConnectX-6
|
DGX A100
ConnectX-7
|
DGX H100
ConnectX-7
|
---|---|---|---|---|
RHEL 9
|
CX-5: 16.35.3502 CX-4: 12.28.2006 |
20.41.1000 |
28.41.1000 |
28.41.1000 |
Supported DGX Systems#
The EL9-24.06 release supports the following DGX systems:
DGX H100
DGX A100 640 GB
DGX A100 320 GB
DGX A800 640 GB
DGX-2
DGX-1 32 GB
DGX Station A100 320 GB
DGX Station A100 160 GB
DGX Station A800 320 GB
DGX Station 32 GB
Release EL9-23.12#
Release Date: December 19, 2023
Release Highlights#
Added support for Red Hat Enterprise Linux 9.3 and Rocky 9.3.
Continued support for Red Hat Enterprise Linux 9.2 and Rocky Linux 9.2.
Added support for single-port ConnectX-7 VPI adapter card for DGX A100 System.
Added support for NVIDIA® OpenFabrics Enterprise Distribution for Linux (MLNX_OFED) version 23.10-1.1.9.0 - a long-term support (LTS) release.
Continued support for DGX H100.
Qualified Software Stack#
The following table shows the current version information of the software packages provided in the NVIDIA repositories for the NVIDIA DGX Software Stack.
Component |
Latest versions in the repositories |
---|---|
DGX Base OS |
EL9-23.12 |
OS |
Red Hat Enterprise Linux 9.3 and Rocky Linux 9.3 |
Kernel |
5.14.0-362.8.1.el9_3 |
GPU Driver and CUDA Toolkit |
CUDA Toolkit 12.2 and GPU Driver 535.129.03 (Default) |
NCCL |
|
cuDNN |
|
DCGM |
|
GPU Direct Storage |
1.7.2 or later |
NVIDIA System Management (NVSM) |
|
Docker-CE |
|
NVIDIA Container Runtime |
|
MIG Configuration Tool |
0.5.4-1 |
NGC CLI |
3.17.0-1 |
DLFW (Deep Learning Frameworks) |
23.10 |
The following table provides information about the supported OS and matching firmware versions for NVIDIA® OpenFabrics Enterprise Distribution for Linux (MLNX_OFED) version 23.10-1.1.9.0.
OS
|
DGX-1, DGX-2
ConnectX-4 (CX-4) or
ConnectX-5 (CX-5)
|
DGX A100
ConnectX-6
|
DGX A100
ConnectX-7
|
DGX H100
ConnectX-7
|
---|---|---|---|---|
RHEL 9
|
CX-5: 16.35.3006 CX-4: 12.28.2006 |
20.39.1002 |
28.39.1002 |
28.39.1002 |
Supported DGX Systems#
NVIDIA has validated and tested EL9-23.12 with the following DGX systems:
DGX H100
DGX A100 640 GB
DGX A100 320 GB
DGX A800 640 GB
DGX-2
DGX-1 32 GB
DGX Station A100 320 GB
DGX Station A100 160 GB
DGX Station 32 GB
Resolved Issues#
The following issues have been resolved in the EL9-23.12 release:
Bug ID |
Issue |
---|---|
4108242 |
Running |
4386925 |
GPUDirect RDMA bandwidth test failed with the |
Release EL9-23.08#
Release Highlights#
Add support for NVIDIA DGX H100 System. Support is limited to the Red Hat Enterprise Linux 9.1 release.
Add support for Red Hat Enterprise Linux 9.2 and Rocky Linux 9.2.
Qualified Software Stack#
The following table provides version information for EL9-23.08 and the software it has been qualified:
Component |
Latest versions in the repositories |
---|---|
Linux Distribution |
Red Hat Enterprise Linux 9.2 and Rocky Linux 9.2 For NVIDIA DGX H100 Systems, only Red Hat Enterprise Linux 9.1 is supported. |
GPU Driver |
|
CUDA Toolkit |
|
NCCL |
|
CuDNN |
8.9.2.26 |
DCGM |
3.1.8 |
MLNX OFED |
|
MLNX FW |
|
GPU Direct Storage |
1.7.2 |
NVIDIA System Management (NVSM) |
23.06.04 |
Docker Engine |
23.0.4 |
NVIDIA Container Runtime |
|
MIG Configuration Tool |
0.5.1 |
NGC CLI |
3.17.0 |
DLFW (Deep Learning Frameworks) |
23.07 |
The following table provides information about the supported OS and matching firmware versions for Mellanox OFED.
OS |
DGX-1, DGX-2 ConnectX-4 or ConnectX-5 |
DGX A100 ConnectX-6 (CX-6) |
DGX A100 ConnectX-7 (CX-7) |
DGX H100 ConnectX-7 (CX-7) |
---|---|---|---|---|
RHEL 8 |
|
|
|
|
RHEL 9 |
|
|
|
|
Supported DGX Systems#
NVIDIA has validated and tested EL9-23.08 with the following DGX systems:
NVIDIA DGX H100
NVIDIA DGX A100
NVIDIA DGX Station A100
NVIDIA DGX Station
NVIDIA DGX-2
NVIDIA DGX-1
Release EL9-23.01#
Initial release of the DGX Software Stack for Red Hat Enterprise Linux 9.
Qualified Software Stack#
The following table provides version information for EL9-23.01 and the software it has been qualified:
Component |
Versions in this release |
---|---|
Linux Distribution |
Red Hat Enterprise Linux 9.1 and Rocky Linux 9.1 |
GPU Driver |
|
CUDA Toolkit |
12.0 |
NCCL |
2.18.1 |
CuDNN |
8.9.1.23 |
DCGM |
3.1.8 |
NVIDIA MLNX_OFED |
5.8-2.0.3.0 |
NVIDIA ConnectX Firmware |
|
NVIDIA System Management (NVSM) |
22.12.04 |
Docker Engine |
23.0.4 |
NVIDIA Container Runtime |
|
MIG Configuration Tool |
0.5.1 |
NGC CLI |
3.17.0 |
DLFW (Deep Learning Frameworks) |
23.03 |
Supported DGX Systems#
NVIDIA has validated and tested EL9-23.01 with the following DGX systems:
DGX-1
DGX-2
DGX Station
DGX A100
DGX Station A100