Version EL8-24.01
This section provides details of each DGX Software for Red Hat Enterprise Linux release. Releases consist mostly of new NVIDIA features, accumulated bug fixes, and security updates.
Attention
If your system is running a version earlier than EL8-22.05, you need to update the keys on the system. Refer to Rotating the GPG Key for more information about how to rotate the keys.
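The version check in the note above can be sketched in Python. This is a minimal illustration, assuming release names always follow the `EL8-YY.MM` pattern seen in this document. The helper names `parse_release` and `needs_key_rotation` are hypothetical and not part of any NVIDIA tool, and the installed release name is supplied by the caller rather than read from the system.

```python
import re
from typing import Tuple

# Releases earlier than this one require the GPG key rotation (per the note above).
KEY_ROTATION_THRESHOLD = "EL8-22.05"


def parse_release(name: str) -> Tuple[int, int]:
    """Parse a release name such as 'EL8-24.01' into a (year, month) tuple.

    Assumes the 'EL8-YY.MM' naming pattern used in this document.
    """
    match = re.fullmatch(r"EL8-(\d{2})\.(\d{2})", name.strip())
    if match is None:
        raise ValueError(f"unrecognized release name: {name!r}")
    return int(match.group(1)), int(match.group(2))


def needs_key_rotation(installed: str) -> bool:
    """Return True if the installed release is earlier than EL8-22.05."""
    return parse_release(installed) < parse_release(KEY_ROTATION_THRESHOLD)


if __name__ == "__main__":
    # Example release names; replace with the release installed on your system.
    for release in ("EL8-21.08", "EL8-22.05", "EL8-24.01"):
        print(release, "needs key rotation:", needs_key_rotation(release))
```

Comparing the parsed `(year, month)` tuples avoids the pitfalls of comparing the raw strings, although for this fixed-width naming pattern both approaches would agree.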
Important
Installing or updating to EL8-24.01 also updates the installed Red Hat Enterprise Linux 8 distribution to the latest version.
If you need to use the Mellanox OpenFabrics Enterprise Distribution for Linux (MLNX_OFED), before you install or update to EL8-24.01, ensure that an MLNX_OFED package version is available that supports the latest Red Hat Enterprise Linux 8 version.
Note
Unlike the DGX OS shipped with the NVIDIA DGX system, the DGX software stack for Red Hat does not include the Mellanox OpenFabrics Enterprise Distribution (MLNX_OFED) for Linux. When using MLNX_OFED with Red Hat, ensure you install a supported MLNX_OFED kernel version to avoid incompatibilities with the Red Hat distribution kernel.
Refer to the DGX Software for Red Hat Enterprise Linux 8 Installation Guide for instructions.
Release Highlights
Added support for Red Hat Enterprise Linux 8.9 and Rocky 8.9.
Added support for NVIDIA® OpenFabrics Enterprise Distribution for Linux (MLNX_OFED) version 23.10-1.1.9.0, a long-term support (LTS) release.
Added support for the single-port ConnectX-7 VPI adapter card on DGX A100 systems.
Updated component versions.
Continued support for DGX H100.
Qualified Software Stack
Component | Latest versions in the repositories |
---|---|
DGX Base OS | EL8-24.01 |
OS | Red Hat Enterprise Linux 8.9 and Rocky Linux 8.9 |
Kernel (Red Hat) | 4.18.0-513.9.1.el8_9.x86_64 |
CUDA Toolkit and GPU Driver | CUDA Toolkit 12.2 and 535.129.03 (Default); CUDA Toolkit 11.4 and 470.223.02 |
NCCL | |
cuDNN | |
DCGM | |
GPUDirect Storage (GDS) | 1.7.2 or later |
NVIDIA System Management (NVSM) | |
Docker-CE | |
NVIDIA Container Runtime | |
MIG Configuration Tool | nvidia-mig-manager 0.5.4 (Refer to NVIDIA mig-parted github pages and deployments.) |
NGC CLI | 3.33.0 |
DLFW | 23.10 |
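As a quick sanity check after an update, the running kernel can be compared with the kernel listed in the table above. The following is a minimal sketch, not an NVIDIA-provided tool: the helper name `kernel_matches` is hypothetical, and it relies on Python's standard `platform.release()`, which on Linux reports the same string as `uname -r`. Because the table lists the latest versions in the repositories at release time, a mismatch may simply mean that a newer kernel has since been published.

```python
import platform

# Kernel listed in the Qualified Software Stack table for EL8-24.01.
QUALIFIED_KERNEL = "4.18.0-513.9.1.el8_9.x86_64"


def kernel_matches(running: str, qualified: str = QUALIFIED_KERNEL) -> bool:
    """Return True if the running kernel release equals the qualified one."""
    return running.strip() == qualified


if __name__ == "__main__":
    running = platform.release()  # same value as `uname -r` on Linux
    if kernel_matches(running):
        print(f"Running kernel {running} matches the qualified kernel.")
    else:
        print(f"Running kernel {running} differs from {QUALIFIED_KERNEL}.")
```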
The following table provides information about the supported OS and matching firmware versions for NVIDIA® OpenFabrics Enterprise Distribution for Linux (MLNX_OFED) version 23.10-1.1.9.0.
OS | DGX-1, DGX-2 ConnectX-4 (CX-4) or ConnectX-5 (CX-5) | DGX A100 ConnectX-6 | DGX A100 ConnectX-7 | DGX H100 ConnectX-7 |
---|---|---|---|---|
RHEL 8 | CX-5: 16.35.3006; CX-4: 12.28.2006 | 20.39.2048 | 28.39.2048 | 28.39.2048 |
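The firmware table above can be expressed as a small lookup for scripted checks. This is a sketch under stated assumptions: the dictionary expands the shared DGX-1/DGX-2 column into one entry per system and adapter, the function names `expected_firmware` and `firmware_matches` are hypothetical, and the installed firmware string is supplied by the caller (obtaining it from the adapter is outside the scope of this example).

```python
from typing import Optional

# Firmware versions from the table above for MLNX_OFED 23.10-1.1.9.0 on RHEL 8,
# keyed by (DGX system, adapter).
EXPECTED_FIRMWARE = {
    ("DGX-1", "ConnectX-4"): "12.28.2006",
    ("DGX-1", "ConnectX-5"): "16.35.3006",
    ("DGX-2", "ConnectX-4"): "12.28.2006",
    ("DGX-2", "ConnectX-5"): "16.35.3006",
    ("DGX A100", "ConnectX-6"): "20.39.2048",
    ("DGX A100", "ConnectX-7"): "28.39.2048",
    ("DGX H100", "ConnectX-7"): "28.39.2048",
}


def expected_firmware(system: str, adapter: str) -> Optional[str]:
    """Return the firmware version from the table, or None if not listed."""
    return EXPECTED_FIRMWARE.get((system, adapter))


def firmware_matches(system: str, adapter: str, installed: str) -> bool:
    """Compare an installed firmware string with the table entry."""
    expected = expected_firmware(system, adapter)
    if expected is None:
        raise KeyError(f"no table entry for {system} with {adapter}")
    return installed.strip() == expected


if __name__ == "__main__":
    # Example values; replace with the firmware reported by your adapter.
    print(expected_firmware("DGX A100", "ConnectX-7"))
    print(firmware_matches("DGX H100", "ConnectX-7", "28.39.2048"))
```

Returning `None` for unlisted combinations and raising only in `firmware_matches` keeps the lookup safe to call speculatively while still flagging comparisons that the table cannot answer.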
Supported DGX Systems
NVIDIA has validated and tested EL8-24.01 with the following DGX systems:
DGX H100
DGX A100 640 GB
DGX A100 320 GB
DGX A800 640 GB
DGX-2
DGX-1 32 GB
DGX Station A100 320 GB
DGX Station A100 160 GB
DGX Station 32 GB
Update Instructions
See the section Installing and Updating the Software for instructions.