Version EL8-24.01

This section provides details of each DGX Software for Red Hat Enterprise Linux release. These include mostly new NVIDIA features and accumulated bug fixes and security updates.

Attention

If your system is running a version earlier than EL8-22.05, you need to update the keys on the system. Refer to Rotating the GPG Key for more information about how to rotate the keys.

Important

Installing or updating to EL8-24.01 also updates the installed Red Hat Enterprise Linux 8 distribution to the latest version.

If you need to use the Mellanox OpenFabrics Enterprise Distribution for Linux (MLNX_OFED), before you install or update to EL8-23.08, ensure that there is a MLNX_OFED package version available that supports the latest Red Hat Enterprise Linux 8 version.

Note

Unlike the DGX OS shipped with the NVIDIA DGX system, the DGX software stack for Red Hat does not include the Mellanox OpenFabrics Enterprise Distribution (MLNX_OFED) for Linux. When using MLNX_OFED with Red Hat, ensure you install a supported MLNX_OFED kernel version to avoid incompatibilities with the Red Hat distribution kernel.

Refer to the DGX Software for Red Hat Enterprise Linux 8 Installation Guide for instructions.

Release Highlights

  • Added support for Red Hat Enterprise Linux 8.9 and Rocky 8.9.

  • Added support for NVIDIA® OpenFabrics Enterprise Distribution for Linux (MLNX_OFED) version 23.10-1.1.9.0 - a long-term support (LTS) release.

  • Added support for single-port ConnectX-7 VPI adapter card for DGX A100 System.

  • Updated component versions.

  • Continued support for DGX H100.

Qualified Software Stack

Component

Latest versions in the repositories

DGX Base OS

EL8-24.01

OS

Red Hat Enterprise Linux 8.9 and Rocky Linux 8.9

Kernel (Red Hat)

4.18.0-513.9.1.el8_9.x86_64

CUDA Toolkit and GPU Driver

CUDA Toolkit 12.2 and R535TeslaRD5 535.129.03 (Default)

CUDA Toolkit 11.4 and R470TeslaRD10 470.223.02

NCCL

2.19.3

cuDNN

8.9.7

DCGM

3.3.0-002

GPUDirect Storage (GDS)

1.7.2 or later

NVIDIA System Management (NVSM)

23.09.02

Docker-CE

24.0.7-1

NVIDIA Container Runtime

  • nvidia-docker2: 2.13.0-1

  • libnvidia-container-tools: 1.13.4-1

  • libnvidia-container1: 1.13.4-1

  • nvidia-container-toolkit (and base): 1.13.4-1

MIG Configuration Tool

nvidia-mig-manager 0.5.4 (Refer to NVIDIA mig-parted github pages and deployments.)

NGC CLI

3.33.0

DLFW

23.10

The following table provides information about the supported OS and matching firmware versions for NVIDIA® OpenFabrics Enterprise Distribution for Linux (MLNX_OFED) version 23.10-1.1.9.0.

OS
DGX-1, DGX-2
ConnectX-4 (CX-4) or
ConnectX-5 (CX-5)
DGX A100
ConnectX-6
DGX A100
ConnectX-7
DGX H100
ConnectX-7
RHEL 8

CX-5: 16.35.3006

CX-4: 12.28.2006

20.39.2048

28.39.2048

28.39.2048

Supported DGX Systems

NVIDIA has validated and tested EL8-24.01 with the following DGX systems:

  • DGX H100

  • DGX A100 640 GB

  • DGX A100 320 GB

  • DGX A800 640 GB

  • DGX-2

  • DGX-1 32 GB

  • DGX Station A100 320 GB

  • DGX Station A100 160 GB

  • DGX Station 32 GB

Update Instructions

See the section Installing and Updating the Software for instructions.