Version EL8-22.08

Attention: If your system is running a version earlier than EL8-22.05, you need to update the keys on the system. Refer to Rotating the GPG Key for more information about how to rotate the keys

The DGX Software for Red Hat Enterprise Linux 8, EL8-22.08, is available.

EL8-22.08 supports all DGX products - DGX A100, DGX-2, DGX-1, DGX Station, and DGX Station A100.

Important: Installing or updating to EL8-22.08 also updates the installed Red Hat Enterprise Linux 8 distribution to the latest version.
  • NVIDIA GPUDirect Storage (GDS) v1.1 does not support Red Hat Enterprise Linux 8.5.

    If you are using GDS 1.1, contact NVIDIA Enterprise Support before performing the upgrade.

  • If you need to use the Mellanox OpenFabrics Enterprise Distribution for Linux (MLNX_OFED), before you install or update to EL8-22.08, ensure that there is a MLNX_OFED package version available that supports the latest Red Hat Enterprise Linux 8 version.

    Refer to the DGX Software for Red Hat Enterprise Linux 8 Installation Guide for instructions.

Change Highlights

  • Updated R450, R470 GPU drivers (see Software Contents below for versions)
    • Attention: R515 GPU driver is currently not supported.
  • Updated NVSM to 22.06.02
  • Updated DCGM to 2.4.7
  • Updated MLNX OFED to 5.4-3.5.8.0
  • Updated NCCL to 2.15.1
  • Updated cuDNN to 8.4.1
  • Updated docker-ce: 20.10.18

Software Contents

The following table provides version information for software included in the DGX Software Stack for Red Hat Enterprise Linux 8.

Note: Unlike the DGX OS shipped with the NVIDIA DGX system, the DGX software stack for Red Hat does not include the Mellanox OpenFabrics Enterprise Distribution (MLNX_OFED) for Linux. When using MLNX_OFED with Red Hat, ensure you install a supported MLNX_OFED kernel version to avoid incompatibilities with the Red Hat distribution kernel. Refer to the DGX Software for Red Hat Enterprise Linux 8 Installation Guide for instructions.
Table 1. Contents of the Repositories
Component Version Additional Information
OS RHEL 8.6  
Kernel 4.18.0-372.13.1 or later  
CUDA Toolkit 11.4 Refer to the NVIDIA CUDA Toolkit Release Notes.
Note: CUDA 11.4 has been qualified with Red Hat Enterprise Linux 8.4 and older. For newer releases, please refer to Installing Required Components for installation instructions of the driver.
GPU Driver

R450: 450.203.08

R470: 470.141.10

R510: 510.85.02

Refer to the NVIDIA Data Center GPU documentation

Attention: R515 GPU driver is currently not supported.

NCCL 2.15.1  
cuDNN 8.4.1  
DCGM 2.4.7 Refer to the DCGM Release Notes.
Mellanox OFED

MLNX 5.4-3.5.8.0

Refer to MLNX 5.4-3.5.8.0
MLNX FW

ConnectX-4 12.28.2006

ConnectX-5 16.31.2006

ConnectX-6 20.31.2354

ConnectX-7 28.34.4000

 
NVSM

22.06.02

Refer to the NVIDIA System Management Documentation.
Docker Engine

docker-ce: 20.10.18

Note: If necessary, the following components require separate installation via sudo apt install:
  • docker-ce-rootless-extras 20.10.17
  • docker-scan-plugin 0.9.0
Refer to v20.10.17
NVIDIA Container Toolkit

nvidia-container-toolkit: 1.10.0-1

nvidia-docker2: 2.11.0-1

libnvidia-container1: 1.10.0-1

libnvidia-container-tools: 1.10.0-1

Refer to the NVIDIA Container Toolkit documentation.
NGC CLI 2.2.0-1 Refer to the NGC CLI Documentation
GPUDirect Storage (GDS) v1.0 Refer to GDS Documentation
MIG Configuration Tool nvidia-mig-manager 0.4.3 Refer to the following NVIDIA mig-parted github pages: https://github.com/NVIDIA/mig-parted and https://github.com/NVIDIA/mig-parted/tree/master/deployments/systemd
nvipmitool 1.0.6.0  
nvidia-peer-memory/nvidia-peer-memory DKMS 1.3.0  

Compatibility

NVIDIA has validated and tested DGX Software version EL8-22.08 on the following systems:
  • Linux Distribution and kernel:
    • Red Hat Enterprise Linux 8.6
    • Rocky Linux 8
    • Kernel 4.18.0-372.13.1
  • NVIDIA DGX systems
    • NVIDIA DGX A100 with Red Hat Enterprise Linux 8.6 and Rocky Linux 8
    • NVIDIA DGX-2 with Red Hat Enterprise Linux 8.6 and Rocky Linux 8
    • NVIDIA DGX-1 (V100) with Red Hat Enterprise Linux 8.6 and Rocky Linux 8
    • NVIDIA DGX Station with Red Hat Enterprise Linux 8.6 and Rocky Linux 8
    • NVIDIA DGX Station A100 with Red Hat Enterprise Linux 8.6 and Rocky Linux 8
  • 22.08 Deep Learning Framework containers
  • NVIDIA GPUDirect Storage v1.0 (refer to the GDS documentation for additional information)
  • MLNX OFED version 5.4-3.5.8.0
  • ConnectX Firmware: see table 1 above

Update Instructions

See the section Installing and Updating the Software for instructions.