DGX OS 5 Releases

The following are the key features of DGX OS Release 5:

  • Supports all NVIDIA servers, DGX Station, and DGX Station A100 in one ISO image.
  • Based on Ubuntu 20.04
  • Includes drive encryption for added security.

UPDATE ADVISEMENT

  • NVIDIA KVM not Supported

    This release does not support the Linux Kernel-based Virtual Mode (KVM) on DGX systems.

    Note: NVIDIA KVM is available only with DGX-2 systems. DGX-2 customers that require this feature should stay with the latest DGX OS Server 4.x release.
  • Update DGX OS on DGX A100 before updating VBIOS

    DGX A100 systems running DGX OS earlier than version 4.99.8 should be updated to the latest version before updating the VBIOS to version 92.00.18.00.0 or later. Failure to do so will result in the GPUs not getting recognized.

  • NGC Containers

    With DGX OS 5, customers should update their NGC containers to container release 20.10.17 or later if they are using multi-node training. For all other use cases, refer to the NCG Framework Containers Support Matrix.

    Refer to the NVIDIA Deep Learning Frameworks documentation for information about the latest container releases and how to access the releases.

  • Ubuntu Security Updates

    Customers are responsible for keeping the DGX server up to date with the latest Ubuntu security updates using the ‘apt full-upgrade’ procedure. See the Ubuntu Wiki Upgrades web page for more information. Also, the Ubuntu Security Notice site (Ubuntu Security Notices) lists known Common Vulnerabilities and Exposures (CVEs), including those that can be resolved by updating the DGX OS software.

CURRENT VERSIONS

Here is a current list of the main DGX software stack component versions in the software repositories:

In addition to upgrading to the versions described in this section, performing an over-the-network update will also upgrade the Ubuntu 20.04 LTS version and Ubuntu kernel, depending on when the upgrade is performed.

For a list of updates in DGX OS 5, see Update History.

New Features in DGX OS Release 5.4

Here are the new features in DGX OS 5.4 (see also the Update History for important changes made since the initial release):
  • GPUDirect Storage 1.0 was added
Upgraded Software packages:
  • R450, R470 GPU drivers (see Software Contents below for versions)
  • NCCL 2.15.1
  • NVSM to 22.06.02
  • DCGM to 2.4.7
  • MLNX OFED to 5.4-3.5.8.0
  • docker-ce: 20.10.18
  • MIG Configuration Tool: 0.4.3
The newest version of nvidia-mig-parted now contains a set of checkpoint/restore commands. These allow one to checkpoint (and later restore) the MIG configuration applied across all GPUs on a node, regardless of what tool was used to set up those MIG configurations.

In previous versions of `nvidia-mig-parted`, all MIG configurations had to be done via `nvidia-mig-parted` itself in order for it to recognize and subsequently reconfigure the MIG state on set of GPUs. With this new checkpoint/restore feature, tools such as `nvidia-smi` can be used to configure MIG as well.

To use this feature, one would run (for example):
$ sudo nvidia-smi mig -C -cgi 1g.5gb,1g.5gb,1g.5gb,1g.5gb,1g.5gb,1g.5gb,1g.5gb
$ sudo -E nvidia-mig-parted checkpoint
This will save a checkpoint of the current MIG state to the default location of `/var/lib/nvidia-mig-manager/checkpoint.json`. Later (after a reboot, for example) users can run `restore` to ensure that the checkpointed MIG configuration is properly restored:
$ sudo -E nvidia-mig-parted restore

New Features in DGX OS Release 5.3

Important: The features and component versions in DGX OS 5.3 are identical to the versions in DGX OS 5.2. In DGX OS 5.3, the GPG keys that are used to sign the packages and metadata in those repositories need to be rotated. Refer to Rotating the GPG Keys for more information.

See also the Update History for important changes made since the initial release.

Rotating the GPG Keys

NVIDIA constantly evaluates and improves security implementations. As part of these improvements, we are rolling out changes to harden the security and reliability of our repositories. These changes require rotating the GPG keys that are used to sign the metadata and packages in those repositories.

Rotating the GPG Key For a Default Installation or After Reimaging

This section provides information about how to rotate the GPG keys for a default DGX OS installation from the factory or after you reimage with the DGX OS ISO.

  1. Download the new repository setup packages.
    wget https://repo.download.nvidia.com/baseos/ubuntu/focal/x86_64/pool/common/n/nvidia-repo-keys/nvidia-repo-keys_22.04-1_all.deb
    wget https://repo.download.nvidia.com/baseos/ubuntu/focal/x86_64/pool/dgx/n/nvidia-repos/dgx-repo_21.07-1_amd64.deb
    wget https://repo.download.nvidia.com/baseos/ubuntu/focal/x86_64/pool/common/n/nvidia-repos/cuda-compute-repo_21.07-1_amd64.deb
  2. Directly install the .deb packages, which skips the GPG check performed in apt.
    Note: If prompted, ensure that you accept the maintainer’s version for all files.
    $ sudo dpkg --force-confnew -i ./nvidia-repo-keys_22.04-1_all.deb ./dgx-repo_21.07-1_amd64.deb ./cuda-compute-repo_21.07-1_amd64.deb
  3. Manually revoke the previous DGX and CUDA GPG keys.
    sudo apt-key del 629C85F2
    sudo apt-key del 7FA2AF80
OTA updates can now occur as normal.

Rotating the GPG Keys for the DGX Software Stack

This section provides information about how to rotate the GPG keys if you installed Ubuntu and the DGX Software Stack.

  1. Download the updated dgx-repo-files tarball and extract its contents onto the root filesystem.
    curl https://repo.download.nvidia.com/baseos/ubuntu/focal/dgx-repo-files.tgz | sudo tar xzf - -C /
  2. Manually revoke the previous DGX and CUDA GPG keys.
    $ sudo apt-key del 629C85F2
    $ sudo apt-key del 7FA2AF80
OTA updates can now occur as normal.

New Features in DGX OS Release 5.2

Here are the new features in DGX OS 5.2 (see also the Update History for important changes made since the initial release):

  • Updated NVSM to 21.09.14
  • Updated DCGM to 2.3.2
  • Added DGX Software Stack installation method

    The DGX Software Stack provides the option to install a vanilla version of Ubuntu 20.04 and then separately install the additional NVIDIA software (NVIDIA DGX Software Stack). This option is available for DGX servers (DGX A100, DGX-2, DGX-1). The DGX Software Stack is a stream-lined version of the software stack incorporated into the DGX OS ISO image, and includes meta-packages to simplify the installation process. Refer to the DGX Software Stack for Ubuntu Installation Guide for instructions.

UPDATE ADVISEMENT

  • IMPORTANT: This release incorporates the following updates.
    • NVIDIA MLNX_OFED 5.4

    Customers are advised to consider these updates and any effect they may have on their application. For example, some MOFED-dependent applications may be affected.

    A best practice is to upgrade on select systems and verify that your applications work as expected before deploying on more systems.

New Features in DGX OS Release 5.1

Here are the new features in DGX OS 5.1 (see also the Update History for important changes made since the initial release):

  • Added NVIDIA GPU driver Release 470.
    Note: When upgrading DGX OS, the system remains on the installed GPU driver branch. For example, the GPU driver branch on the system does not automatically switch from R450 to R470. Refer to the Changing Your GPU Branch section of the DGX OS User Guide for instructions on switching GPU driver branches.
  • Supports the CUDA Toolkit up to 11.4 natively, or newer versions via the compatibility module.
  • Updated the Docker Engine to 20.10.
  • Incorporates NVIDIA MLNX_OFED 5.4.
  • Updated NVSM
    • Added ability to generate a test alert/email.
    • NVSM dump/show health includes firmware version information (incorporates 'nvsm show -level all' in the command).
    • NVSM binds port 273 to 127.0.0.1 to limit external communications.

      To open other ports for IPV4 or IPV6, edit nvsm.config (bindaddress) and then restart NVSM

  • Added NVML libraries
  • Includes MOFED 5.4
  • Added NGC CLI
  • Added MIG Configuration Tool to define MIG partitions and provide a systemd service to make MIG partitions persist across reboots.
    • MIG is disabled by default
    • The MIG configuration file overrides any MIG-related nvidia-smi commands. Use nvidia-mig-parted instead of nvidia-smi for MIG configuration.
  • arp_ignore=1 and arp_announce=2 are now set on all InfiniBand configured interfaces.
  • Added LLDPd for validating network cabling

    The default configuration is now set to use the PortID of the interface name rather than the MAC address.

  • Supports GPUDirect Storage 1.0 (Refer to GDS Documentation for installation instructions)

UPDATE ADVISEMENT

  • IMPORTANT: This release incorporates the following updates.
    • NVIDIA MLNX_OFED 5.4

    Customers are advised to consider these updates and any effect they may have on their application. For example, some MOFED-dependent applications may be affected.

    A best practice is to upgrade on select systems and verify that your applications work as expected before deploying on more systems.

New Features in DGX OS Release 5.0

Here are the new features in DGX OS 5.0 (see also the Update History for important changes made since the initial release):

  • NVIDIA GPU driver Release 450.
  • Supports the CUDA Toolkit up to 11.0 natively, or newer versions via the compatibility module.
  • Incorporates NVIDIA MLNX_OFED 5.1.
  • Added rootfs encryption option, configurable during the re-imaging process.
  • Added option to password protect the GRUB menu, configurable during the first boot process.
  • Updated NVSM
  • Added support for custom drive partitioning
  • Added monitoring of firmware health
  • Updated the default InfiniBand network naming policy.

    The InfinBand interfaces, enumerated as ibx in previous releases, now enumerate as ibpxsy (similar to Ethernet (enpxsy). Refer to the DGX A100 User Guide for the new naming.

UPDATE ADVISEMENT

  • IMPORTANT: This release incorporates the following updates.
    • NVIDIA MLNX_OFED 5.1

    Customers are advised to consider these updates and any effect they may have on their application. For example, some MOFED-dependent applications may be affected.

    A best practice is to upgrade on select systems and verify that your applications work as expected before deploying on more systems.

Update History

This section provides information about the updates to DGX OS 5.

The updates listed include:

  • Major component updates in the Ubuntu repositories.
  • NVIDIA driver updates in the Ubuntu repository

Refer to Installing the DGX OS (Reimaging the System) for instructions on how to install DGX OS from the ISO image,

Refer to Performing Package Updates for instructions on how to update DGX OS with all the latest DGX OS 5 updates from the network repositories.

Update: November 22, 2022

  • GPUDirect Storage 1.0 was added.
  • The following changes were made to the Ubuntu repositories:
    • R515 NVIDIA GPU Driver: 515.86.01
    • R470 NVIDIA GPU Driver: 470.161.03
    • R450 NVIDIA GPU Driver: 450.216.04
    • Note: When upgrading DGX OS, the system remains on the installed GPU driver branch. For example, the GPU driver branch on the system does not automatically switch from R450 to R470. Refer to the Changing Your GPU Branch section of the DGX OS User Guide for instructions on switching GPU driver branches.
    Here are the contents of the DGX OS 5.4.1 ISO:
    Component Version Additional Information
    CUDA Toolkit 11.4 Refer to the NVIDIA CUDA Toolkit Release Notes.
    Note: For DGX servers, CUDA is updated only if it has been previously installed.
    GPU Driver

    R450: 450.216.04

    R470: 470.161.03

    R515: 515.86.01

    Refer to the NVIDIA Data Center GPU documentation
    NCCL 2.15.1  
    cuDNN 8.4.1  
    DCGM 2.4.7 Refer to the DCGM Release Notes.
    Mellanox OFED

    MLNX 5.4-3.5.8.0

    Refer to MLNX 5.4-3.5.8.0
    MLNX FW

    ConnectX-4 12.28.2006

    ConnectX-5 16.31.2006

    ConnectX-6 20.31.2354

    ConnectX-7 28.34.4000

     
    NVSM

    22.09.03

    Refer to the NVIDIA System Management Documentation.
    Docker Engine

    docker-ce: 20.10.21

    Note: If necessary, the following components require separate installation via sudo apt install:
    • docker-ce-rootless-extras 20.10.18
    • docker-scan-plugin 0.9.0
    Refer to v20.10.21
    NVIDIA Container Toolkit

    nvidia-container-toolkit: 1.10.0-1

    nvidia-docker2: 2.11.0-1

    libnvidia-container1: 1.10.0-1

    libnvidia-container-tools: 1.10.0-1

    Refer to the NVIDIA Container Toolkit documentation.
    NGC CLI 2.2.0-1 Refer to the NGC CLI Documentation
    GPUDirect Storage (GDS) v1.0 Refer to GDS Documentation
    MIG Configuration Tool nvidia-mig-manager 0.4.3 Refer to the following NVIDIA mig-parted github pages: https://github.com/NVIDIA/mig-parted and https://github.com/NVIDIA/mig-parted/tree/master/deployments/systemd
    nvipmitool 1.0.6.0  
    nvidia-peer-memory/nvidia-peer-memory DKMS 1.3.0  

Update: October 14, 2022

  • GPUDirect Storage 1.0 was added.
  • The following changes were made to the Ubuntu repositories:
    • R470 NVIDIA GPU Driver: 470.129.06
    • R450 NVIDIA GPU Driver: 450.203.03
    • Note: When upgrading DGX OS, the system remains on the installed GPU driver branch. For example, the GPU driver branch on the system does not automatically switch from R450 to R470. Refer to the Changing Your GPU Branch section of the DGX OS User Guide for instructions on switching GPU driver branches.
  • The following changes were made to the Ubuntu repositories:
    • NCCL 2.15.1
    • DCGM 2.4.7
    • MOFED 5.4-3.5.8.0
    • NVSM 22.06.02
    • Docker-ce 20.10.18
    • MIG Configuration Tool: 0.4.3
  • Here are the contents of the DGX OS 5.4.1 ISO:
    Component Version Additional Information
    CUDA Toolkit 11.4 Refer to the NVIDIA CUDA Toolkit Release Notes.
    Note: For DGX servers, CUDA is updated only if it has been previously installed.
    GPU Driver

    R470: 470.129.06

    R450: 450.203.03
    Refer to the NVIDIA Data Center GPU documentation
    DCGM 2.4.7 Refer to the DCGM Release Notes.
    Mellanox OFED

    MLNX 5.4-3.5.8.0

    Refer to MLNX 5.4-3.5.8.0
    MLNX FW

    ConnectX-4 12.28.2006

    ConnectX-5 16.31.2006

    ConnectX-6 20.31.2354

    ConnectX-7 28.34.4000

     
    NVSM

    22.06.02

    Refer to the NVIDIA System Management Documentation.
    Docker Engine

    docker-ce: 20.10.18

    Note: If necessary, the following components require separate installation via sudo apt install:
    • docker-ce-rootless-extras 20.10.18
    • docker-scan-plugin 0.9.0
    Refer to v20.10.18
    NVIDIA Container Toolkit

    nvidia-container-toolkit: 1.10.0-1

    nvidia-docker2: 2.11.0-1

    libnvidia-container1: 1.10.0-1

    libnvidia-container-tools: 1.10.0-1

    Refer to the NVIDIA Container Toolkit documentation.
    NGC CLI 2.2.0-1 Refer to the NGC CLI Documentation
    GPUDirect Storage (GDS) v1.0 Refer to GDS Documentation
    MIG Configuration Tool nvidia-mig-manager 0.4.3 Refer to the following NVIDIA mig-parted github pages: https://github.com/NVIDIA/mig-parted and https://github.com/NVIDIA/mig-parted/tree/master/deployments/systemd
    nvipmitool 1.0.6.0  
    nvidia-peer-memory/nvidia-peer-memory DKMS 1.3.0  

Update: June 7, 2022

  • Installer version updated to 5.3.1.
  • The following changes were made to the Ubuntu repositories:
    • R470 NVIDIA GPU Driver: 470.129.06
    • R450 NVIDIA GPU Driver: 450.191.01
    • Note: When upgrading DGX OS, the system remains on the installed GPU driver branch. For example, the GPU driver branch on the system does not automatically switch from R450 to R470. Refer to the Changing Your GPU Branch section of the DGX OS User Guide for instructions on switching GPU driver branches.
  • The following changes were made to the Ubuntu repositories:
    • DCGM: 2.3.6
    • NVSM: 22.03.05
    • Docker CE: 20.10.16
    • nvidia-peer-memory/nvidia-peer-memory DKMS: 1.3.0
  • The DGX OS 5.3.1 ISO has been released.
    Here are the contents of the DGX OS 5.3.1 ISO:
    Component Release with R450 Release with R470 Additional Information
    Ubuntu 20.04 LTS20.04 LTS Refer to the Ubuntu 20.04 Desktop Guide.
    Ubuntu kernel 5.4.0-113-generic See Linux 5.4.0-113-generic.
    GPU Driver

    450.191.01

    470.129.06

    Note: Updating from R450 to R470 does not happen automatically when updating DGX OS 5, but requires separate steps. Refer to the Changing Your GPU Branch section of the DGX OS User Guide for instructions.
    Refer to the NVIDIA Data Center GPU documentation.
    CUDA Toolkit 11.4 Refer to the NVIDIA CUDA Toolkit Release Notes.

    Note:CUDA is installed from the ISO only on DGX Station systems, including DGX Station A100.

    Docker Engine 20.10.16 Refer to v20.10.11-3.
    NVIDIA Container Toolkit

    nvidia-container-runtime: 2.8.0-1

    nvidia-container-toolkit: 1.7.0-1

    nvidia-docker2: 2 8.0-1

    libnvidia-container1: 1.7.0-1

    libnvidia-container-tools: 1.7.0-1

    Refer to the NVIDIA Container Toolkit documentation.
    NVSM

    22.03.05

    Refer to the NVIDIA System Management Documentation.
    DCGM 2.3.6 Refer to the DCGM Release Notes.
    NGC CLI 2.2.0-1 Refer to the NGC CLI Documentation
    Mellanox OFED

    MLNX 5.4-3.1.0.0

    Refer to MLNX_OFED v5.4-1.0.3.0
    MIG Configuration Tool 0.1.2-1 Refer to the following NVIDIA mig-parted github pages: https://github.com/NVIDIA/mig-parted and https://github.com/NVIDIA/mig-parted/tree/master/deployments/systemd
    nvipmitool 1.0.6.0  
    nvidia-peer-memory/nvidia-peer-memory DKMS 1.3.0  

Update: May 17, 2022

  • The following changes were made to the Ubuntu repositories:
    • NVIDIA GPU R470 Driver: 470.129.06
    • NVIDIA GPU R450 Driver: 450.191.01

Update: April 28, 2022

Important: In DGX OS 5.3, the GPG keys that are used to sign the packages and metadata in those repositories need to be rotated. Refer to Rotating the GPG Keys for more information.

Update: February 17, 2022

  • Installer version updated to 5.2.0.
  • Added DGX Software Stack installation method

    The DGX Software Stack provides the option to install a vanilla version of Ubuntu 20.04 and then separately install the additional NVIDIA software (NVIDIA DGX Software Stack). This option is available for DGX servers (DGX A100, DGX-2, DGX-1). The DGX Software Stack is a stream-lined version of the software stack incorporated into the DGX OS ISO image, and includes meta-packages to simplify the installation process. Refer to the DGX Software Stack for Ubuntu Installation Guide for instructions.

  • The following changes were made to the Ubuntu repositories:
    • R470 NVIDIA GPU Driver: 470.103.01
    • R450 NVIDIA GPU Driver: 470.172.01
    • Note: When upgrading DGX OS, the system remains on the installed GPU driver branch. For example, the GPU driver branch on the system does not automatically switch from R450 to R470. Refer to the Changing Your GPU Branch section of the DGX OS User Guide for instructions on switching GPU driver branches.
  • The following changes were made to the Ubuntu repositories:
    • DCGM: 2.3.2
    • NVSM: 21.09.14
    • Docker CE: 20.10.11
    • nvidia-peer-memory/nvidia-peer-memory DKMS: 1.3.0
  • The DGX OS 5.2.0 ISO has been released.
    Here are the contents of the DGX OS 5.2.0 ISO
    Component Version Additional Information
    Ubuntu 20.04 LTS Refer to the Ubuntu 20.04 Desktop Guide.
    Ubuntu kernel 5.4.0-xx-generic See Linux 5.4.0-80.90.
    GPU Driver

    R450: 450.172.01

    R470: 470.103.01

    Note: Updating from R450 to R470 does not happen automatically when updating DGX OS 5, but requires separate steps. Refer to the Changing Your GPU Branch section of the DGX OS User Guide for instructions.
    Refer to the NVIDIA Data Center GPU documentation.
    CUDA Toolkit 11.4 Refer to the NVIDIA CUDA Toolkit Release Notes.

    Note:CUDA is installed from the ISO only on DGX Station systems, including DGX Station A100.

    Docker Engine 20.10.11 Refer to v20.10.11.
    NVIDIA Container Toolkit

    nvidia-container-runtime: 3.5.0-1

    nvidia-container-toolkit: 1.7.0-1

    nvidia-docker2: 2 8.0-1

    libnvidia-container1: 1.7.0-1

    libnvidia-container-tools: 1.7.0-1

    Refer to the NVIDIA Container Toolkit documentation.
    NVSM

    21.09.14

    Refer to the NVIDIA System Management Documentation.
    DCGM 2.3.2 Refer to the DCGM Release Notes.
    NGC CLI 2.2.0 Refer to the NGC CLI Documentation
    Mellanox OFED

    MLNX 5.4-1.0.3.0

    Refer to MLNX_OFED v5.4-1.0.3.0
    MIG Configuration Tool 0.1.2-1 Refer to the following NVIDIA mig-parted github pages: https://github.com/NVIDIA/mig-parted and https://github.com/NVIDIA/mig-parted/tree/master/deployments/systemd
    nvipmitool 1.0.60  
    nvidia-peer-memory/nvidia-peer-memory DKMS 1.3.0  

Update: December 14, 2021

  • Installer version updated to 5.1.1.
  • The following changes were made to the Ubuntu repositories:
    • R470 NVIDIA GPU Driver: 470.82.01
  • The following changes were made to the Ubuntu repositories:
    • DCGM: 2.3.1
    • NVSM: 21.09.10
    • MOFED: MLNX 5.4-3.1.0.0
    • Docker CE: 20.10.11
    • nvidia-container stack:
      • nvidia-docker2-2.8.0-1

        nvidia-container-runtime-3.7.0-1

        nvidia-container-toolkit-1.7.0-1

        libnvidia-container-tools-1.7.0-1

        libnvidia-container1-1.7.0-1

    • nvipmitool: 1.0.6.0
    • nvidia-peer-memory/nvidia-peer-memory DKMS: 1.2.0

Update: October 26 , 2021

  • The following changes were made to the Ubuntu repositories:
    • NVIDIA GPU Driver: 450.156.00

DGX OS 5.1 Release: August 26, 2021

  • The following updates were made to the Ubuntu repositories
    • Docker Engine: 20.10.7
    • NVSM: 21.07.15
    • DCGM: 2.2.9
    • nvidia-container-runtime: 3.5.0-1
    • NVIDIA MLNX_OFED: 5.4-1.0.3.0
    • (New) NGC CLI: 2.2.0
    • (New) MIG Configuration Tool: 0.1.2-1
  • The following changes were made to the Ubuntu repositories
    • Added the release 470 GPU Driver: 470.57.02
      Note: When upgrading DGX OS, the system remains on the installed GPU driver branch. For example, the GPU driver branch on the system does not automatically switch from R450 to R470. Refer to the Changing Your GPU Branch section of the DGX OS User Guide for instructions on switching GPU driver branches.
  • The DGX OS 5.1.0 ISO has been released.
    Here are the contents of the DGX OS 5.1.0 ISO
    Component Version Additional Information
    Ubuntu 20.04 LTS Refer to the Ubuntu 20.04 Desktop Guide.
    Ubuntu kernel 5.4.0-81 See Linux 5.4.0-80.90.
    GPU Driver

    R450: 450.142.00

    R470: 470.57.02

    Note: Updating from R450 to R470 does not happen automatically when updating DGX OS 5, but requires separate steps. Refer to the Changing Your GPU Branch section of the DGX OS User Guide for instructions.
    Refer to the NVIDIA Data Center GPU documentation.
    CUDA Toolkit 11.4 Refer to the NVIDIA CUDA Toolkit Release Notes.

    Note:CUDA is installed from the ISO only on DGX Station systems, including DGX Station A100.

    Docker Engine 20.10.7 Refer to v20.10.7.
    NVIDIA Container Toolkit

    nvidia-container-runtime: 3.5.0-1

    nvidia-container-toolkit: 1.5.1-1

    nvidia-docker2: 2 6.0-1

    libnvidia-container1: 1.4.0-1

    libnvidia-container-tools: 1.4.0-1

    Refer to the NVIDIA Container Toolkit documentation.
    NVSM

    21.07.15

    Refer to the NVIDIA System Management Documentation.
    DCGM 2.2.9 Refer to the DCGM Release Notes.
    NGC CLI 2.2.0 Refer to the NGC CLI Documentation
    Mellanox OFED

    MLNX 5.4-1.0.3.0

    Refer to MLNX_OFED v5.4-1.0.3.0
    MIG Configuration Tool 0.1.2-1 Refer to the following NVIDIA mig-parted github pages: https://github.com/NVIDIA/mig-parted and https://github.com/NVIDIA/mig-parted/tree/master/deployments/systemd

Update: June 30 , 2021

Update: June 20 , 2021

  • The following changes were made to the Ubuntu repositories:
    • NVIDIA GPU Driver: 450.142.00

Update: June 2, 2021

  • The following changes were made to the Ubuntu repositories:
    • NVIDIA GPU Driver: 450.119.04

      These are signed drivers and replace the unsigned drivers provided in the Ubuntu repositories.

Update: May 27, 2021

  • The following changes were made to the Ubuntu repositories:
    • NVSM: 20.09.26
    • MOFED: MLNX 5.1-2.6.2.0

      Incorporates mlnx-fw-updater 5.2-1.0.4.0. When the update is made, the Mellanox FW updater updates the ConnectX card firmware as follows:

      Card Firmware Version
      ConnectX-4 12.28.2006

      To force a downgrade, see Downgrading Firmware for Mellanox ConnectX-4 Cards for more information.

      ConnectX-5 16.29.1016
      ConnectX-6 20.29.1016

Update: May 06, 2021

The following change was made in the DGX repositories:
  • NVIDIA GPU Driver: 450.119.04

    Unsigned precompiled 450.119.04 kernel modules have been added to the DGX repository which provides a fix for issue Driver Version Mismatch Reported. They will be removed once signed precompiled 450.119.04 kernel modules are provided by Canonical.

    Important: Do not update if your system has Secure Boot enabled. Since these are unsigned drivers, systems with Secure Boot enabled will fail to load the drivers.

Update: April 20, 2021

The following change was made in the Ubuntu repositories:

Update: April 13, 2021

The following changes were made to the Ubuntu repositories:

Update: March 30, 2021

The following changes were made to the Ubuntu repositories:
  • MOFED: MLNX 5.1-2.5.8.0.47

    If you have already updated to the latest Ubuntu kernel (uname -a reports 5.4.0-67 or later), then you need to uninstall MOFED and then reinstall it as follows.
    $ apt-get purge mlnx-ofed-all mlnx-ofed-kernel-dkms --auto-remove
    $ apt-get update
    $ apt-get install mlnx-ofed-all nvidia-peer-memory-dkms

Update: March 2, 2021

Update: February 23, 2021

The following change was made to Ubuntu repositories:
  • NVSM: 20.09.17

Update: January 20, 2021

The following change was made in the Ubuntu repositories:
  • NVIDIA GPU Driver: 450.102.04

Update: December 11, 2020

The following changes were made in the Ubuntu repositories:

  • MOFED: MLNX 5.1-2.5.8.0

    When the update is made, the Mellanox FW updater updates the ConnectX card firmware as follows:

    Card Firmware Version
    ConnectX-4 12.28.2006

    To force a downgrade, see Downgrading Firmware for Mellanox ConnectX-4 Cards for more information.

    ConnectX-5 16.28.4000
    ConnectX-6 20.28.4000
  • Docker: docker-ce 19.03.14

    This addresses CVE-2020-15257.

DGX OS 5.0 Release: October 31, 2020

DGX OS 5.0 was released with the DGX OS 5.0.0 ISO. Here are the contents of the DGX OS 5.0.0 ISO:
Component Version Additional Information
Ubuntu 20.04 LTS Refer to the Ubuntu 20.04 Desktop Guide.
Ubuntu kernel 5.4.0-52-generic See linux 5.4.0-52-generic.
GPU Driver 450.80.02 Refer to the NVIDIA Tesla documentation.
CUDA Toolkit 11.0 Refer to the NVIDIA CUDA Toolkit Release Notes.

Note: CUDA is installed from the ISO only on DGX Station systems, including DGX Station A100.

Docker Engine 19.03.13 Refer to v10.03.14.
NVIDIA Container Toolkit

libnvidia-container1: 1.3.0-1

libnvidia-container-tools: 1.3.0-1

nvidia-container-runtime: 3.4.0-1

nvidia-container-toolkit: 1.3.0-1

nvidia-docker: 2 2.5.0-1

Refer to the NVIDIA Container Toolkit documentation.
NVSM 20.07.40 Refer to the NVIDIA System Management Documentation.
DCGM 2.0.13 Refer to the DCGM Release Notes.
NVIDIA System Tools 20.09-1  
Mellanox OFED MLNX 5.1-2.4.6.0