Resolved Issues#

The following issues that were previously identified as known issues have been resolved.

DGX Station A100 Fails to Boot After Applying MIG Configurations#

Issue#

After MIG configurations were successfully applied to a DGX station A100 system running DGX OS 7.0.2, the system failed to boot when you ran the sudo reboot command. Resetting the GPUs by performing a DC power cycle could not recover the system.

Workaround#

The DGX OS 7.0.2 release does not support the DGX Station A100 system with MIG enabled. To resolve the boot failure, install DGX OS 6.3.2 on the system and then apply MIG configurations.

Status#

Resolved in GPU driver versions 570.117 and later, as well as 575.20 and later.

Update the MLNX Firmware for the Connect-X and Bluefield-3 Adapters#

Issue#

The online network repository for DOCA 2.9.1/Ubuntu 24.04 does not contain the mlnx_fw_updater tool, which is needed to update the Connect-X and Bluefield-3 adapters to their latest firmware versions.

Workaround#

Install the mlnx_fw_updater tool.

$ wget https://linux.mellanox.com/public/repo/mlnx_ofed/latest-24.10/ubuntu24.04/x86_64/mlnx-fw-updater_24.10-1.1.4.0_amd64.deb
$ sudo apt install mlnx-fw-updater_24.10-1.1.4.0_amd64.deb
$ sudo /opt/mellanox/mlnx-fw-updater/mlnx_fw_updater.pl

Status#

Resolved in version 7.1.0.

Errors Occur When Loading Mirrored Repositories on Air-Gapped Systems#

Issue#

When you run the apt update command to load mirrored repositories on an air-gapped system, the following error messages appear:

File not found - /media/repository/mirror/security.ubuntu.com/ubuntu/dists/jammy-security/main/cnf/Commands-amd64 (2: No such file or directory)
Failed to fetch file:/media/repository/mirror/security.ubuntu.com/ubuntu/dists/jammy-security/main/cnf/Commands-amd64  File not found - /media/repository/mirror/security.ubuntu.com/ubuntu/dists/jammy-security/main/cnf/Commands-amd64 (2: No such file or directory)

Explanation#

This issue occurs because a fix for the apt-mirror package, which is available in Ubuntu 23.10, has yet to be implemented in the Ubuntu 22.04 repositories. If you are using an apt-mirror package

  • Version later than 0.5.4-1: Contact NVIDIA Enterprise Services by filing a support case.

  • Version 0.5.4-1: Use the following workaround to mirror the repositories.

You can run the following command to determine the version of your apt-mirror package:

$ dpkg -l | grep apt-mirror

ii  apt-mirror                  0.5.4-1               all             APT sources mirroring tool

Status#

Resolved in version 7.1.0.