Resolved Issues#

In previous release, the following issues were identified as known issues; they have been resolved.

DGX Station A100 Fails to Boot After Applying MIG Configurations#

Issue

When MIG configurations are applied to a DGX station A100 system running DGX EL9-25.04 with the Release 570 GPU Driver installed, the system might fail to boot when you run the sudo reboot command. Resetting the GPUs by performing a DC power cycle does not recover the system.

Workaround

NVIDIA DGX Software for Red Hat Enterprise Linux 9 releases do not support the 570 release of the GPU Driver on DGX Station A100 systems.

To resolve the boot failure, reinstall DGX EL9-25.04, but do not set up MIG configurations. Alternatively, for systems other than the DGX B200, install a previous version of the GPU driver.

Status

Resolved in in GPU driver 570.117+ and 575.20+.

Tuned profiles do not take effect in graphical mode#

This issue only occurred on DGX Station systems and has been resolved. For instance, for the DGX Station A100, the dgxstation-a100-tuned-profile package that is installed by the Install DGX Station A100 Configurations step in Installing the DGX Software has been modified to mask the tuned-ppd.service or the power-profiles-daemon.service, whichever is in use, to ensure that TuneD service will be enabled after a reboot. See RedHat Solution 6955289 for more information.

Note

For reference, the issue that was resolved is shown below. It is no longer necessary to manually run one of the systemctl mask commands, since the appropriate dgxstation-*-tuned-profile package now runs the appropriate systemctl mask command.

Issue

DGX tuned profiles might not take effect due to a known Red Hat Enterprise Linux 9 issue. This affects systems that use a graphical target mode.

Workaround

This issue can be fixed by running one the following commands, depending on which service, tuned-ppd.service or power-profiles-daemon.service, is in use:

  • Run one of these commands, depending on which service is in use:

    systemctl mask tuned-ppd
    
    systemctl mask power-profiles-daemon
    
  • Reboot the system.

    reboot
    

Status

Resolved.