DGX Firmware Update Container Version 23.3.1

The DGX Firmware Update container version 23.3.1 is available.

  • Package name: nvfw-dgxstationa100_23.3.1_230306.tar.gz

  • Run file name: nvfw-dgxstationa100_23.3.1_230306.run

  • Image name: nvfw-dgxstationa100:23.3.1

  • ISO image: DGXSTATIONA100_FWUI-23.3.1-2023-03-08-00-10-54.iso

  • PXE netboot: pxeboot-DGXSTATIONA100_FWUI-23.3.1.tgz

Highlights and Changes in this Release

This release is supported with the following DGX OS software:

  • DGX OS 5.4 Update 3 or later

  • EL7-22.08 update3 or later

  • EL8-22.08 update3 or later

  • BMC:

Contents of the DGX Station A100 System Firmware Update Container

This container includes the firmware binaries and update utilities for the firmware listed in the following table.

Component

Version

Key Changes

BMC

2.01.00

See DGX Station A100 BMC Changes

SBIOS

10.16

No change

Retimer

1.0.125

No change

A100 VBIOS

  • 80G: 92.00.38.00.01

  • 40G: 92.00.48.00.01

No change

A800 VBIOS

  • 80GB: 92.00.AC.00.0D

New support

M.2 Micron 7300 MTFDHBG1T9TDF SSD

95420260

No change

U.2 KIOXIA CM6 SSD

0105

No change

FPGA

2.71

No change

Storage Backplane

0.3

No change

NVFlash

5.799.0

No change

Updating the Firmware to Version 23.3.1

This section explains how to update the firmware on the system by using the firmware update container. It includes instructions to complete a transitional update for systems that require the update.

stop all unnecessary system activities.

Caution

While an update is in progress, do not add additional loads on the system, such as Kubernetes jobs or other user jobs or diagnostics. A high GPU workload can disrupt the firmware update process and result in an unusable component.

The commands use the .run file, but you can also use any method described in Using the DGX Station A100 FW Update Utility.

  1. Determine whether updates are needed by checking the installed versions.

    $ sudo ./nvfw-dgxstationa100_23.3.1_230306.run show_version
    
    • If there is a no in any up-to-date column for updatable firmware, proceed to the next step.

    • If all up-to-date column entries display a yes, no updates are required and no additional action is necessary.

  2. Stop the gdm3 service.

    $ sudo systemctl stop gdm3
    
  3. Complete the update for all firmware that is supported by the container.

    $ sudo ./nvfw-dgxstationa100_23.3.1_230306.run update_fw all
    

    Depending on the firmware that is updated, you might be prompted to reboot the system or power cycle the system:

    • If you are prompted to reboot, issue the following command:

      $ sudo reboot
      
    • If you are prompted to power cycle, issue the following commands:

      $ sudo ipmitool chassis power cycle
      

You can verify the update by issuing the following command:

$ sudo ./nvfw-dgxstationa100_23.3.1_230306.run show_version

Here is an example output for a DGX Station A100 40GB system:

BMC DGX Station A100
======================
Image Id              Status         Location      Onboard Version   Manifest  up-to-date
N/A                   Online         Local         01.24.00          01.24.00     yes

 FPGA
========
Onboard version     Manifest  up-to-date
2.71                  2.71       yes

 Storage Backplane
==================
Bus               Onboard Version   Manifest         up-to-date
N/A                     0.3             0.3              yes

 Retimer Loc.
=============
PCIe Slot#      Onboard Version   Manifest         up-to-date
Retimer@slot4       1.0.125       1.0.125             yes
Retimer@slot5       1.0.125       1.0.125             yes
Retimer@slot6       1.0.125       1.0.125             yes
Retimer@slot7       1.0.125       1.0.125             yes

 SBIOS
=======
Image Id                           Onboard Version   Manifest        up-to-date
N/A                                L10.16            L10.16             yes

 Video BIOS
============
Bus            Model                Onboard Version   Manifest         up-to-date
0000:01:00.0   A100-SXM4-40GB       92.00.48.00.01    92.00.48.00.01      yes
0000:47:00.0   A100-SXM4-40GB       92.00.48.00.01    92.00.48.00.01      yes
0000:81:00.0   A100-SXM4-40GB       92.00.48.00.01    92.00.48.00.01      yes
0000:c2:00.0   A100-SXM4-40GB       92.00.48.00.01    92.00.48.00.01      yes

 Mass Storage
==============
Drive Name/Slot    Model Number                Onboard Version    Manifest    up-to-date
nvme0n1            Micron 7300_MTFDHBG1T9TDF    95420260          95420260     yes
nvme1n1            Kioxia KCM6DRUL7T68            0105              0105       yes