DGX Station A100 Firmware Update Container Version 24.1.1#
The DGX Firmware Update Container version 24.1.1 is available.
Package name:
nvfw-dgxstationa100_24.1.1_240110.tar.gz
Run file name:
nvfw-dgxstationa100_24.1.1_240110.run
Image name:
nvfw-dgxstationa100:24.1.1
ISO image:
DGXSTATIONA100_FWUI-24.1.1-2024-01-16-12-06-01.iso
PXE netboot:
pxeboot-DGXSTATIONA100_FWUI-24.1.1.tgz
Highlights and Changes in This Release#
Operating System Support#
This release is supported with the following DGX OS software:
DGX OS 5.5
DGX OS 6.1.0
EL8-22.08
EL9-23.08
The following issues were fixed in this release:
Fixed BMC Issues#
The pump (fan 6) speed control is removed from the BMC, and the pump pulse-width modulation (PWM) is locked to 40 percent.
The BMC update includes software security enhancements. Refer to NVIDIA Security Bulletin DGX for details.
The following table lists potential security vulnerabilities that have been reported by AMI or third-party vendors. They are addressed in DGX Station A100 BMC version 2.09.00.
Affected BMC versions: All BMC versions prior to 2.09.00
Updated BMC version: 2.09.00
Firmware container version: 24.1.1
CVE IDs Addressed
Vendor (per NVD)
CVE-2022-26872CVE-2022-40258CVE-2023-28863CVE-2023-34472CVE-2023-34329CVE-2023-34471CVE-2023-34473CVE-2023-34337AMI
CVE-2023-25191CVE-2023-25192CVE-2023-28863MITRE
CVE-2021-46279CVE-2021-4228Nozomi Networks Inc.
Fixed SBIOS Issues#
The SBIOS update includes software security enhancements. Refer to NVIDIA Security Bulletin DGX for details.
The following table lists potential security vulnerabilities that have been reported by AMI or AMD. They are addressed in DGX A100 Station SBIOS version 10.20.
Affected SBIOS versions: All SBIOS versions prior to 10.20
Updated SBIOS version: 10.20
Firmware container version: 24.1.1
CVE IDs Addressed
Vendor (per NVD)
CVE-2021-26316CVE-2021-39298CVE-2021-26402CVE-2022-23813CVE-2022-23814CVE-2021-26328CVE-2020-10713CVE-2020-34302CVE-2020-34303CVE-2017-5715CVE-2021-38578CVE-2021-30004CVE-2023-28005CVE-2021-33164CVE-2014-4860CVE-2014-4859CVE-2021-38575CVE-2019-14586CVE-2019-14559CVE-2019-14584CVE-2019-14563CVE-2019-14553CVE-2019-14587CVE-2021-38576AMI
CVE-2020-12954CVE-2020-12961CVE-2021-26331CVE-2021-46771CVE-2021-26335CVE-2021-26315CVE-2020-12946CVE-2021-26353CVE-2021-26352CVE-2021-26351CVE-2021-26337CVE-2021-26338CVE-2020-12951CVE-2021-26390CVE-2021-26370CVE-2020-12944CVE-2021-26332CVE-2020-12988CVE-2021-26329CVE-2021-26330CVE-2021-26321CVE-2021-26323CVE-2021-26324CVE-2021-26325CVE-2021-26326CVE-2021-26322CVE-2021-26327CVE-2021-26312CVE-2021-26408AMD
Known Issues#
Refer to DGX Station A100 Firmware Known Issues.
Contents of the DGX Station A100 System Firmware Update Container#
This container includes the firmware binaries and update utilities for the firmware listed in the following table.
Component |
Version |
Key Changes |
---|---|---|
BMC |
2.09.00 |
New update Refer to DGX Station A100 BMC Changes. |
SBIOS |
10.20 |
New update Refer to DGX Station A100 SBIOS Changes. |
Retimer |
1.0.125 |
No change |
A100 VBIOS |
|
No change |
A800 VBIOS |
|
No change |
M.2 Micron 7300 MTFDHBG1T9TDF SSD |
95420260 |
No change |
U.2 KIOXIA CM6 SSD |
0107 |
New update |
FPGA |
2.71 |
No change |
Storage Backplane |
0.3 |
No change |
NVFlash |
5.821.0 |
New update |
Updating the Firmware to Version 24.1.1#
This section explains how to update the firmware on the system by using the firmware update container. It includes instructions to complete a transitional update for systems that require the update.
Stop all unnecessary system activities.
Caution
While an update is in progress, do not add additional loads on the system, such as Kubernetes jobs or other user jobs or diagnostics. A high GPU workload can disrupt the firmware update process and result in an unusable component.
The commands use the .run
file, but you can also use any method described in Using the DGX Station A100 FW Update Utility.
Determine whether updates are needed by checking the installed versions.
$ sudo ./nvfw-dgxstationa100_24.1.1_240110.run show_version
If there is a
no
in any up-to-date column for updatable firmware, proceed to the next step.If all up-to-date column entries display a
yes
, no updates are required and no additional action is necessary.
Stop the
gdm3
service.$ sudo systemctl stop gdm3
Complete the update for all firmware that is supported by the container.
$ sudo ./nvfw-dgxstationa100_24.1.1_240110.run update_fw all
Depending on the firmware that is updated, you might be prompted to reboot the system or power cycle the system:
If you are prompted to reboot, issue the following command:
$ sudo reboot
If you are prompted to power cycle, issue the following command:
$ sudo ipmitool chassis power cycle
You can verify the update by issuing the following command:
$ sudo ./nvfw-dgxstationa100_24.1.1_240110.run show_version
Here is an example output for a DGX Station A100 40GB system:
BMC DGX Station A100
======================
Image Id Status Location Onboard Version Manifest up-to-date
N/A Online Local 2.09.00 2.09.00 yes
FPGA
========
Onboard version Manifest up-to-date
2.71 2.71 yes
Storage Backplane
==================
Bus Onboard Version Manifest up-to-date
N/A 0.3 0.3 yes
Retimer Loc.
=============
PCIe Slot# Onboard Version Manifest up-to-date
Retimer@slot4 1.0.125 1.0.125 yes
Retimer@slot5 1.0.125 1.0.125 yes
Retimer@slot6 1.0.125 1.0.125 yes
Retimer@slot7 1.0.125 1.0.125 yes
SBIOS
=======
Image Id Onboard Version Manifest up-to-date
N/A L10.20 L10.20 yes
Video BIOS
============
Bus Model Onboard Version Manifest up-to-date
0000:01:00.0 A100-SXM4-40GB 92.00.48.00.01 92.00.48.00.01 yes
0000:47:00.0 A100-SXM4-40GB 92.00.48.00.01 92.00.48.00.01 yes
0000:81:00.0 A100-SXM4-40GB 92.00.48.00.01 92.00.48.00.01 yes
0000:c2:00.0 A100-SXM4-40GB 92.00.48.00.01 92.00.48.00.01 yes
Mass Storage
==============
Drive Name/Slot Model Number Onboard Version Manifest up-to-date
nvme0n1 Micron 7300_MTFDHBG1T9TDF 95420260 95420260 yes
nvme1n1 Kioxia KCM6DRUL7T68 0107 0107 yes