Overview
The rack firmware update instructions use a common set of tools and commands for the NVIDIA DGX GB200 and DGX GB300 systems. Specific examples may call out GB200, but they apply to both products. Firmware updates for a rack of DGX GB200 or DGX GB300 systems can be performed with the NVIDIA Base Command Manager (BCM) 11 software once all the compute trays, NVLink switch trays, and power shelves are up in BCM. Follow the latest FW/SW recipe for the installation to succeed.
The processes described in this guide apply to DGX GB200 systems running software version 1.2.2 and later.
Note
Compute FW packages for DGX SuperPOD are different from the reference GB200/GB300 architecture package.
Reference: DGX GB200/GB300 FW Files Required for Update on DGX SuperPOD
The following table lists the general file names to expect for the DGX GB200 and DGX GB300 systems. For more information, see the specific DGX GB200/GB300 SW/FW Release Notes on the NVIDIA Enterprise Support Portal. Specific filenames for each release are listed in the section “Multi-Node System Software Stack Package Contents”.
| Component | DGX GB300 Filename | DGX GB200 Filename |
|---|---|---|
| Compute BMC bundle | nvfw_DGX-GBX00_0026_<date>.*_custom_prod-signed.fwpkg | nvfw_DGX-GBX00_0023_<date>.*_custom_prod-signed.fwpkg |
| Compute HMC bundle | nvfw_HGX-GBX00_0026_<date>.*_custom_prod-signed.fwpkg | nvfw_HGX-GBX00_0023_<date>.*_custom_prod-signed.fwpkg |
| BF3 | fw-Bluefield-3-rel-*.bin | fw-Bluefield-3-rel-*.bin |
| CX7/CX8 | fw-ConnectX8-rel-*.bin | fw-ConnectX7-rel-*.bin |
| Switch NVOS | nvos-amd64-*.bin | nvos-amd64-*.bin |
| Switch BMC bundle | nvfw_GB300-P4093_0004.*.fwpkg | nvfw_GB200-P4978_0004.*.fwpkg |
| Switch BIOS bundle | nvfw_GB300-P4093_0006.*.fwpkg | nvfw_GB200-P4978_0006.*.fwpkg |
| Switch CPLD bundle | nvfw_GB300-P4093_0007.*.fwpkg | nvfw_GB200-P4978_0007.*.fwpkg |
| Powershelf PSU | SP-2552-7RD.*.tar | NVIDIA_5500_APP_.*.tar |
| Powershelf PMC | cm14mp2rd.*.tar | common-pmc-3.*tar |
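The filename patterns in the table can be used to sanity-check a download directory before staging files. The following is a minimal Python sketch, not part of any NVIDIA tool. The function name `classify` and the `PATTERNS` dictionary are hypothetical. It assumes that `<date>` and every wildcard in the table stand for "any characters", and that the `X` in `GBX00` may be either a literal `X` or a digit; check the release notes for the exact names in a given release.

```python
import re

def _patterns(compute_id, switch_id, cx, psu, pmc):
    """Build regexes for one product from the general names in the table."""
    return {
        # "GB.00" accepts a literal "GBX00" as well as "GB200"/"GB300" (assumption).
        "Compute BMC bundle": rf"nvfw_DGX-GB.00_{compute_id}_.*_custom_prod-signed\.fwpkg",
        "Compute HMC bundle": rf"nvfw_HGX-GB.00_{compute_id}_.*_custom_prod-signed\.fwpkg",
        "BF3": r"fw-Bluefield-3-rel-.*\.bin",
        "CX7/CX8": rf"fw-{cx}-rel-.*\.bin",
        "Switch NVOS": r"nvos-amd64-.*\.bin",
        "Switch BMC bundle": rf"nvfw_{switch_id}_0004.*\.fwpkg",
        "Switch BIOS bundle": rf"nvfw_{switch_id}_0006.*\.fwpkg",
        "Switch CPLD bundle": rf"nvfw_{switch_id}_0007.*\.fwpkg",
        "Powershelf PSU": psu,
        "Powershelf PMC": pmc,
    }

PATTERNS = {
    "GB300": _patterns("0026", "GB300-P4093", "ConnectX8",
                       r"SP-2552-7RD.*\.tar", r"cm14mp2rd.*\.tar"),
    "GB200": _patterns("0023", "GB200-P4978", "ConnectX7",
                       r"NVIDIA_5500_APP_.*\.tar", r"common-pmc-3.*tar"),
}

def classify(filename, product):
    """Return the table's component name for a filename, or None if nothing matches."""
    for component, pattern in PATTERNS[product].items():
        if re.fullmatch(pattern, filename):
            return component
    return None
```

Calling `classify` on each file in a download directory and flagging every `None` result is one way to catch a package intended for the other product, for example a ConnectX8 image in a GB200 recipe.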
Firmware updates for the compute trays can be done with either of the following tools:
BCM 11 integrated firmware update tool
Standalone nvfwupd tool
GB200/GB300 Compute Tray Firmware Update - General Steps
Ensure that the compute tray BMC has the username “admin” enabled and that its credentials are known. If the “admin” user does not exist or is disabled, create and enable it before updating the compute tray. BCM and any other rack management system should migrate to “admin” as the default BMC account, because the previously used “root” account will be disabled going forward. See Appendix A.1 before proceeding with the update.
If using BCM to do the firmware update:
Place the files in /cm/local/apps/cmd/etc/htdocs/bios/firmware/gb200
Confirm that in the compute tray bmcsettings, the firmware management mode is set to GB200/GB300
Check the current node FW versions against the update packages
Do a dry run to confirm the FW will update to the expected versions
Update the BMC package first (Compute BMC bundle), then the compute tray package (Compute HMC bundle). AUX power cycle the trays after each component update is complete.
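The version check, dry run, and update order described in the steps above can be sketched in Python as follows. This is an illustration only and does not call BCM or nvfwupd. The function names `dry_run_report` and `update_plan` are hypothetical, and the version strings are assumed to have been collected by hand or by other tooling.

```python
# Update order from the steps above: BMC bundle first, then HMC bundle.
COMPUTE_UPDATE_ORDER = ["Compute BMC bundle", "Compute HMC bundle"]

def dry_run_report(current, target):
    """Compare current and package versions per component, in update order.

    `current` and `target` map component name -> version string.
    Returns (component, current, target, status) tuples.
    """
    report = []
    for component in COMPUTE_UPDATE_ORDER:
        have = current.get(component, "unknown")
        want = target[component]
        status = "already at target" if have == want else "will update"
        report.append((component, have, want, status))
    return report

def update_plan(report):
    """List the steps in order, with an AUX power cycle after each component."""
    steps = []
    for component, have, want, _status in report:
        steps.append(f"update {component}: {have} -> {want}")
        steps.append("AUX power cycle trays")
    return steps
```

Reviewing the report before running a real update mirrors the purpose of the dry run: any row whose target version is not the expected one is a sign that the wrong package was staged.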
NVLink Switch Tray Firmware Update - General Steps
Note
CPLD4 applies only to DGX GB200.
Obtain the NVLink Switch firmware
If using BCM to do the firmware update:
Place the files in /cm/local/apps/cmd/etc/htdocs/bios/firmware/GBx00sw
Confirm that in the NVLink Switch bmcsettings, the firmware management mode is set to GB200sw
Check the current NVLink Switch FW versions against the update packages
Do a dry run to confirm the FW will update to the expected versions
Update the tray-level firmware first, in this order:
BMC+FPGA+ERoT (Switch BMC bundle)
CPLD1, CPLD2, CPLD3, CPLD4 (Switch CPLD bundle)
SBIOS+ERoT (Switch BIOS bundle)
Update NVOS from within the OS or by using ZTP (Switch NVOS).
AUX power cycle the trays after each component update is complete.
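The ordering above can be captured in a small planning helper. The following Python sketch is hypothetical (the name `switch_tray_plan` is not part of BCM or NVOS). It only produces a checklist and reads "after each component update" as including the NVOS step, which is an interpretation of the text above.

```python
# Tray-level bundles in the order given above, with the parts each one carries.
TRAY_LEVEL_ORDER = [
    ("Switch BMC bundle", ["BMC", "FPGA", "ERoT"]),
    ("Switch CPLD bundle", ["CPLD1", "CPLD2", "CPLD3", "CPLD4"]),
    ("Switch BIOS bundle", ["SBIOS", "ERoT"]),
]

def switch_tray_plan(product):
    """Return the ordered checklist for one NVLink switch tray.

    `product` is "GB200" or "GB300"; CPLD4 is dropped for anything
    other than GB200, per the note above.
    """
    steps = []
    for bundle, parts in TRAY_LEVEL_ORDER:
        if product != "GB200":
            parts = [p for p in parts if p != "CPLD4"]
        steps.append(f"update {'+'.join(parts)} ({bundle})")
        steps.append("AUX power cycle trays")
    # NVOS comes last, after all tray-level firmware.
    steps.append("update NVOS from within the OS or by using ZTP (Switch NVOS)")
    steps.append("AUX power cycle trays")
    return steps
```

Printing `switch_tray_plan("GB300")` line by line gives a per-tray checklist in which the CPLD step lists only CPLD1 through CPLD3.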