Firmware Changes for NVIDIA DGX H100/H200 Systems

BMC Changes for DGX H100/H200 Systems

Changes in 25.02.12

  • Improved handling of PSU firmware updates.

  • Improved reporting on ERoT status.

  • Provided multiple stability fixes.

  • Improved user management in Redfish.

  • Enhanced stability in communication between the BMC and the HMC.

  • Corrected power supply sensors where no readings were shown.

  • Fixed the inability to query GPU firmware version via Redfish.

  • Addressed an issue where some telemetry information from the GPU tray was unavailable.

  • Resolved an issue related to chassis power cycling using Redfish.

  • Corrected fan readings so that a fan reports 0 when it is not present or has stopped spinning.

  • Fixed an issue where NIC information was associated with the incorrect device in Redfish.
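
One way to spot-check the GPU firmware version fix listed above is to read the Redfish firmware inventory and confirm that every component reports a version. The sketch below is only an illustration: it parses JSON that has already been fetched and relies on the standard Redfish SoftwareInventory properties `Id` and `Version`. The sample component names and versions are hypothetical placeholders, not actual DGX inventory entries.

```python
from typing import Dict, Iterable, List, Optional


def firmware_versions(resources: Iterable[dict]) -> Dict[str, Optional[str]]:
    """Map each SoftwareInventory resource's Id to its Version (None if absent)."""
    versions: Dict[str, Optional[str]] = {}
    for resource in resources:
        component = resource.get("Id")
        if component is None:
            continue  # skip malformed entries rather than guessing a name
        versions[component] = resource.get("Version")
    return versions


def missing_versions(versions: Dict[str, Optional[str]]) -> List[str]:
    """Return the sorted component names that report no usable version."""
    return sorted(name for name, version in versions.items() if not version)


# Hypothetical payloads standing in for fetched SoftwareInventory resources.
sample = [
    {"Id": "GPU_0_FW", "Version": "1.2.3"},
    {"Id": "GPU_1_FW"},  # version missing, as in the symptom described above
]

if __name__ == "__main__":
    found = firmware_versions(sample)
    print(found)
    print("missing:", missing_versions(found))
```

In practice the payloads would typically come from a GET on each member of the `/redfish/v1/UpdateService/FirmwareInventory` collection; the fetching step is omitted here so the sketch stays independent of any particular HTTP client.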

Changes in 24.09.17

  • Fixed an issue where the BMC configuration might reset after an upgrade.

  • Added Redfish API support for creating, modifying, and deleting power policies.

  • Added support for deploying firmware updates using the Web UI.

  • Added Redfish Disable Host Interface, which keeps Redfish functional from the BIOS to the BMC but blocks the direct path from the OS to the BMC.

  • Added ability to specify intermediate certificate authorities in a provisioned certificate chain.

  • Included additional Redfish metrics reports.

  • Fixed SNMP, syslog, and rsyslog issues.

  • Added a per-BMC AES key for encrypting user and password files during the configuration save and restore process.

  • Fixed invalid domain issues in the LDAP/AD settings.

  • Enhanced Redfish diagnostics.

  • General performance improvements in Redfish APIs and IPMI.

  • Added support for ConnectX-7 temperature sensors.

  • Improved resolution for energy counters.

  • Enhanced Remote Media with support for port numbers and domain names.

  • General improvements to the Web UI.

Changes in 24.01.05

  • Fixed an issue where SEL logs might fill up for NVMe drives.

  • Fixed a rare issue where the HMC might not be visible in the BMC after a BMC reboot.

  • Added the ability to control IPMI visibility for the host (Allow All, Limited Command, Hide).

  • Increased the resolution of CPU and GPU energy telemetry via Redfish.

  • Improved the reliability of Redfish inventory.

  • Improved the overall stability of telemetry collection and the handling of invalid or missing values.

  • General improvements to the Web UI.

Changes in 23.09.20

  • Enhanced the Web UI.

  • Enabled GPU information in the Web UI.

  • Enabled NVRAM clear via Redfish.

  • Disabled RMCP/MD5 authentication support after a factory reset.

  • Enabled ERoT background copy.

  • Enabled the default SNMPv3 MIB.

  • The BMC update includes software security enhancements. Refer to the NVIDIA DGX H100 - August 2023 Security Bulletin for details.

SBIOS Changes for DGX H100/H200 Systems

Changes in v1.6.7

  • Fixed an issue where the total memory size reported by the free -m command was smaller than the installed memory.

  • Updated to display full device path for HDD devices under the Redfish BootOptions page.

  • Fixed an issue so that an administrator password is mandatory after version 24.09.1 if an SED drive is encrypted.

Changes in v1.05.03

  • A DIMM that experiences uncorrectable errors at runtime is mapped out on the next boot.

  • Exposed the C1AutoDemotion, C1AutoUnDemotion, and C6Enable setup options.

  • Moved the CPU setup options page under the Advanced page in the setup UI.

  • Added a setup option to restrict host access via IPMI.

  • Provided the NvramVarsProtectionInOs setup option to prevent the OS from changing the NVRAM at runtime.

  • Implemented uncorrectable error rate limiting, disabling of CSMI (correctable system management interrupts) on error flooding and on the core that reported the MLC (middle-level cache) yellow state, and SEL logging when the ANF (advisory non-fatal error) threshold is crossed.

  • Changed the default SncEn setting to disabled.

Changes in v1.01.03

  • Added support for securing KCS.

Changes in v1.01.01

  • Fixed boot option labeling for NIC ports.

  • Fixed U.2 bay slot numbering.

  • Restricted the RestoreROWritePerf option to expert mode only.

  • Exposed the TDX and IFS options in expert user mode only.

nvfwupd Command Changes

Changes in v2.0.5

  • Added support for parallel firmware updates through the YAML configuration file.

  • Added the --json option to the update_fw, show_update_progress, and force_update commands.

  • Added IPv6 support.

  • Deprecated the targets sub-option for multi-target input. Use config.yaml input instead.

Changes in v2.0.4

  • Enhanced automatic server type detection for DGX platforms.

  • The activate_fw command is not supported.

Changes in v2.0.1

  • Improved log sanitization to mask the IP address and login credentials by default.

  • Added support for overriding --target and --package from the command-line interface (CLI) when using a configuration file.

  • Enhanced the --target option with the servertype sub-option to resolve unidentified platform errors.
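
The default log masking described above can be pictured with a small sketch. This is not the tool's actual implementation: the regular expressions, the `user`/`username`/`password` key names, and the `****` mask are assumptions chosen for illustration, and the IP pattern covers IPv4 only.

```python
import re

# Assumed patterns: dotted-quad IPv4 addresses and key=value credentials.
IPV4 = re.compile(r"\b(?:\d{1,3}\.){3}\d{1,3}\b")
CREDENTIAL = re.compile(r"\b(user|username|password)=(\S+)", re.IGNORECASE)


def sanitize(line: str, mask: str = "****") -> str:
    """Return the log line with credential values and IPv4 addresses masked."""
    # Mask credential values first, keeping the key so the line stays readable.
    line = CREDENTIAL.sub(lambda match: f"{match.group(1)}={mask}", line)
    # Then mask anything that looks like an IPv4 address.
    return IPV4.sub(mask, line)


if __name__ == "__main__":
    print(sanitize("connect ip=10.0.0.5 user=admin password=s3cret"))
```

Masking the value while keeping the key preserves enough context to follow the log without exposing the secret. A four-part dotted version string would also match the IPv4 pattern, so a real sanitizer would need a stricter rule.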

Changes in v2.0.0

  • Added support for using a platform-agnostic configuration file.

Changes in v1.1.3

  • Added support for abbreviated firmware update package names.

  • Enhanced the show_update_progress output to provide a full status report for Redfish.

  • Added support for a custom log file path.

  • The command exits with error code 1 on any update failure or tool failure.
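
Because failures are signaled through the exit code, an automation script can gate later steps on the result. The following is a minimal sketch under that assumption: `run_update` is a hypothetical helper, and the demonstration uses stand-in Python commands rather than a real nvfwupd invocation, whose arguments depend on your environment.

```python
import subprocess
import sys
from typing import Sequence


def run_update(argv: Sequence[str]) -> bool:
    """Run an update command and return True only when it exits with code 0."""
    result = subprocess.run(argv, capture_output=True, text=True)
    if result.returncode != 0:
        # Any non-zero code is treated as a failure; code 1 is the documented case.
        print(f"update failed (exit code {result.returncode})", file=sys.stderr)
        return False
    return True


if __name__ == "__main__":
    # Stand-in commands that mimic a failing and a succeeding tool run.
    failing = [sys.executable, "-c", "import sys; sys.exit(1)"]
    passing = [sys.executable, "-c", "import sys; sys.exit(0)"]
    print(run_update(failing), run_update(passing))
```

Passing the command as an argument list rather than a shell string avoids quoting problems when credentials or paths contain special characters.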

Changes in v1.1.1

  • You can update all the system components on the motherboard tray at one time. Previously, you had to update the components individually.

  • You can create a JSON file with network addresses and credentials for multiple systems and automatically update multiple systems serially. Refer to Updating Multiple Systems for more information.