Firmware Changes for NVIDIA DGX H100/H200 Systems#
BMC Changes for DGX H100/H200 Systems#
Changes in 25.02.12#
Improved handling for firmware update of PSUs.
Improved reporting on ERoT status.
Provided multiple stability fixes.
Improved user management in Redfish.
Enhanced stability in communication between the BMC and the HMC.
Corrected power supply sensors where no readings were shown.
Fixed the inability to query GPU firmware version via Redfish.
Addressed an issue where some telemetry information from the GPU tray was unavailable.
Resolved an issue related to chassis power cycling using Redfish.
Corrected fan readings so the fan will reflect 0 if it is not present or stops spinning.
Fixed an issue where NIC information was allocated to the incorrect device in Redfish.
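The GPU firmware version query restored in this release can be illustrated with a short sketch. It assumes the standard DMTF Redfish firmware inventory collection at /redfish/v1/UpdateService/FirmwareInventory. The helper names, the host name, and the sample resource body (including its Id and Version values) are hypothetical and not taken from the DGX documentation; actual component IDs are platform-specific.

```python
import json

# Standard DMTF Redfish collection that lists firmware components.
FIRMWARE_INVENTORY_PATH = "/redfish/v1/UpdateService/FirmwareInventory"


def inventory_url(bmc_host: str) -> str:
    """Build the firmware inventory URL for a BMC host name or IPv4 address."""
    return f"https://{bmc_host}{FIRMWARE_INVENTORY_PATH}"


def firmware_version(member_json: str) -> tuple[str, str]:
    """Return (Id, Version) from one SoftwareInventory resource body."""
    resource = json.loads(member_json)
    # "Version" is optional in the Redfish schema, so fall back to a marker.
    return resource["Id"], resource.get("Version", "unknown")


# Hypothetical response body; the Id and Version values are made up.
SAMPLE_MEMBER = '{"Id": "EXAMPLE_GPU_FW_0", "Version": "0.0.0-example"}'

if __name__ == "__main__":
    print(inventory_url("bmc.example.com"))
    print(firmware_version(SAMPLE_MEMBER))
```

The sketch only builds the URL and parses a response body. Fetching the resource from a real BMC additionally requires authentication and TLS handling, which are omitted here.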
Changes in 24.09.17#
Fixed an issue where the BMC configuration might reset after upgrading.
Added Redfish API support for creating, modifying, and deleting power policies.
Added support for deploying firmware updates using the Web UI.
Redfish Disable Host Interface: keeps Redfish functional from the BIOS to the BMC but prevents the direct path from the OS to the BMC.
Added ability to specify intermediate certificate authorities in a provisioned certificate chain.
Included additional Redfish metrics reports.
Fixed SNMP, syslog, and rsyslog issues.
Added a per-BMC AES key for encrypting user/password files during the configuration save and restore process.
Fixed invalid domain issues in the LDAP/AD settings.
Enhanced Redfish diagnostics.
General performance improvements in Redfish APIs and IPMI.
Added support for ConnectX-7 temperature sensors.
Improved resolution for energy counters.
Enhanced Remote Media with support for port numbers and domain names.
General improvements to the Web UI.
Changes in 24.01.05#
Fixed an issue where SEL logs might fill up for NVMe drives.
Fixed a low-occurrence issue where the HMC might not be visible in the BMC after a BMC reboot.
Added the ability to control IPMI visibility for the host (Allow All, Limited Command, Hide).
Higher resolution for CPU and GPU energy telemetry via Redfish.
Improved reliability of Redfish inventory.
Improved overall stability of telemetry collection and handling of invalid or missing values.
General improvements to the Web UI.
Changes in 23.09.20#
Web UI enhancements.
Enabled GPU Info in the Web UI.
Enabled NVRAM clear via Redfish.
Disabled RMCP / MD5 Auth Support after factory reset.
Enabled ERoT background copy.
Enabled default SNMPv3 MIB.
The BMC update includes software security enhancements. Refer to the NVIDIA DGX H100 - August 2023 Security Bulletin for details.
SBIOS Changes for DGX H100/H200 Systems#
Changes in v1.6.7#
Fixed an issue where the total memory size from the free -m command output is smaller than installed.
Updated to display full device path for HDD devices under the Redfish BootOptions page.
Fixed an issue so that an administrator password is mandatory after version 24.09.1 if the SED drive is encrypted.
Changes in v1.05.03#
A DIMM that experienced uncorrectable errors at runtime will be mapped out on the next boot.
Exposed the C1AutoDemotion, C1AutoUnDemotion, and C6Enable setup options.
Moved the CPU setup options page to under the Advanced page in the setup UI.
Added a setup option to restrict host access via IPMI.
Provided the NvramVarsProtectionInOs setup option to prevent the OS from changing the NVRAM at runtime.
Implemented uncorrectable error rate limiting, disabled CSMI (correctable system management interrupts) on error flooding and on the core that reported the MLC (middle-level cache) yellow state, and added SEL logging when the ANF (advisory non-fatal error) threshold was crossed.
Changed the SncEn default setting to disable.
Changes in v1.01.03#
Added support for securing KCS.
Changes in v1.01.01#
Fixed boot option labeling for NIC ports.
Fixed U.2 bay slot numbering.
Set the RestoreROWritePerf option to expert mode only.
Exposed the TDX and IFS options in expert user mode only.
nvfwupd Command Changes#
Changes in v2.0.5#
Added support for parallel firmware updates through the YAML configuration file.
Added the --json option to the update_fw, show_update_progress, and force_update commands.
Added IPv6 support.
Deprecated the targets sub-option for multi-target input. Use config.yaml input instead.
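IPv6 support matters wherever a BMC address ends up inside a URL, because RFC 3986 requires IPv6 literals to be enclosed in square brackets. The following is a minimal sketch of that rule only. The function name is hypothetical, and it is not taken from nvfwupd.

```python
import ipaddress


def redfish_base_url(host: str) -> str:
    """Return an https base URL, bracketing IPv6 literals as RFC 3986 requires."""
    try:
        addr = ipaddress.ip_address(host)
    except ValueError:
        # Not an IP literal, so treat the input as a host name.
        return f"https://{host}"
    if addr.version == 6:
        return f"https://[{addr}]"
    return f"https://{addr}"


if __name__ == "__main__":
    # Documentation-reserved example addresses (RFC 3849 and RFC 5737).
    for host in ("2001:db8::1", "192.0.2.10", "bmc.example.com"):
        print(redfish_base_url(host))
```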
Changes in v2.0.4#
Enhanced automatic server type detection for DGX platforms.
No support for the activate_fw command.
Changes in v2.0.1#
Improved log sanitization to mask the IP address and login credentials by default.
Added support for the --target and --package override from the command-line interface (CLI) using a configuration file.
Enhanced the --target option with the servertype sub-option to resolve unidentified platform errors.
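To illustrate what masking IP addresses and login credentials in a log line can look like, here is a generic sketch. It is not nvfwupd's implementation: the regular expressions, the key names they match, and the replacement markers are all assumptions chosen for this example.

```python
import re

# Dotted-quad IPv4 addresses (assumed pattern; does not validate octet ranges).
IPV4_RE = re.compile(r"\b(?:\d{1,3}\.){3}\d{1,3}\b")
# key=value or key: value pairs for a few credential-like keys (assumed names).
CRED_RE = re.compile(r"(?i)\b(user(?:name)?|password)(\s*[=:]\s*)(\S+)")


def sanitize(line: str) -> str:
    """Mask credential values first, then IPv4 addresses, in one log line."""
    line = CRED_RE.sub(lambda m: f"{m.group(1)}{m.group(2)}****", line)
    return IPV4_RE.sub("x.x.x.x", line)


if __name__ == "__main__":
    print(sanitize("login to 192.0.2.10 with username=admin password=s3cret"))
```

Masking by default, rather than on request, means logs can be shared for troubleshooting without a separate scrubbing step.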
Changes in v2.0.0#
Added support for using a platform-agnostic configuration file.
Changes in v1.1.3#
Added support for abbreviated firmware update package names.
Enhanced the show_update_progress output to provide a full status report for Redfish.
Added support for a custom log file path.
The command exits with error code 1 for any update failure or tool failure.
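Because a failure is reported through the exit code, a wrapper script can detect it without parsing the output. The sketch below shows one way to do that in Python. The helper name is hypothetical, and a stand-in command that exits with status 1 is used in place of a real nvfwupd invocation.

```python
import subprocess
import sys


def run_and_check(cmd: list[str]) -> bool:
    """Run a command and return True only when it exits with status 0."""
    result = subprocess.run(cmd, capture_output=True, text=True)
    if result.returncode != 0:
        print(f"command failed with exit code {result.returncode}", file=sys.stderr)
        return False
    return True


if __name__ == "__main__":
    # Stand-in for a failing update: a child Python process that exits with 1.
    failing = [sys.executable, "-c", "import sys; sys.exit(1)"]
    if not run_and_check(failing):
        print("update step reported a failure; stopping here")
```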
Changes in v1.1.1#
You can update all the system components on the motherboard tray at one time. Previously, you had to update the components individually.
You can create a JSON file with network addresses and credentials for multiple systems and automatically update multiple systems serially. Refer to Updating Multiple Systems for more information.