NVIDIA BlueField-2 DPU Firmware Release Notes v24.43.1014
NVIDIA BlueField-2 DPU Firmware Release Notes v24.43.1014

Changes and New Features

Note

Security Hardening Enhancements: This release contains important reliability improvements and security hardening enhancements. NVIDIA recommends upgrading your devices' firmware to this release to improve the devices’ firmware security and reliability.

Info

To generate PLDM packages for firmware updates, users must install and use the MFT version that corresponds with the respective firmware release.

Feature/Change

Description

24.43.1014

RDMA Telemetry

Added the option to indicate an error CQE event on every selected function per eSwitch manager. This indication is defined as a new WQE including the relevant information about the error (such as: syndrome, function_id, timestamp, QPs num etc.).

The feature is configured using a new general object: RDMA-Telemetry object, and depends on the following new caps: HCA_CAP.rdma_telemetry_notification_types and HCA_CAP.rdma_telemetry.

UID Permissions

Extended kernel lockdown permission set. The following sub-operations can now be called by tools (permission TOOLS_RESORCES) using new HCA capability bitmask field: tool_partial_cap.

The 5 sub-operations are:

  • QUERY_HCA_CAP with other function

  • QUERY_VUID with direct data

  • QUERY_ROCE_ADDRESS with other vport

  • SET_HCA_CAP with other function

  • POSTPONE_CONNECTED_QP_TIMEOUT with other vport

The new added caps are:

  • tool_partial_cap.postpone_conn_qp_timeout_other_vport,

  • tool_partial_cap.set_hca_cap_other_func

  • tool_partial_cap.query_roce_addr_other_vport

  • tool_partial_cap.query_vuid_direct_data

  • tool_partial_cap.query_hca_cap_other_func

Jump from NIC_TX to FDB_TX

Added 'table_type_valid' and 'table_type' fields to the steering action (STC) "Jump To Flow" table parameters to enable the user to jump from NIC_TX to FDB_TX and bypass the ACL table.

Jump to TIR or queue from FDB on Tx

Enabled hop reduction by bypassing NIC domain in various use cases. Such action r educes the number of hops (improves PPS) to deal with mass number of flows and devices.

To enable this new capability, a new STC action type "JUMP_TO_FDB_RX" was added to allow jumping into the RX side of a table.

Hotplug/Unplug on VirtIO Devices when the Host is Powered OFF

Enabled hotplug/hotunplug during device's power off or power cycle to prevent the device from getting stuck.

2-steps-hotplug

Added support for 2-steps-hotplug capability. The device is plugged with "free" status by default, and it will not appear on the bus until being modified to "hotplug" status.

Bug Fixes

See Bug Fixes in this Firmware Version section.

© Copyright 2024, NVIDIA. Last updated on Nov 12, 2024.