NVIDIA BlueField-3 DPU NIC Firmware Release Notes v32.43.1014

Changes and New Features

Note

Security Hardening Enhancements: This release contains important reliability improvements and security hardening enhancements. NVIDIA recommends upgrading your devices' firmware to this release to improve the devices’ firmware security and reliability.

Info

To generate PLDM packages for firmware updates, users must install and use the MFT version that corresponds with the respective firmware release.

Feature/Change

Description

32.43.1014

Programmable Congestion Control (PCC)

Migrated PCC NP solution from ACE hardware platform to DPA hardware platform. The new capability is applicable to the following 2 modes:

  • PCC_INT_EN=True and PCC_INT_NP_RTT_DATA_MODE=INGRESS_BYTE

  • PCC_INT_EN=True and PCC_INT_NP_RTT_DATA_MODE=NO_DATA

The first mode is used to support ZTRCC RX bytes in RTT response.

HPCC2 Custom Header

Added support for HPCC2 custom header insertion in RTT request packets for DOCA PCC. The capability will be supported when setting ROCE_CC_STEERING_EXT = ENABLED.

Hight Availability for virtio-net-controller

Added support for a second emulation VirtIO blk and net device on the same vHCA to enable switching to the second emulation device and reduce downtime.

RDMA Telemetry

Added the option to indicate an error CQE event on every selected function per eSwitch manager. This indication is defined as a new WQE including the relevant information about the error (such as: syndrome, function_id, timestamp, QPs num etc.).

The feature is configured using a new general object: RDMA-Telemetry object, and depends on the following new caps: HCA_CAP.rdma_telemetry_notification_types and HCA_CAP.rdma_telemetry.

UID Permissions

Extended kernel lockdown permission set. The following sub-operations can now be called by tools (permission TOOLS_RESORCES) using new HCA capability bitmask field: tool_partial_cap.

The 5 sub-operations are:

  • QUERY_HCA_CAP with other function

  • QUERY_VUID with direct data

  • QUERY_ROCE_ADDRESS with other vport

  • SET_HCA_CAP with other function

  • POSTPONE_CONNECTED_QP_TIMEOUT with other vport

The new added caps are:

  • tool_partial_cap.postpone_conn_qp_timeout_other_vport,

  • tool_partial_cap.set_hca_cap_other_func

  • tool_partial_cap.query_roce_addr_other_vport

  • tool_partial_cap.query_vuid_direct_data

  • tool_partial_cap.query_hca_cap_other_func

Cross E-Switch Scheduling

Added support for QoS scheduling across multiple E-Switches grouped in a LAG. VPort members of a Physical Function can be added to a rate group from another Physical Function and rate limits of the group will apply to those VPort members as well.

Jump from NIC_TX to FDB_TX

Added 'table_type_valid' and 'table_type' fields to the steering action (STC) "Jump To Flow" table parameters to enable the user to jump from NIC_TX to FDB_TX and bypass the ACL table.

Jump to TIR or queue from FDB on Tx

Enabled hop reduction by bypassing NIC domain in various use cases. Such action r educes the number of hops (improves PPS) to deal with mass number of flows and devices.

To enable this new capability, a new STC action type "JUMP_TO_FDB_RX" was added to allow jumping into the RX side of a table.

Virtual Quality of Service

Added a new scheduling element type ("TC_ARB") capability in the VQoS domain (Virtual Quality of Service), to support TC arbitration between functions (VPORTs).

Hotplug/Unplug on VirtIO Devices when the Host is Powered OFF

Enabled hotplug/hotunplug during device's power off or power cycle to prevent the device from getting stuck.

2-steps-hotplug

Added support for 2-steps-hotplug capability. The device is plugged with "free" status by default, and it will not appear on the bus until being modified to "hotplug" status.

Bug Fixes

See Bug Fixes in this Firmware Version section.

© Copyright 2024, NVIDIA. Last updated on Nov 12, 2024.