

The following is a list of general limitations and known issues of the various components of this Mellanox OFED for Linux release.

For the list of old known issues, please refer to Mellanox OFED Archived Known Issues file at: http://www.mellanox.com/pdf/prod_software/MLNX_OFED_Archived_Known_Issues.pdf

Internal Ref. Number | Issue
2248201


Description: During MLNX_OFED installation, the following warning messages related to modules iw_cxgb3 and iw_nes may appear in the log: 

depmod: WARNING: //lib/modules/4.12.14-195-default/kernel/drivers/infiniband/hw/cxgb3/iw_cxgb3.ko disagrees about version of symbol ib_umem_release

depmod: WARNING: //lib/modules/4.12.14-195-default/kernel/drivers/infiniband/hw/nes/iw_nes.ko disagrees about version of symbol ib_unregister_device

These warning messages can be ignored, except when running on a SLES errata kernel, where this issue may prevent the creation of the weak update symbolic links, resulting in the usage of the Inbox driver instead of the MLNX_OFED driver.

Workaround: When running on a SLES errata kernel, make sure to install MLNX_OFED using the flag --add-kernel-support.
Keywords: SLES, RHEL, KMP, weak updates, kmod

Discovered in Release: 5.1-0.6.6.0

2209987


Description: aRFS feature (activated using "ethtool ntuple on") is disabled for kernel 4.1 or below.
Workaround: N/A
Keywords: aRFS

Discovered in Release: 5.1-0.6.6.0

2200320


Description: In case MLNX_OFED is re-installed on a certain system without using --force, the installation may fail requiring the removal of infiniband-diags package.
Workaround: Remove the infiniband-diags package using rpm -e.
Keywords: Installation, infiniband-diags

Discovered in Release: 5.1-0.6.6.0

2248996


Description: Downgrading the firmware version for ConnectX-6 cards using "mlnx_ofed_install --fw-update-only --force-fw-update" fails.
Workaround: Manually downgrade the firmware version - please see Firmware Update Instructions.
Keywords: Firmware, ConnectX-6

Discovered in Release: 5.1-0.6.6.0

2244336


Description: AF_XDP is not functional.
Workaround: N/A
Keywords: AF_XDP

Discovered in Release: 5.1-0.6.6.0

2175930 


Description: When using OFED 5.1 on PPC architectures with kernels v5.5 or v5.6 and an old ethtool utility, a harmless warning call trace may appear in dmesg due to a mismatch between user space and the kernel. The warning call trace mentions ethtool_notify.
Workaround: Update the ethtool utility to version 5.6 on such systems in order to avoid the call trace.
Keywords: PPC, ethtool_notify, kernel

Discovered in Release: 5.1-0.6.6.0

2192791


Description: The packages neohost-backend and neohost-sdk are not properly removed by the uninstallation procedure and may require manual removal before re-installing or upgrading the MLNX_OFED driver.
Workaround: Manually remove the packages by running: rpm -e neohost-backend neohost-sdk
Keywords: NEO-Host, SDK

Discovered in Release: 5.1-0.6.6.0
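
For issue 2192791 above, the manual removal can be guarded so that rpm -e is only called for packages that are actually installed. This is a minimal sketch for RPM-based systems; remove_if_installed is a hypothetical helper name and not part of MLNX_OFED.

```shell
# Sketch: remove leftover packages only if the RPM database lists them.
# remove_if_installed is a hypothetical helper name, not an MLNX_OFED tool.
remove_if_installed() {
    for pkg in "$@"; do
        # rpm -q exits non-zero when the package is not installed
        if rpm -q "$pkg" >/dev/null 2>&1; then
            rpm -e "$pkg"
        fi
    done
}

# Usage (as root, before re-installing or upgrading MLNX_OFED):
#   remove_if_installed neohost-backend neohost-sdk
```

Checking first avoids an rpm error when one of the two packages was already removed by the uninstallation procedure.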

2198764


Description: If MLNX_OFED is installed on a Debian or Ubuntu system that is run in a chroot environment, the openibd service will not be enabled. If the chroot files are then used as the base of a full system, the openibd service is left disabled.
Workaround: Currently, openibd is a sysv-init script that you can enable manually by running: update-rc.d openibd defaults
Keywords: chroot, Debian, Ubuntu, openibd

Discovered in Release: 5.1-0.6.6.0

2237134




Description: Running connection tracking (CT) with FW steering may cause CREATE_FLOW_TABLE command to fail with syndrome.

Workaround: Configure OVS to use a single handler-thread:

# ovs-vsctl set Open_vSwitch . other_config:n-handler-threads=1

Keywords: Connection tracking, ASAP, OVS, FW steering

Discovered in Release: 5.1-0.6.6.0

2239894


Description: Running OpenVSwitch offload with high traffic throughput can cause low insertion rate due to high CPU usage.
Workaround: Reduce the number of combined channels of the uplink using "ethtool -L".
Keywords: Insertion rate, ASAP2

Discovered in Release: 5.1-0.6.6.0
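
For issue 2239894 above, the following sketch reads the current number of combined channels with "ethtool -l" and lowers it with "ethtool -L". Halving the count is only an illustrative assumption, since the source does not state a target value; current_combined and reduce_channels are hypothetical helper names.

```shell
# Sketch: lower the number of combined channels on an uplink.
# Helper names and the "halve it" policy are assumptions for illustration.

# Read "ethtool -l" output on stdin and print the current Combined value.
current_combined() {
    awk '/^Current hardware settings/ {cur=1} cur && /^Combined:/ {print $2; exit}'
}

# Halve the combined channel count of the given device (minimum 1).
reduce_channels() {
    dev="$1"
    cur=$(ethtool -l "$dev" | current_combined)
    [ -n "$cur" ] || return 1
    new=$(( cur / 2 ))
    [ "$new" -ge 1 ] || new=1
    ethtool -L "$dev" combined "$new"
}

# Usage (as root), where <uplink> is the uplink netdev name:
#   reduce_channels <uplink>
```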

2240671


Description: Header rewrite action is not supported over RHEL/CentOS 7.4.
Workaround: N/A
Keywords: ASAP, header rewrite, RHEL, RedHat, CentOS, OS

Discovered in Release: 5.1-0.6.6.0

2242546


Description: Tunnel offload (encap/decap) may cause kernel panic if nf_tables module is not probed.
Workaround: Make sure to probe the nf_tables module before inserting any rule.
Keywords: Kernel v5.7, ASAP, kernel panic

Discovered in Release: 5.1-0.6.6.0
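
For issue 2242546 above, a minimal sketch of probing nf_tables before any rule is inserted. The helper names and the optional listing-path argument are illustrative assumptions; the check also assumes nf_tables is built as a loadable module, so that it appears in /proc/modules once loaded.

```shell
# Sketch: make sure nf_tables is loaded before adding tunnel offload rules.
# module_loaded and ensure_nf_tables are hypothetical helper names.

# Return 0 if the module is listed in a /proc/modules-style file.
# The second argument (listing path) defaults to /proc/modules.
module_loaded() {
    grep -q "^$1 " "${2:-/proc/modules}"
}

# Load nf_tables only when it is missing (requires root).
ensure_nf_tables() {
    module_loaded nf_tables || modprobe nf_tables
}

# Usage (as root, before inserting any rule):
#   ensure_nf_tables
```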

2244416


Description: Configuring "other" channels over one representor is not supported and may cause a call trace.
Workaround: N/A
Keywords: ASAP, SwitchDev, ethtool, representor

Discovered in Release: 5.1-0.6.6.0

2143007


Description: IPsec packets are dropped during heavy traffic due to a bug in net/xfrm Linux Kernel.
Workaround: Make sure the Kernel is modified to apply the following patch: "xfrm: Fix double ESP trailer insertion in IPsec crypto offload".
Keywords: IPsec, xfrm

Discovered in Release: 5.1-0.6.6.0

2225952


Description: VF mirroring with TC policy skip_sw is not supported on RHEL/CentOS 7.4, 7.5 and 7.6 OSs.
Workaround: N/A
Keywords: ASAP2, Mirroring, RHEL, RedHat, OS

Discovered in Release: 5.1-0.6.6.0

2216521


Description: After upgrading MLNX_OFED from v5.0 or earlier, the installation prefix of the ibdev2netdev utility changes to /usr/sbin. Therefore, the utility cannot be found from a shell session that was opened before the upgrade.
Workaround: After installing MLNX_OFED, log out and log in again to refresh the SHELL environment.
Keywords: ibdev2netdev

Discovered in Release: 5.1-0.6.6.0

2202520


Description: Rules with VLAN push/pop, encap/decap and header rewrite actions together are not supported.
Workaround: N/A
Keywords: ASAP2, SwitchDev, VLAN push/pop, encap/decap, header rewrite

Discovered in Release: 5.1-0.6.6.0

2210752


Description: Switching from Legacy mode to SwitchDev mode and vice-versa while TC rules exist on the NIC will result in failure.
Workaround: Before attempting to switch mode, make sure to delete all TC rules on the NIC or stop OpenvSwitch.
Keywords: ASAP2, Devlink, Legacy SR-IOV 

Discovered in Release: 5.1-0.6.6.0

2125036/2125031


Description: Upgrading the MLNX_OFED from an UPSTREAM_LIBS based version to an MLNX_LIBS based version fails unless the driver is uninstalled and then re-installed.
Workaround: Make sure to uninstall and re-install MLNX_OFED to complete the upgrade.
Keywords: Installation, UPSTREAM_LIBS, MLNX_LIBS

Discovered in Release: 5.0-2.1.8.0

2105447


Description: hns_roce warning messages will appear in the dmesg after reboot on Euler2 SP3 OSs.
Workaround: N/A
Keywords: hns_roce, dmesg, Euler

Discovered in Release: 5.0-2.1.8.0

2110321


Description: Multiple driver restarts may cause IPoIB soft lockup.
Workaround: N/A
Keywords: Driver restart, IPoIB

Discovered in Release: 5.0-2.1.8.0

2112251


Description: On kernels 4.10-4.14, when Geneve tunnel's remote endpoint is defined using IPv6, packets larger than MTU are not fragmented, resulting in no traffic sent.

Workaround: Define the Geneve tunnel's remote endpoint using IPv4.
Keywords: Kernel, Geneve, IPv4, IPv6, MTU, fragmentation

Discovered in Release: 5.0-2.1.8.0

2119210


Description: Multiple driver restarts may stress the system and result in mlx5 command check error messages in the log.
Workaround: N/A
Keywords: Driver restart, syndrome, error message

Discovered in Release: 5.0-2.1.8.0

2118956


Description: mlx5dv_dr API does not support sub functions (SFs) as destination actions.
Workaround: Create the SFs only after domain creation.
Keywords: mlx5dv_dr, sub functions, SF

Discovered in Release: 5.0-2.1.8.0

2102902


Description: A kernel panic may occur over RH8.0-4.18.0-80.el8.x86_64 OS when opening kTLS offload connection due to a bug in kernel TLS stack.
Workaround: N/A
Keywords: TLS offload, mlx5e

Discovered in Release: 5.0-2.1.8.0

2111534


Description: A Kernel panic may occur over Ubuntu19.04-5.0.0-38-generic OS when opening kTLS offload connection due to a bug in the Kernel TLS stack.
Workaround: N/A
Keywords: TLS offload, mlx5e

Discovered in Release: 5.0-2.1.8.0

2117845





Description: Relaxed ordering memory regions are not supported when working with CAPI. Registering a memory region with relaxed ordering while CAPI is enabled will result in a registration failure.
Workaround: N/A
Keywords: Relaxed ordering, memory region, MR, CAPI

Discovered in Release: 5.0-2.1.8.0

2083942


Description: The content of file /sys/class/net/<NETIF>/statistics/multicast may be out of date and may display values lower than the real values.
Workaround: Run ethtool -S <NETIF> to show the actual multicast counters and to update the content of file /sys/class/net/<NETIF>/statistics/multicast.
Keywords: Multicast counters

Discovered in Release: 5.0-1.0.0.0
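
For issue 2083942 above, the workaround can be wrapped so that the sysfs file is only read after "ethtool -S" has refreshed it. This is a sketch; read_multicast is a hypothetical helper name, and the optional second argument (sysfs root) exists only to make the function easy to try out.

```shell
# Sketch: refresh the multicast counter via ethtool, then read it from sysfs.
# read_multicast is a hypothetical helper name, not an MLNX_OFED tool.
read_multicast() {
    netif="$1"
    sysroot="${2:-/sys/class/net}"
    # Per the workaround, running ethtool -S updates the sysfs counter
    ethtool -S "$netif" >/dev/null 2>&1 || return 1
    cat "$sysroot/$netif/statistics/multicast"
}

# Usage, where <NETIF> is the network interface name:
#   read_multicast <NETIF>
```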

2035950

Description: An internal error might take place in the firmware when performing any of the following in VF LAG mode while at least one VF of either PF is still bound/attached to a VM:

  1. Removing PF from the bond (using ifdown, ip link or any other function)
  2. Attempting to disable SR-IOV
Workaround: N/A
Keywords: VF LAG, binding, firmware, FW, PF, SR-IOV

Discovered in Release: 5.0-1.0.0.0

2094176

Description: When running in a large scale in VF-LAG mode, bandwidth may be unstable.
Workaround: N/A
Keywords: VF LAG

Discovered in Release: 5.0-1.0.0.0

2044544

Description: When working with OSs with Kernel v4.10, bonding module does not allow setting MTUs larger than 1500 on a bonding interface.
Workaround: Upgrade your Kernel version to v4.11 or above.
Keywords: Bonding, MTU, Kernel

Discovered in Release: 5.0-1.0.0.0
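
For issue 2044544 above, a small check of whether the running kernel already meets the v4.11 requirement from the workaround. This is a sketch that assumes GNU sort with the -V (version sort) option is available; version_ge and bond_mtu_kernel_ok are hypothetical helper names.

```shell
# Sketch: check that the kernel is v4.11 or above before setting a bond MTU > 1500.
# Assumes GNU sort (-V). Helper names are illustrative.

# Return 0 if version $1 is greater than or equal to version $2.
version_ge() {
    [ "$(printf '%s\n%s\n' "$1" "$2" | sort -V | head -n 1)" = "$2" ]
}

# Return 0 if the given kernel release (default: uname -r) is >= 4.11.
bond_mtu_kernel_ok() {
    release="${1:-$(uname -r)}"
    # Strip the distribution suffix, e.g. 4.12.14-195-default -> 4.12.14
    version_ge "${release%%-*}" 4.11
}

# Usage:
#   bond_mtu_kernel_ok || echo "Kernel older than v4.11: bond MTU is limited to 1500"
```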

1882932

Description: Libibverbs dependencies are removed during OFED installation, requiring manual installation of libraries that OFED does not reinstall.
Workaround: Manually install missing packages.
Keywords: libibverbs, installation
Discovered in Release: 5.0-1.0.0.0
2058535

Description: ibdev2netdev command returns duplicate devices with different ports in SwitchDev mode.
Workaround: Use /opt/mellanox/iproute2/sbin/rdma link show command instead.
Keywords: ibdev2netdev
Discovered in Release: 5.0-1.0.0.0
2072568

Description: In RHEL/CentOS 7.2 OSs, adding drop rules when act_gact is not loaded may cause a kernel crash.
Workaround: Preload all needed modules to avoid such a scenario (cls_flower, act_mirred, act_gact, act_tunnel_key and act_vlan).
Keywords: RHEL/CentOS 7.2, Kernel 4.9, call trace, ASAP

Discovered in Release: 5.0-1.0.0.0

2093698

Description: VF LAG configuration is not supported when the NUM_OF_VFS configured in mlxconfig is higher than 64.
Workaround: N/A
Keywords: VF LAG, SwitchDev mode, ASAP

Discovered in Release: 5.0-1.0.0.0

2093746

Description: Devlink health dumps are not supported on kernels lower than v5.3.
Workaround: N/A
Keywords: Devlink, health report, dump

Discovered in Release: 5.0-1.0.0.0

2000590

Description: Sending packets larger than MTU is not supported when working with OVS-DPDK.
Workaround: N/A
Keywords: MTU, OVS-DPDK

Discovered in Release: 5.0-1.0.0.0

2062900

Description: Moving VF from SwitchDev mode to Legacy mode while the representor is being used by OVS-DPDK results in a segmentation fault.
Workaround: To move VF to Legacy mode with no error, make sure to delete the ports from the OVS. 
Keywords: SwitchDev, Legacy, representor, OVS-DPDK

Discovered in Release: 5.0-1.0.0.0

2075942

Description: Huge pages configuration is lost each time the server is rebooted.
Workaround: Re-configure the huge pages after each reboot, or configure them as a kernel parameter.
Keywords: Huge pages, reboot, OVS-DPDK

Discovered in Release: 5.0-1.0.0.0
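
For issue 2075942 above, both options from the workaround can be sketched as follows: a runtime setting through the vm.nr_hugepages sysctl, and a kernel command-line fragment built from the standard default_hugepagesz, hugepagesz and hugepages parameters. The helper names and the example sizes are assumptions; how the fragment is added to the boot loader configuration depends on the distribution and is not shown.

```shell
# Sketch: two ways to configure huge pages. Helper names are hypothetical.

# Option 1: set the number of default-size huge pages at runtime
# (requires root, must be repeated after each reboot).
set_hugepages_runtime() {
    sysctl -w vm.nr_hugepages="$1"
}

# Option 2: print a kernel command-line fragment for a page size and count,
# to be appended to the boot loader's kernel arguments.
hugepage_cmdline() {
    printf 'default_hugepagesz=%s hugepagesz=%s hugepages=%s\n' "$1" "$1" "$2"
}

# Usage:
#   set_hugepages_runtime 1024
#   hugepage_cmdline 1G 4
```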

2083427

Description: For kernels with connection tracking support, neigh update events are not supported, requiring users to have static ARPs to work with OVS and VxLAN.
Workaround: N/A
Keywords: VxLAN, VF LAG, neigh, ARP

Discovered in Release: 5.0-1.0.0.0

2067012

Description: MLNX_OFED cannot be installed on Debian 9.11 OS in SwitchDev mode.
Workaround: Install OFED with the flag --add-kernel-support.
Keywords: ASAP, SwitchDev, Debian, Kernel

Discovered in Release: 5.0-1.0.0.0

2067746

Description: When attaching a second slave to a bond, some bond interface GIDs might disappear.
Workaround: Re-create and re-configure the bond device.
Keywords: Bond, GID

Discovered in Release: 5.0-1.0.0.0

2036572

Description: When using a thread domain and the lockless rdma-core ibv_post_send path, there is an additional CPU penalty due to required barriers around the device MMIO buffer that were omitted in MLNX_OFED.
Workaround: N/A
Keywords: rdma-core, write-combining, MMIO buffer

Discovered in Release: 5.0-1.0.0.0

-

Description: The argparse module is installed by default in Python versions >=2.7 and >=3.2. If an older Python version is used, the argparse module is not installed by default.
Workaround: Install the argparse module manually.
Keywords: Python, MFT, argparse, installation
Discovered in Release: 4.7-3.2.9.0
1997230

Description: Running mlxfwreset or unloading the mlx5_core module while conntrack flows are offloaded may cause a call trace in the kernel.
Workaround: Stop the OVS service before calling mlxfwreset or unloading the mlx5_core module.
Keywords: Conntrack, ASAP, OVS, mlxfwreset, unload
Discovered in Release: 4.7-3.2.9.0
1955352

Description: Moving 2 ports to SwitchDev mode in parallel is not supported.
Workaround: N/A
Keywords: ASAP, SwitchDev
Discovered in Release: 4.7-3.2.9.0
1979958

Description: VxLAN IPv6 offload is not supported over CentOS/RHEL v7.2 OSs.
Workaround: N/A
Keywords: Tunnel, VXLAN, ASAP, IPv6
Discovered in Release: 4.7-3.2.9.0
1980884

Description: Setting VF VLAN, state and spoofchk using ip link tool is not supported in SwitchDev mode.
Workaround: N/A
Keywords: ASAP, ip tool, VF, SwitchDev
Discovered in Release: 4.7-3.2.9.0
1991710

Description: PRIO_TAG_REQUIRED_EN configuration is not supported and may cause call trace.
Workaround: N/A
Keywords: ASAP, PRIO_TAG, mstconfig
Discovered in Release: 4.7-3.2.9.0
1970429

Description: With HW offloading in SR-IOV SwitchDev mode, the fragmented ICMP echo request/reply packets (with length larger than MTU) do not function properly. The correct behavior is for the fragments to miss the offloading flow and go to the slow path. However, the current behavior is as follows.

  • Ingress (to the VM): All echo request fragments miss the corresponding offloading flow, but all echo reply fragments hit the corresponding offloading flow
  • Egress (from the VM): The first fragment still hits the corresponding offloading flow, and the subsequent fragments miss the corresponding offloading flow
Workaround: N/A
Keywords: HW offloading, SR-IOV, SwitchDev, ICMP, VM, virtualization
Discovered in Release: 4.7-3.2.9.0
1967866

Description: Enabling ECMP offload requires the VFs to be unbound and VMs to be shut down.
Workaround: N/A
Keywords: ECMP, Multipath, ASAP2
Discovered in Release: 4.7-3.2.9.0
1921981

Description: On Ubuntu, Debian and RedHat 8 and above OSs, parsing the mfa2 file using mstarchive might result in a segmentation fault.
Workaround: Use mlxarchive to parse the mfa2 file instead. 
Keywords: MFT, mfa2, mstarchive, mlxarchive, Ubuntu, Debian, RedHat, operating system
Discovered in Release: 4.7-1.0.0.1
1840288

Description: MLNX_OFED does not support XDP features on RedHat 7 OS, despite the declared support by RedHat.

Workaround: N/A
Keywords: XDP, RedHat
Discovered in Release: 4.7-1.0.0.1
1821235

Description: When using mlx5dv_dr API for flow creation, for flows which execute the "encapsulation" action or "push vlan" action, metadata C registers will be reset to zero.
Workaround: Use both actions at the end of the flow process.
Keywords: Flow steering
Discovered in Release: 4.7-1.0.0.1
1888574

Description: Kernel support limitations in the current MLNX_OFED version:

  • SR-IOV SwitchDev is only supported on Kernel 4.14 and above, and on RedHat/CentOS 7.4, 7.5 and 7.6.
  • SR-IOV Legacy is only supported on Kernel 4.3 and above, and on RedHat/CentOS 7.4, 7.5, 7.6 and 7.7.
Workaround: N/A
Keywords: SwitchDev, ASAP, Kernel, SR-IOV, RedHat, RHEL
Discovered in Release: 4.7-1.0.0.1
1892663

Description: mlnx_tune script does not support python3 interpreter.
Workaround: Run mlnx_tune with python2 interpreter only.
Keywords: mlnx_tune, python3, python2
Discovered in Release: 4.7-1.0.0.1
1504785

Description: A lost interrupt issue in pass-through virtual machines may prevent the driver from loading, followed by printing managed pages errors to the dmesg.
Workaround: Restart the driver.
Keywords: VM, virtual machine
Discovered in Release: 4.6-1.0.1.1
1764415

Description: Unbinding PFs on LAG devices results in a "Failed to modify QP to RESET" error message.
Workaround: N/A
Keywords: RoCE LAG, unbind, PF, RDMA
Discovered in Release: 4.6-1.0.1.1
1806565

Description: RoCE default GIDs v1 and v2 are derived from the MAC address of the corresponding netdevice's PCI function, and they resemble the IPv6 address. However, in systems where the IPv6 link local address generated does not depend on the MAC address, RoCEv2 default GID should not be used.
Workaround: Use RoCEv2 default GID.
Keywords: RoCE
Discovered in Release: 4.6-1.0.1.1
1834997

Description: When working with VF LAG while the bond device is in active-active mode, traffic on both physical ports may not reach line rate.
Workaround: N/A
Keywords: VF LAG, bonding, bandwidth degradation, fairness
Discovered in Release: 4.6-1.0.1.1
-

Description: Aging is not functional on bond device in RHEL 7.6.
Workaround: N/A
Keywords: VF LAG, ASAP2
Discovered in Release: 4.6-1.0.1.1
1747774

Description: In VF LAG mode, outgoing traffic in load-balanced mode is distributed according to the origin ring; thus, half of the rings are coupled with port 1 and half with port 2. All traffic on the same ring is sent from the same port.
Workaround: N/A
Keywords: VF LAG, ASAP2
Discovered in Release: 4.6-1.0.1.1
1735161

Description: Innova cards do not support InfiniBand mode.
Workaround: N/A
Keywords: Innova, IB, InfiniBand
Discovered in Release: 4.6-1.0.1.1
1787667

Description: NVMe-oF driver of MLNX OFED v4.6-x.x.x.x does not function on SLES12 SP4 and SLES15 SP1 OSs, as they have a built-in NVME driver in the Linux image. Therefore, Mellanox NVME and NVME-oF drivers cannot be loaded.

For tracking purposes of this bug, see Bugzilla issue #1150850 and Bugzilla issue #1150846.

Workaround: Change the kernel configuration of NVMe-oF driver to be "=m" and recompile the kernel. 
Keywords: NVME-oF, NVME, SLES
Discovered in Release: 4.6-1.0.1.1
1753629

Description: A bonding bug found in Kernels 4.12 and 4.13 may cause a slave to become permanently stuck in BOND_LINK_FAIL state. As a result, the following message may appear in dmesg:

bond: link status down for interface eth1, disabling it in 100 ms

Workaround: N/A
Keywords: Bonding, slave
Discovered in Release: 4.6-1.0.1.1
1712068

Description: Uninstalling MLNX_OFED automatically results in the uninstallation of several libraries that are included in the MLNX_OFED package, such as InfiniBand-related libraries.
Workaround: If these libraries are required, reinstall them using the local package manager (yum/dnf).
Keywords: MLNX_OFED libraries
Discovered in Release: 4.6-1.0.1.1
-

Description: Due to changes in libraries, MFT v4.11.0 and below are not forward compatible with MLNX_OFED v4.6-1.0.0.0 and above.

Therefore, with MLNX_OFED v4.6-1.0.0.0 and above, it is recommended to use MFT v4.12.0 and above.

Workaround: N/A
Keywords: MFT compatible
Discovered in Release: 4.6-1.0.1.1
1730840

Description: On ConnectX-4 HCAs, GID index for RoCE v2 is inconsistent when toggling between enabled and disabled interface modes.
Workaround: N/A
Keywords: RoCE v2, GID
Discovered in Release: 4.6-1.0.1.1
1717428

Description: On kernels 4.10-4.14, MTUs larger than 1500 cannot be set for a GRE interface with any driver (IPv4 or IPv6).
Workaround: Upgrade your kernel to any version higher than v4.14.
Keywords: Fedora 27, gretap, ip_gre, ip_tunnel, ip6_gre, ip6_tunnel
Discovered in Release: 4.6-1.0.1.1
1748343

Description: Driver reload takes several minutes when a large number of VFs exists.
Workaround: N/A
Keywords: VF, SR-IOV
Discovered in Release: 4.6-1.0.1.1
1748537

Description: Cannot set max Tx rate for VFs from the ARM.
Workaround: N/A
Keywords: Host control, max Tx rate
Discovered in Release: 4.6-1.0.1.1
1732940

Description: Software counters do not work for representor net devices.
Workaround: N/A
Keywords: mlx5, counters, representors
Discovered in Release: 4.6-1.0.1.1
1733974

Description: Running heavy traffic (such as 'ping flood') while bringing up and down other mlx5 interfaces may result in “INFO: rcu_preempt detected stalls on CPUs/tasks:” call traces.
Workaround: N/A
Keywords: mlx5
Discovered in Release: 4.6-1.0.1.1
1731939


Description: Getting or setting the Forward Error Correction (FEC) configuration is not supported on ConnectX-6 HCAs with 200Gbps speed rate.
Workaround: N/A
Keywords: Forward Error Correction, FEC, 200Gbps

Discovered in Release: 4.6-1.0.1.1

-



Description: On ConnectX-6 HCAs and above, an attempt to configure advertisement (any bitmap) will result in advertising all capabilities.
Workaround: N/A
Keywords: 200Gbps, advertisement, Ethtool
Discovered in Release: 4.6-1.0.1.1
1699289

Description: HW LRO feature is disabled OOB, which results in increased CPU utilization on the Receive side. On ConnectX-5 adapter cards and above, this causes a bandwidth drop for a few streams.

Workaround: Make sure to enable HW LRO in the driver:

ethtool -K <intf> lro on

ethtool --set-priv-flags <intf> hw_lro on

Keywords: HW LRO, ConnectX-5 and above
Discovered in Release: 4.5-1.0.1.0

1403313

Description: Attempting to allocate an excessive number of VFs per PF in operating systems with kernel versions below v4.15 might fail due to a known issue in the Kernel.

Workaround: Make sure to update the Kernel version to v4.15 or above.

Keywords: VF, PF, IOMMU, Kernel, OS

Discovered in Release: 4.5-1.0.1.0

-

Description: NEO-Host is not supported on the following OSs:

  • SLES12 SP3
  • SLES12 SP4
  • SLES15
  • Fedora 28
  • RHEL7.1
  • RHEL7.4 ALT (Pegas1.0)
  • RHEL7.5
  • RHEL7.6
  • XenServer 4.9

Workaround: N/A

Keywords: NEO-Host, operating systems

Discovered in Release: 4.5-1.0.1.0

1521877




Description: On SLES 12 SP1 OSs, a kernel tracepoint issue may cause undefined behavior when inserting a kernel module with a wrong parameter.

Workaround: N/A

Keywords: mlx5 driver, SLES 12 SP1

Discovered in Release: 4.5-1.0.1.0