General Troubleshooting

  1. Enable sysrq on Arm side:

    Copy
    Copied!
                

    echo "1" > /proc/sys/kernel/sysrq

  2. Send sysrq key from the DPU's BMC:

    Copy
    Copied!
                

    systemctl stop obmc-console@ttyS2.service screen /dev/ttyS2 115200

    Press "Ctrl+a b" to generate a BREAK, then quickly type the desired sysrq command (e.g., h).

  • Ensure that the DPU is placed correctly

  • Make sure the DPU slot and the DPU are compatible

  • Install the DPU in a different PCI Express slot

  • Use the drivers that came with the DPU or download the latest

  • Make sure your motherboard has the latest BIOS

  • Power cycle the server

  • Reseat the DPU in its slot or a different slot, if necessary

  • Try using another cable

  • Reinstall the drivers for the network driver files may be damaged or deleted

  • Power cycle the server

  • Try removing and reinstalling all DPUs

  • Check that cables are connected properly

  • Make sure your motherboard has the latest BIOS

  • Try another port on the switch

  • Make sure the cable is securely attached

  • Check you are using the proper cables that do not exceed the recommended lengths

  • Verify that your switch and DPU port are compatible

  • Check that the latest driver is loaded

  • Check that both the DPU and its link are set to the same speed and duplex settings

© Copyright 2023, NVIDIA. Last updated on Nov 21, 2023.