ConnectX-7 I/O Replacement#

This topic describes how to replace the ConnectX-7 I/O card in the NVIDIA DGX™ B200 system.

ConnectX-7 I/O Card Replacement Overview#

This is a high-level overview of the procedure to replace a ConnectX-7 I/O card.

  1. Identify the failed card.

  2. Get a replacement ConnectX-7 I/O card from NVIDIA Enterprise Support.

  3. Ensure the system is shut down.

  4. If cables do not reach, label all the cables and unplug them from the motherboard tray.

  5. Slide the motherboard out until it locks in place.

  6. Open the rear compartment.

  7. Pull out the card directly above the failed ConnectX-7 to make room for the procedure.

  8. Pull out the ConnectX-7 I/O card.

  9. Remove the IPEX cables from the failed card.

  10. Install the IPEX cables to the new card.

  11. Install the new ConnectX-7 I/O card.

  12. Install the card that goes over the ConnectX-7 card.

  13. Close the rear motherboard compartment.

  14. Slide the motherboard back into the system.

  15. Plug in all cables using the labels as a reference.

  16. Power on the system.

  17. Update the firmware if necessary and test the ConnectX-7 I/O card.

  18. Send the failed unit to NVIDIA Enterprise Support using the packaging provided.

Prepare the System for Replacement#

  1. Identify which I/O card to replace.

    Use the nvsm command or network tools to determine which card failed. After you have this information, contact NVIDIA Enterprise Support to get a replacement.

  2. When the new card arrives, power off the system.

  3. Based on the nvsm output, identify which card needs replacing, the card in slot 1 or slot 2.

    _images/dgx-b200-case-rear.png

Remove the I/O Card above the ConnectX-7 Card to be Replaced#

  1. Pull out the motherboard tray and access the I/O door. Refer to Motherboard Tray - Opening and Closing the I/O Door for information about accessing the I/O door.

  2. Remove the I/O card that is above the ConnectX-7 card. The card can be the M.2 boot drive assembly or a network interface card.

    • Refer to M.2 Boot Drive Assembly Replacement to remove the M.2 boot drive carrier.

      The images at the preceding link show how to remove the boot drive carrier on the right, above the ConnectX-7 card in slot 2. If you need to replace the ConnectX-7 card in slot 1, follow the instructions, but use the thumbscrew on the left side of the motherboard tray.

    • Refer to Network Interface Card Replacement to remove the Ethernet NIC.

Remove the ConnectX-7 Card#

  1. Pull the card out of the slot:

    _images/dgx-h100-cx7-remove-card.png
  2. Before you pull the card too far, remove the white and black IPEX cables from the card.

    The white cable connects to the top of the card and the black cable connects to the bottom (heatsink) of the card:

    _images/dgx-b200-cx7-ipex.png
  3. Follow the instructions in Remove an IPEX Cable to remove the IPEX connectors.

Remove an IPEX Cable#

Repeat this process for both white and black cables.

  1. Locate the IPEX cable attached to the connector:

    _images/ipex-cable-8.png
  2. Lift the locking door:

    _images/ipex-cable-2.png
  3. Push the cable away from the connector:

    _images/ipex-cable-3.png

Install ConnectX-7 Card#

  1. Attach the IPEX cables following the instructions in the figure:

    The white cable connects to the top of the card and the black cable connects to the bottom (heatsink) of the card. These cables need to be installed before inserting the card.

    _images/dgx-b200-cx7-ipex.png
  2. Follow the instructions in Insert an IPEX Cable to insert the IPEX connectors.

  3. Insert the card in the slot:

    Note the two IPEX cables on the right side of the card.

    _images/dgx-b200-connectx-card-installed.png

Insert an IPEX Cable#

Repeat this process for both white and black cables.

  1. Align the IPEX cable to the connector:

    _images/ipex-cable-4.png
  2. Press the cable into the connector:

    _images/dgx-b200-ipex-cable-5.png
  3. Confirm the cable is in the connector:

    _images/ipex-cable-6.png
  4. Close the latching mechanism:

    _images/ipex-cable-7.png
  5. Make sure the cable is locked to the connector on the board:

    _images/ipex-cable-8.png

Install the I/O Card above the ConnectX-7 Card#

  1. Reinstall the I/O card that is above the ConnectX-7 card. Refer to one of the two following procedures:

  2. Close the motherboard tray I/O door and insert the motherboard tray. Refer to Motherboard Tray - Opening and Closing the I/O Door for more information.

Power on the System and Confirm the Replacement#

  1. Power on and boot the system.

  2. Update the firmware on the card.

    For more information, refer to Updating the ConnectX-7 Firmware.

  3. Use the nvsm command to confirm that the system is operating correctly:

    sudo nvsm show health
    
  4. Send the failed unit to NVIDIA Enterprise Support using the packaging provided.