Network Interface Card Replacement#

Network Card Replacement Overview#

This is a high-level overview of the procedure to replace one or more network cards on the NVIDIA DGX™ B200 system.

  1. Identify the failed card.

  2. Get a replacement Ethernet card from NVIDIA Enterprise Support.

  3. Ensure the system is shut down.

  4. If cables do not reach, label all cables and unplug them from the motherboard tray.

  5. Slide the motherboard out until it locks in place.

  6. Open the rear compartment.

  7. Pull out the failed Ethernet card.

  8. Install the new Ethernet card.

  9. Close the rear motherboard compartment.

  10. Slide the motherboard back into the system.

  11. Plug in all cables using the labels as a reference.

  12. Power on the system.

  13. Test the Ethernet card.

  14. Send the failed unit to NVIDIA Enterprise Support using the packaging provided.

Prepare the System for Replacement#

Usually, a network interface card fails to function for the following reasons:

  • The operating system does not detect the device.

  • The device does not transmit or receive data.

After you rule out external connectivity issues, contact NVIDIA Enterprise Support to receive a replacement card.

When you receive the card, begin the replacement by performing the following actions:

  1. Power off the system.

  2. Open the motherboard tray I/O door to access the rear section of the motherboard. Refer to Motherboard Tray - Opening and Closing the I/O Door for more information.

Remove the Non-Functional Card#

First, turn the locking mechanism 90 degrees so the card can be extracted from the PCI slot:

  1. Confirm the motherboard tray service lid is open and loosen the thumbscrew for the PCI card locking mechanism next to slots 1 and 3:

    _images/dgx-b200-mb-tray-lock-2.png
  2. Release the PCI cards by turning the locking mechanism 90 degrees as shown in the following figure:

    _images/card-riser.png
  3. Pull the PCI Ethernet card out of the slot:

    _images/dgx-b200-pci-eth-remove.png
  4. Remove the card from the system:

    _images/card-remove.png

Install the New Card and Close the Lock#

  1. Insert the new card into the upper PCI slot:

    _images/dgx-b200-pci-eth-insert.png
  2. Turn the locking mechanism to secure the PCI cards:

    _images/dgx-b200-card-close-lock.png
  3. Secure the locking mechanism by tightening the black thumbscrew:

    _images/dgx-b200-mb-tray-io-slot-left-tighten.png

Finalize the Network Interface Card Replacement#

  1. Close the motherboard tray I/O door and insert the motherboard tray. Refer to Motherboard Tray - Opening and Closing the I/O Door for more information.

  2. Power on and boot the system.

  3. Check for network connectivity on the replacement card.

  4. Confirm that the system is operating correctly by running the nvsm command:

    sudo nvsm show health
    
  5. Send the failed unit to NVIDIA Enterprise Support using the packaging provided.