DC-SCM Module Replacement

This topic describes how to replace the DC-SCM module in the NVIDIA DGX™ B300 system.

Caution

Static-Sensitive Devices: Observe best practices for electrostatic discharge (ESD) protection. Ensure that personnel and equipment are connected to a common ground, for example by wearing a wrist strap connected to the chassis ground, and place components on static-free work surfaces.

DC-SCM Module Replacement Overview

This is a high-level overview of the procedure to replace a DC-SCM module.

  1. Confirm the DC-SCM is not functional.

  2. Get a replacement DC-SCM from NVIDIA Enterprise Support.

  3. Power off the system.

  4. Unplug all motherboard cables.

  5. Pull out the motherboard.

  6. Remove the lid.

  7. Remove the right BlueField-3 I/O bay.

  8. Pull out the DC-SCM module.

  9. Install the new DC-SCM module.

  10. Install the right BlueField-3 I/O bay.

  11. Install the motherboard lid.

  12. Slide the motherboard tray into the system.

  13. Connect all the cables.

  14. Power on the system and confirm the DC-SCM is functional.

  15. Send the failed unit to NVIDIA Enterprise Support using the packaging provided.

Prepare for Replacement

Caution

Wear an ESD strap during any procedure that involves touching electronic components.

  1. If the DC-SCM module is not functioning, contact NVIDIA Enterprise Support for help with triage.

  2. When the new part arrives, power off the system.

  3. Follow the instructions in Motherboard Tray - Removal and Installation to pull out the motherboard and remove the lid.

  4. Remove the right BlueField-3 I/O bay to access the DC-SCM module beneath it.

    Note

    Each cable is labeled so that it can be reconnected to the correct position after the procedure.

    _images/dgx-b300-bf3-right-bay.png

Remove the BlueField-3 I/O Bay

  1. After the four cables have been unplugged, press the right release tab and push the bay towards the front.

    _images/dgx-b300-bf3-right-release-tab.png
  2. Carefully route the cables through the opening as the I/O bay is moved out of the motherboard tray.

  3. Finish pulling the I/O bay out of the motherboard tray.

  4. Ensure the motherboard tray levers remain fully extended, as shown in the illustration, so the DC-SCM module can be pulled out.

    _images/dgx-b300-bf3-right-bay-out.png

Remove the DC-SCM Module

  1. Before removing the DC-SCM module from the motherboard, ensure the ejection levers are fully extended so that they do not obstruct it.

    _images/dgx-b300-dc-scm.png
  2. Release the latch on the DC-SCM module as shown in the illustration, and then pull the module out through the front to eject it.

    _images/dgx-b300-dc-scm-eject.png

Insert the DC-SCM Module into the Tray

  1. To install the new DC-SCM, ensure the ejection levers are fully open. Insert the DC-SCM into the lower slot until it locks into place.

    _images/dgx-b300-dc-scm-insert.png

Reconnect the I/O Bay

  1. Route all cables carefully through the opening in the motherboard tray slot.

    Insert the I/O bay into the tray, and then confirm that it has locked in place by checking that the tab is secure.

    _images/dgx-b300-dc-scm-io-bay-insert.png
  2. Connect the two power cables and the two PCIe cables to their correct connectors on the switchboard, following the labels on each cable end.

    _images/dgx-b300-dc-scm-io-bay-connect.png

    To identify the correct connections, refer to this table that maps BlueField-3 card connectors to their corresponding board connectors.

    BlueField-3 I/O Board   Left Slot Installation   Right Slot Installation
    ---------------------   ----------------------   -----------------------
    Cable Label P2          Board connector J9       Board connector J3
    Cable Label P3          Board connector J10      Board connector J4
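
The connector mapping above can also be written as a small lookup, which may be convenient in a service checklist or script. This is a minimal sketch, not an NVIDIA tool: `board_connector` is a hypothetical helper name, and the values are copied directly from the table. This procedure works on the right slot, so the right-slot entries are the ones that apply here.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not part of any NVIDIA tooling): return the board
# connector for a given slot (left|right) and cable label (P2|P3).
# The mapping is copied from the table in this section.
board_connector() {
    local slot="$1" label="$2"
    case "${slot}:${label}" in
        left:P2)  echo "J9"  ;;
        left:P3)  echo "J10" ;;
        right:P2) echo "J3"  ;;
        right:P3) echo "J4"  ;;
        *)
            # Anything not listed in the table is rejected rather than guessed.
            echo "unknown slot/label: ${slot}/${label}" >&2
            return 1
            ;;
    esac
}

board_connector right P2   # prints J3
```

The cable labels and the table remain the authoritative reference; the lookup only restates them.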

Integrate the New DC-SCM and Complete the Installation

  1. Insert the motherboard tray by following the instructions in Motherboard Tray - Removal and Installation.

  2. Power on the system.

  3. Update the system firmware to the latest version.

  4. Confirm the system is healthy by running the following nvsm command.

    sudo nvsm show health
    
  5. Send the failed DC-SCM module to NVIDIA Enterprise Support using the packaging provided.
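
It can be useful to keep a copy of the health-check output with the service record for the replacement. The following is a minimal sketch under that assumption: `save_health_log` and the log directory are hypothetical names chosen for illustration, and the only system command involved is the `sudo nvsm show health` command from the step above.

```shell
#!/usr/bin/env bash
# Hypothetical helper: run a command, show its output on the terminal,
# and save a copy to a timestamped log file in the given directory.
save_health_log() {
    local log_dir="$1"
    shift
    local log_file status

    mkdir -p "$log_dir" || return 1
    log_file="$log_dir/health-$(date +%Y%m%d-%H%M%S).log"

    # tee duplicates the output; PIPESTATUS[0] holds the exit status of the
    # command itself rather than that of tee.
    "$@" 2>&1 | tee "$log_file"
    status="${PIPESTATUS[0]}"

    echo "Output saved to $log_file (exit status $status)" >&2
    return "$status"
}

# On the DGX system after the replacement (the directory is an assumption):
#   save_health_log "$HOME/dc-scm-replacement" sudo nvsm show health
```

The helper only records the output. Interpreting the health report still follows the NVSM documentation and any guidance from NVIDIA Enterprise Support.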