DIMM Upgrade

You can upgrade the system memory by installing 16 additional DIMMs.

DIMM Upgrade Overview

This is a high-level overview of the procedure to add 16 additional dual-inline memory modules (DIMMs) on the DGX A100 system.

  1. Obtain the memory upgrade (16 DIMMs) from NVIDIA Sales.

  2. Shut down the system.

  3. Label all motherboard tray cables and unplug them.

  4. Remove the motherboard tray and place on a solid flat surface.

  5. Remove the motherboard tray lid.

  6. Use the reference diagram on the lid of the motherboard tray to identify the empty DIMM locations (spots with air baffles installed).

  7. Replace the air baffles with the new DIMMs.

  8. Close the lid on the motherboard tray.

  9. Insert the motherboard tray into the system.

  10. Plug in all cables using the labels as a reference.

  11. Power on the system.

  12. Verify that all DIMMs as well as the system are healthy using nvsm.

Upgrading the DIMM

Caution

Static Sensitive Devices: - Be sure to observe best practices for electrostatic discharge (ESD) protection. This includes making sure personnel and equipment are connected to a common ground, such as by wearing a wrist strap connected to the chassis ground, and placing components on static-free work surfaces.

  1. Power down the system.

  2. Label all cables connected to the motherboard tray for easy identification when reconnecting.

  3. Remove the motherboard tray and air baffles.

    Refer to the instructions in the section Removing the Motherboard Tray.

  4. Using the diagram label on the lid as a guide, locate the DIMMs to be installed during the upgrade.

    _images/mb-tray-lid-label-dimms.png
  5. Remove the DIMM air baffles.

    Press down on the side latches at both ends of the air baffle to eject the module from the slot, then pull the air baffle out of the slot.

    _images/dimm-air-baffle-remove.png
  6. Remove 8 DIMMs from CPU-1 slots I1, J1, K1, L1, M, N1, O1, and P1

    Press down on the side latches at both ends of the DIMM to eject the module from the slot, then pull the DIMM out of the slot.

    _images/dimm-remove.png
  7. Carefully insert the 8 DIMMS that you just removed into CPU-0 slots A0, B0, C0, D0, E0, F0, G0, and H0.

    1. Make sure the socket latches are open.

    2. Position the DIMM over the socket, making sure that the notch on the DIMM lines up with the key in the slot, then press the DIMM down into the socket until the side latches click in place.

      _images/dimm-insert.png
    3. Make sure that the latches are up and locked in place.

  8. Install the new DIMMs from the upgrade kit to CPU-1 slots I0, I1, J0, J1, K0, K1, L0, L1, M0, M1, N0, N1, O0, O1, P0, and P1.

    1. Make sure the socket latches are open.

    2. Position the DIMM over the socket, making sure that the notch on the DIMM lines up with the key in the slot, then press the DIMM down into the socket until the side latches click in place.

      _images/dimm-insert.png
    3. Make sure that the latches are up and locked in place.

  9. Install the three motherboard air baffles, replace the motherboard tray lid and then install the motherboard tray.

    Refer to the instructions in the section Reinstalling the Motherboard Tray.

  10. Connect all the cables to the motherboard tray.

  11. Install all the power cords.

  12. Power on the system and log in.

  13. Confirm that the total memory is now 2 TB.

    $ lsmem
    
    Total online memory:       2T
    
  14. Confirm that the system is healthy.

    $ sudo nvsm show health