Front Fan Module Replacement

Front Fan Module Replacement Overview

This is a high-level overview of the steps needed to replace the front fan modules.

  1. Identify the failed front fan module through the BMC or the fan module LED and submit a service ticket to NVIDIA Enterprise Support.

  2. Get a replacement from NVIDIA Enterprise Support.

  3. Remove the failed fan module using the fan numbering diagram as a reference.

  4. Insert the new fan module.

  5. Confirm the new fan module is working correctly through the BMC or NVSM. sudo nvsm show health

  6. Return the bad fan module using the packaging from the new fan module.

Identifying the Failed Fan Module

There are several ways to determine the faulty fan module to replace.

_images/dgxa100-fan-id.png

Viewing the Fan Module LED

Look for the lit fault LED on the upper right corner of the faulty fan module.

_images/dgxa100-fan-led.png

Using the BMC Dashboard and NVSM

  1. Identify the faulty fan module using the BMC dashboard.

    1. Log on to the BMC.

    2. Click Sensor from the left navigation menu, then review the Normal Sensors section.

      _images/bmc-sensor-psu-carrier.png

      There are two fans in the fan module, identified by SPD_FAN_SYSn_F and SPD_FAN_SYSn_R, where n is the module ID. If either fan fails, then the entire module must be replaced.

  2. Use NVSM to confirm the fan issue.

    $ sudo nvsm show fans
    

    In the output, look for the ‘unhealthy’ status for the same fan.

Replacing and Returning the Front Fan Module

  1. Remove the new fan module from its packaging and be ready to install it.

  2. Remove the failed fan module by pressing on the release button on the top of the module and pulling on the handle.

    _images/dgxa100-fan-remove.png
  3. Quickly insert the new fan module, observing that the handle release mechanism is facing up.

    Caution

    Replace the fan module within 30 seconds to prevent overheating of the system components.

  4. Confirm that the fan module is healthy working properly by

    • Verifying that the fan module fault LED is not lit.

    • Viewing the state of the fan module on he BMC dashboard.

    • Using NVSM (sudo nvsm show fans)

  5. Use packaging to pack up the bad fan and follow the shipping instructions to return the bad fan to NVIDIA Enterprise Support.