Troubleshooting

Note

This document is preliminary and subject to change.

Problem Indicator

Symptoms

Cause and Solution

LEDs

System Status LED is blinking for more than 5 minutes

Cause: NVIDIA Onyx (MLNX-OS) software did not boot properly and only firmware is running.

Solution: Connect to the system via the console port, and check the software status. You might need to contact an FAE if the NVIDIA Onyx (MLNX-OS) software did not load properly.

System Status LED is amber

Cause:

  • Critical system fault (CPU error, bad firmware)

  • Over temperature

Solution:

  • Check environmental conditions (room temperature)

Fan Status LED is amber

Cause:

Possible fan issue

Solution:

  • Check that the fan is fully inserted and nothing blocks the airflow

  • Replace the fan FRU if needed

PSU Status LED is red

Cause:

Possible PSU issue

Solution:

  • Check/replace the power cable

  • Replace the PSU if needed

System boot failure while using NVIDIA Onyx (MLNX-OS)

Software upgrade failed on x86 based systems

Solution:

  • Connect the RS232 connector (CONSOLE) to a laptop.

  • Push the system’s reset button.

  • Press the ArrowUp or ArrowDown key during the system boot. GRUB menu will appear. For example:

Copy
Copied!
            

Default image: 'SX_X86_64 SX_3.4.0008 2014-11-10 20:07:51 x86_64' Press enter to boot this image, or any other key for boot menu Booting default image in 3 seconds. Boot Menu ------------------------------------------------------------------- 0: SX_X86_64 SX_3.4.0008 2014-11-10 20:07:51 x86_64 1: SX_X86_64 SX_3.4.0007 2014-10-23 17:27:34 x86_64 ------------------------------------------------------------------- Use the ArrowUp and Arrowdown keys to select which entry is highlighted. Press enter to boot the selected image or 'p' to enter a password to unlock the next set of features. Highlighted entry is 0: "

  • Select previous image to boot by pressing an arrow key and choosing the appropriate image.

System boot failure while using Cumulus Linux

Software upgrade failed on x86 based systems

See Monitoring and Troubleshooting in Cumulus Linux User Guide.

System reset failure in SN3420

When the front panel reset button is pressed, the system does not respond. It either stalls, or continues operating with no reset.

Cause: The reset button is stuck in a pressed position due to physical pressure applied by the front panel.

Solution: The suitable solution depends on the reset reason:

1. For regular system reset, select one of the following commands (according to your Operating System), and run it from the CLI:

DVS OS: reboot

Sonic: reboot

Onyx: reload

Cumulus: sudo reboot

2. In case a reset is required in order to quit a sleep mode that was activated using the halt, poweroff or shutdown commands, the system should be power cycled using the PDU OFF/ON command.

3. If password reset is required, please contact NVIDIA's support team at Networking-support@nvidia.com.

© Copyright 2024, NVIDIA. Last updated on Mar 26, 2024.