U.2 NVMe Cache Drive Post-Installation Tasks#

This section describes the tasks that you typically need to perform after replacing a U.2 NVMe drive.

Re-creating the RAID Arrays#

  1. Power on the system and log in.

  2. Confirm that all installed drives are visible from the OS by using the nvme command:

    sudo nvme list
    

    The output can indicate two boot drives and eight cache drives, depending on how many are installed in the system.

    Example Output:

    Node             SN                   Model                                    Namespace Usage                      Format           FW Rev
    ---------------- -------------------- ---------------------------------------- --------- -------------------------- ---------------- --------
    /dev/nvme0n1     S4YPNE0N200093       SAMSUNG MZWLJ3T8HBLS-00007               1           3.84  TB /   3.84  TB    512   B +  0 B   EPK9CB5Q
    /dev/nvme1n1     S4YPNE0N200040       SAMSUNG MZWLJ3T8HBLS-00007               1           3.84  TB /   3.84  TB    512   B +  0 B   EPK9CB5Q
    /dev/nvme2n1     S436NA0N106764       SAMSUNG MZ1LB1T9HALS-00007               1          44.44  GB /   1.92  TB    512   B +  0 B   EDA7602Q
    /dev/nvme3n1     S436NA0N106850       SAMSUNG MZ1LB1T9HALS-00007               1          45.18  GB /   1.92  TB    512   B +  0 B   EDA7602Q
    /dev/nvme4n1     S4YPNE0N200144       SAMSUNG MZWLJ3T8HBLS-00007               1           3.84  TB /   3.84  TB    512   B +  0 B   EPK9CB5Q
    /dev/nvme5n1     S4YPNE0N200171       SAMSUNG MZWLJ3T8HBLS-00007               1           3.84  TB /   3.84  TB    512   B +  0 B   EPK9CB5Q
    /dev/nvme6n1     S4YPNE0N200481       SAMSUNG MZWLJ3T8HBLS-00007               1           3.84  TB /   3.84  TB    512   B +  0 B   EPK9CB5Q
    /dev/nvme7n1     S4YPNE0N200094       SAMSUNG MZWLJ3T8HBLS-00007               1           3.84  TB /   3.84  TB    512   B +  0 B   EPK9CB5Q
    /dev/nvme8n1     S4YPNE0N200064       SAMSUNG MZWLJ3T8HBLS-00007               1           3.84  TB /   3.84  TB    512   B +  0 B   EPK9CB5Q
    /dev/nvme9n1     S4YPNE0N200055       SAMSUNG MZWLJ3T8HBLS-00007               1           3.84  TB /   3.84  TB    512   B +  0 B   EPK9CB5Q
    
  3. If the cache volume was encrypted previously with an access key, disable the encryption before re-creating the RAID volume:

    sudo nv-disk-encrypt disable
    

    The disk encryption packages must be installed on the system. For more information, refer to the Managing Self-Encrypting Drives topic in the DGX OS 7.0 User Guide.

  4. Re-create the cache volume and the /raid filesystem:

    configure_raid_array.py -c -f
    

    At the prompt, enter y to confirm the rebuild action.

  5. (Optional) To encrypt the volume with an access key, refer to the Managing Self-Encrypting Drives topic in the DGX OS 7.0 User Guide.

  6. Confirm the volume is healthy:

    sudo nvsm show volumes
    
  7. Send the old drive to NVIDIA Enterprise Support using the packaging from the new drive.