Software Update#

  1. Clone the category in BCM

    cmsh
    category
    clone <dgx-gb200> <new-dgx-gb200>
    commit
    
  2. Clone the OS image

    cmsh
    softwareimage
    clone <dgx-image> <new-dgx-image>
    commit
    

    Set the new Category to the new image

    cmsh
    category
    set <new-dgx-gb200> softwareimage <new-dgx-image>
    commit
    
  3. Enter the Image to make changes

    cm-chroot /cm/images/new-dgx-image/
    
  4. Create DOCA Repo based on Architecture

    X86:

    dd status=none of=/etc/apt/sources.list.d/doca.sources << EOF
    Types: deb
    URIs: https://linux.mellanox.com/public/repo/doca/baseos8-latest/ubuntu24.04/x86_64/
    Suites: /
    Signed-By: /usr/share/keyrings/GPG-KEY-Mellanox.gpg
    EOF
    

    arm64:

    dd status=none of=/etc/apt/sources.list.d/doca.sources << EOF
    Types: deb
    URIs: https://linux.mellanox.com/public/repo/doca/baseos8-latest/ubuntu24.04/arm64-sbsa/
    Suites: /
    Signed-By: /usr/share/keyrings/GPG-KEY-Mellanox.gpg
    EOF
    
  5. Install the latest DGX OS packages.

    For a detailed DGX OS update guide, please refer to https://docs.nvidia.com/dgx/dgx-os-7-user-guide/upgrading-the-os.html#performing-package-upgrades-using-the-cli

    1. Update the internal database with the list of available packages and their versions.
    apt update
    
    2. Review the packages that will be upgraded.
    apt full-upgrade -s
    
    3. Upgrade to the latest version.
    apt full-upgrade
    

    Verify installations:

    # Check DOCA packages
    sudo dpkg -l | grep <Expected DOCA Ver>
    
    # Check driver package
    sudo dpkg -l | grep <Expected Driver ver>
    
  6. Save changes into the image

    exit
    
  7. Set compute node to DGX Category

    cmsh
    device
    foreach -n dgx-nodes[XX-XX] (set category <new-dgx-gb200>)
    commit
    
  8. Reboot compute nodes

    reboot -c <new-dgx-gb200>
    
  9. Verify all components have been upgraded