Software Update
Clone the category in BCM
cmsh category clone <dgx-gb200> <new-dgx-gb200> commit
Clone the OS image
cmsh softwareimage clone <dgx-image> <new-dgx-image> commit
Point the new category at the new image
cmsh category set <new-dgx-gb200> softwareimage <new-dgx-image> commit
Enter the image to make changes
cm-chroot /cm/images/<new-dgx-image>/
Create DOCA Repo based on Architecture
x86_64:
dd status=none of=/etc/apt/sources.list.d/doca.sources << EOF
Types: deb
URIs: https://linux.mellanox.com/public/repo/doca/baseos8-latest/ubuntu24.04/x86_64/
Suites: /
Signed-By: /usr/share/keyrings/GPG-KEY-Mellanox.gpg
EOF
arm64:
dd status=none of=/etc/apt/sources.list.d/doca.sources << EOF
Types: deb
URIs: https://linux.mellanox.com/public/repo/doca/baseos8-latest/ubuntu24.04/arm64-sbsa/
Suites: /
Signed-By: /usr/share/keyrings/GPG-KEY-Mellanox.gpg
EOF
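The two repository definitions differ only in the last path component of the URI, so the choice can be scripted. The following is a minimal sketch, not part of the official procedure: the function names `doca_repo_uri` and `doca_sources_stanza` are hypothetical, and the architecture is passed in explicitly because `uname -m` inside a chroot reports the host's architecture, which may not match the image.

```shell
#!/bin/sh
# Hypothetical helper: map an architecture name to the DOCA repo URI
# used in the steps above.
doca_repo_uri() {
    base="https://linux.mellanox.com/public/repo/doca/baseos8-latest/ubuntu24.04"
    case "$1" in
        x86_64)        echo "$base/x86_64/" ;;
        aarch64|arm64) echo "$base/arm64-sbsa/" ;;
        *)             echo "unsupported architecture: $1" >&2; return 1 ;;
    esac
}

# Hypothetical helper: print the deb822 stanza for the given architecture.
# Review the output before writing it to
# /etc/apt/sources.list.d/doca.sources inside the image.
doca_sources_stanza() {
    uri=$(doca_repo_uri "$1") || return 1
    printf 'Types: deb\nURIs: %s\nSuites: /\nSigned-By: /usr/share/keyrings/GPG-KEY-Mellanox.gpg\n' "$uri"
}
```

For example, `doca_sources_stanza aarch64` prints the arm64 stanza to standard output, which can be inspected and then redirected into the sources file.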
Install the latest DGX OS packages.
For a detailed DGX OS update guide, please refer to https://docs.nvidia.com/dgx/dgx-os-7-user-guide/upgrading-the-os.html#performing-package-upgrades-using-the-cli
1. Update the internal database with the list of available packages and their versions.
apt update
2. Review the packages that will be upgraded.
apt full-upgrade -s
3. Upgrade to the latest version.
apt full-upgrade
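The review in step 2 can be easier to read as a plain list of package names. The sketch below assumes the usual simulation output, in which each package to be installed or upgraded appears on a line beginning with `Inst`. The function name `pending_upgrades` is hypothetical.

```shell
#!/bin/sh
# Hypothetical helper: list the packages a simulated upgrade would
# install or upgrade. Reads simulation output on stdin; on "Inst"
# lines the package name is the second field.
pending_upgrades() {
    awk '$1 == "Inst" { print $2 }'
}

# Possible use inside the image (apt-get dist-upgrade is the apt-get
# equivalent of apt full-upgrade and is better suited to pipelines):
#   apt-get -s dist-upgrade | pending_upgrades
```

Saving this list before the real upgrade gives a record to compare against when verifying the installation afterwards.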
Verify installations:
# Check DOCA packages
sudo dpkg -l | grep <Expected DOCA Ver>
# Check driver package
sudo dpkg -l | grep <Expected Driver ver>
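A plain `grep` over `dpkg -l` also matches packages that were removed but still have configuration files left behind (status `rc`). The following sketch shows one way to restrict the check to fully installed packages (status `ii`). The function name `installed_matching` is hypothetical, and the version string is whatever release you expect.

```shell
#!/bin/sh
# Hypothetical helper: print "name version" for fully installed packages
# (status "ii") whose name or version contains the given string.
# Reads `dpkg -l` output on stdin.
installed_matching() {
    awk -v want="$1" '$1 == "ii" && (index($2, want) || index($3, want)) { print $2, $3 }'
}

# Possible use inside the image:
#   dpkg -l | installed_matching "<Expected DOCA Ver>"
```

An empty result means no installed package matches the expected version, which is worth investigating before the nodes are rebooted into the new image.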
Exit the chroot; the changes are saved in the image
exit
Assign the compute nodes to the new DGX category
cmsh device foreach -n dgx-nodes[XX-XX] (set category <new-dgx-gb200>) commit
Reboot the compute nodes in the new category (run from cmsh device mode)
reboot -c <new-dgx-gb200>
Verify all components have been upgraded