Skip to main content
Ctrl+K
NVIDIA Mission Control Management Plane and Rack Setup with NVIDIA GB200 NVL72 Systems Installation Guide - Home NVIDIA Mission Control Management Plane and Rack Setup with NVIDIA GB200 NVL72 Systems Installation Guide - Home

NVIDIA Mission Control Management Plane and Rack Setup with NVIDIA GB200 NVL72 Systems Installation Guide

NVIDIA Mission Control Management Plane and Rack Setup with NVIDIA GB200 NVL72 Systems Installation Guide - Home NVIDIA Mission Control Management Plane and Rack Setup with NVIDIA GB200 NVL72 Systems Installation Guide - Home

NVIDIA Mission Control Management Plane and Rack Setup with NVIDIA GB200 NVL72 Systems Installation Guide

Table of Contents

Installation Guide

  • Introduction
  • BCM Software Installation
  • BCM Networking Setup
  • Configuration for Node Provisioning
    • Mixed Architecture Setup
    • Software Image Setup
    • Category Creation
    • Initial Setup Verification Checklist
    • Control Plane Node Entries
    • Control Plane Configuration Verification Checklist
    • Automated Rack Import Process
    • Manual Rack Import Process
    • Manual Addition of GB200/GB300 Rack Entries
    • Manual Addition of NVLink Switch Rack Entries
    • GB200/GB300 Rack Configuration Verification Checklist
    • Finalize Headnode Setup
    • Control Plane Power On and Provisioning
    • Unified Fabric Manager (UFM) Pre-Setup
  • High Availability
  • GB200/GB300 Rack Power On and Bring Up
  • GB200/GB300 Rack Firmware Update
  • Deployment Summary Validation Checklist
  • Appendix
  • Configuration for Node Provisioning
  • Manual Addition of NVLink Switch Rack Entries

Manual Addition of NVLink Switch Rack Entries#

This section provides step-by-step instructions for manually adding NVLink Switches to BCM with full Zero Touch Provisioning (ZTP) settings. The process includes:

  • Configuring the NVLink Switch in BCM

  • Installing the NVOS software

  • Applying a startup configuration file

  • Installing the cm-lite-daemon package

Prerequisites#

For adding NVLink Switches to BCM, ensure the following information is available:

  • MAC address for eth0/COMe0 of the NVLink Switch

  • IP address for eth0/COMe0 of the NVLink Switch

  • Management network for eth0/COMe0 of the NVLink Switch

  • Username and password for SSH access to the NVLink Switch

  • If eth1/COMe1 is connected instead, obtain the same information for eth1/COMe1

For setting up BCM to perform ZTP for NVLink Switches, the following information will also be needed:

  • NVOS firmware image files

  • Startup configuration files or templates

Adding NVLink Switch Entries to BCM#

This section covers the basic steps to add NVLink switch entries to BCM’s device database and can be done instead of the steps outlined in the manual rack import process. These steps create the switch device entry and configure basic network connectivity.

  1. Add the switch in BCM, enable cm-lite-daemon, create a new interface using the MAC address and an assigned IP.

    cmsh -c "device; add switch <switch_hostname>; set mac <00:00:00:00:00:00>; set hasclientdaemon yes; set kind nvlink; set disablesnmp yes; commit"
    
    Step 1: Individual commands
    cmsh
    device
    add switch <switch_hostname>
    set mac <00:00:00:00:00:00>
    set hasclientdaemon yes
    set kind nvlink
    set disablesnmp yes
    
  2. Add eth0 and rf0 interfaces and set network configuration.

    cmsh -c "device; use <switch_hostname>; interfaces; add physical eth0; set ip <switch_ip>; set network <ipminet>; add physical rf0; set ip <rf0_ip>; set network <ipminet>; commit"
    

    Note

    The COMe0/1 network (for eth0/eth1) and the rf0 network are usually the same out-of-band (OOB) management network, just running at different speeds.

    To add eth0 and rf0 interfaces and set network configuration:
    cmsh
    device
    use <switch_hostname>
    interfaces
    add physical eth0
    set ip <switch_ip>
    set network <ipminetwork>
    add bmc rf0
    set ip <rf0_ip>
    set network <ipminetwork>
    commit
    
  3. Set management network and verify interface configuration.

    cmsh -c "device; use <switch_hostname>; set managementnetwork <management_network>; commit"
    
    To set management network and verify interface configuration:
    # Exit interface menu and set management network
    ..
    ..
    set managementnetwork <management_network>
    commit
    
    # Verify interface configuration
    interfaces
    use eth0
    show
    
  4. Assign the username and password for SSH access.

    cmsh -c "device; use <switch_hostname>; accesssettings; set username admin; set password <password>; set restport 443; commit"
    
    To assign the username and password for SSH access:
    cmsh
    device
    use <switch_hostname>
    accesssettings
    set username admin
    set password <password>
    set restport 443
    commit
    show
    exit
    

Manual ZTP Settings Configuration in BCM#

This section covers how to manually configure ZTP configuration in BCM without bcm-netautogen.

To configure Zero Touch Provisioning (ZTP) to automatically update NVOS software, firmware, and apply startup configurations, follow these steps to configure the ZTP settings in BCM manually. These settings will be applied to all cloned switch entries.

  1. Enable the API on the switch and create the necessary directory structure.

    cmsh -c "device; use <switch_hostname>; ztpsettings; set scripttemplate nvos-ztp.sh; set jsontemplate nvlink-nvos.json; set enableapi yes; set installlitedaemon yes; commit; exit; initialize"
    
    To enable the API on the switch and create the necessary directory structure:
    cmsh
    device
    use <switch_hostname>
    ztpsettings
    set scripttemplate nvos-ztp.sh
    set jsontemplate nvlink-nvos.json
    set enableapi yes
    set installlitedaemon yes
    show
    commit
    exit
    initialize
    quit
    

    Note

    • Setting installlitedaemon yes will automatically install the cm-lite-daemon on the NVLink switches during the setup process. If it is set to no, manually install cm-lite-daemon on the switches after they have successfully completed the ZTP process. For detailed instructions on manually installing cm-lite-daemon, see the section Install cm-lite-daemon.

    • The initialize command creates a directory for every NVLink switch entry at /cm/local/apps/cmd/etc/htdocs/switch/<switch_name>/. This directory will be used to store startup configuration files and other switch-specific files.

  2. Copy the startup configuration and image files into the BCM repositories.

    # Copy image files to image directory
    cp <nvos_image_file> /cm/local/apps/cmd/etc/htdocs/switch/image/
    
    # Copy startup configuration to switch-specific directory
    # (Created after the initialize step)
    cp <startup_config_file> /cm/local/apps/cmd/etc/htdocs/switch/<switch_name>/
    

    File locations:

    • Image directory: /cm/local/apps/cmd/etc/htdocs/switch/image

    • Startup configuration directory: /cm/local/apps/cmd/etc/htdocs/switch/<switch_name>

    Note

    The startup configuration directory is created automatically after running the initialize command in the previous step.

  3. Modify the startup file to include the password field with hashed password from accesssettings.

    Edit the startup YAML file to include the following in the password field:

    - set:
          system:
          aaa:
             user:
                admin:
                hashed-password: '${CMD_ACCESS_HASHED_PASSWORD}'
    

    Sample complete startup.yaml configuration:

    The following is a complete sample startup.yaml configuration that you can use as a template:
    - header:
          model: n5700
          nvue-api-version: nvue_v1
          rev-id: 1.0
          version: 25.02.4220
    - set:
          interface:
          eth0-1:
             acl:
                ACL_MGMT_INBOUND_CP_DEFAULT:
                inbound:
                   control-plane: {}
                ACL_MGMT_INBOUND_CP_DEFAULT_IPV6:
                inbound:
                   control-plane: {}
                ACL_MGMT_INBOUND_DEFAULT:
                inbound: {}
                ACL_MGMT_INBOUND_DEFAULT_IPV6:
                inbound: {}
                ACL_MGMT_OUTBOUND_CP_DEFAULT:
                outbound:
                   control-plane: {}
                ACL_MGMT_OUTBOUND_CP_DEFAULT_IPV6:
                outbound:
                   control-plane: {}
             type: eth
          lo:
             acl:
                ACL_LOOPBACK_INBOUND_CP_DEFAULT:
                inbound:
                   control-plane: {}
                ACL_LOOPBACK_INBOUND_CP_DEFAULT_IPV6:
                inbound:
                   control-plane: {}
             type: loopback
          system:
          aaa:
             user:
                admin:
                hashed-password: '${CMD_ACCESS_HASHED_PASSWORD}'
    
  4. Add additional parameters to the device in BCM to apply the startup file and image updates during the ZTP process.

    cmsh -c "device; use <switch_hostname>; set nvconfigurationmode file; set nvconfigurationfile <startup_yaml>; ztpsettings; set jsontemplate nvlink-nvos.json; set fm_config_file <fm_config_file>; set image <NVOS_image_name>; set checkimageonboot yes; exit; commit; initialize"
    
    To configure ZTP use the following step-by-step cmsh commands:
    cmsh
    device
    use <switch_hostname>
    
    # Define the startup files matching the name of the file from the folder
    set nvconfigurationmode file
    set nvconfigurationfile <startup_yaml>
    
    # set the fm_config_file so that an NVLink Switch leader can be elected by cm-lite-daemon
    set fm_config_file gb200_nvl72r1_c2g4.cfg
    
    # Define the ZTP Settings
    ztpsettings
    set jsontemplate nvlink-nvos.json
    set image <NVOS_image_name>
    set checkimageonboot yes
    
    exit
    commit
    initialize
    quit
    

    Note

    You need to keep the following in mind when you manually configure ZTP settings:

    • The minimum version of NVOS required for the NVLink Switch is 25.02.2134 to support custom scripts that will do the cm-lite-daemon installation.

    • The Run ZTP on each boot option, when set to Yes, will enable cm-lite-daemon to enable ZTP if it is disabled.

Clone NVLink Switch Entries#

After completing the ZTP settings configuration for the “golden” switch, clone additional NVLink Switch entries for the rack. The ZTP settings will be inherited by all cloned entries. The “golden” switch is the switch that will be used to clone the other switches.

For 9 NVLink switches in a rack:

foreach -o <golden_switch_hostname> -n <switch_hostname_02>..<switch_hostname_09> --next-ip ()

Example:

foreach -o a05-18-p1-nvsw-01 -n a05-19-p1-nvsw-02..a05-26-p1-nvsw-09 --next-ip ()
commit

Note

  • IPs and MACs will have to be updated manually for each cloned entry

  • The ZTP settings configured in the previous section will be applied to all cloned switches

  • Each switch will automatically receive the same NVOS firmware and startup configurations during ZTP

  • Ensure that the directory structure is created for each cloned switch. If it is not, run the initialize command again.

Once all the switches are added to BCM, the following commands can be used to verify the configuration.

Verification Commands#

After completing the configuration for all switches:

List all NVLink switches and check their status:

cmsh -c "device; list -t switch |grep -i nvsw"

Verify ZTP completion:

cmsh -c "device; use <switch_hostname>; ssh"
sudo ztp status

Check lite daemon status:

cmsh -c "device; use <switch_hostname>; latesthealthdata"

Troubleshooting#

Common issues and solutions:

  • ZTP fails: Check network connectivity, verify image and configuration file paths

  • SSH access fails: Verify accesssettings configuration and network connectivity

  • Lite daemon installation fails: Check network access to BCM head node, verify package availability

  • Switch not responding: Verify switch is powered on, check BMC connectivity for remote restart

Log files to check:

  • /var/log/cmdaemon - BCM daemon logs

  • /var/log/switch/<switch_name> - Switch-specific ZTP logs

previous

Manual Addition of GB200/GB300 Rack Entries

next

GB200/GB300 Rack Configuration Verification Checklist

On this page
  • Prerequisites
  • Adding NVLink Switch Entries to BCM
  • Manual ZTP Settings Configuration in BCM
  • Clone NVLink Switch Entries
  • Verification Commands
  • Troubleshooting
NVIDIA NVIDIA

Copyright © 2024-2025, NVIDIA Corporation.

Last updated on Oct 27, 2025.