NVIDIA Mission Control

NVIDIA Mission Control is an integrated AI factory management platform designed to simplify operations, reduce downtime, and accelerate model development for enterprise AI infrastructure. It combines NVIDIA’s operational best practices and AI cluster automation into a single control plane.

NVIDIA Mission Control is an integrated AI factory management platform designed to simplify operations, reduce downtime, and accelerate model development for enterprise AI infrastructure. It combines NVIDIA’s operational best practices and AI cluster automation into a single control plane.


Learn more at Cloud & Data Center -> Mission Control.

DGX Systems

For all DGX systems—excluding DGX GB200/GB300—installation and setup resources are available at https://docs.nvidia.com/dgx-resources/index.html:

  • What's Included with Your Purchase
  • Software Download, Installation & Activation
  • DGX roduct Updates

GB200/GB300 NVL72 Systems

Hardware Setup

For systems with NVIDIA GB200/GB300 NVL72 (including DGX GB200/GB300):

  • Contact your hardware vendor for hardware-specific installation instructions

NVIDIA Mission Control Installation

For self-installation of NVIDIA Mission Control:

  1. Review Release Notes: Consult the Latest Mission Control Release Notes for your specific version.
  2. Review Software Bill of Materials:
  3. Download and Install: Follow the appropriate installation guide:
  4. Follow Technical Guides: Reference the Latest Mission Control Product Documentation for detailed procedures, including:
    • Quick Start Guide
    • Installation and configuration workflows
    • System management procedures

Critical Installation Notice

⚠️ Important: GB200/GB300 NVL72 systems (including DGX GB200/GB300) require NVIDIA Mission Control for installation and management. Do not use DGX B200/B300 installation procedures for these systems.

NVIDIA Mission Control Release Announcements (Latest)

Release 1.2.1

This PDF contains the release notes for the NVIDIA Mission Control 1.2.1 release.
This document contains details of the NVIDIA Mission Control 1.2.1 Bill of Materials for GB200 NVL72.
This document provides additional information and references for DGX customers.
This document provides additional information and references for OEM partners.

Release 1.2

This PDF contains the release notes for the NVIDIA Mission Control 1.2 release. This is the launch release for NVIDIA Mission Control.
This document contains details of the NVIDIA Mission Control 1.2 Bill of Materials for GB200 NVL72.
This document provides additional information and references for DGX customers.
This document provides additional information and references for OEM partners.
This document discusses administration of the features that are supported on the DGX SuperPODs.
This document discusses how to get started with the features that are supported on the DGX SuperPODs.
This document describes the control management plane and rack setup process for NVIDIA DGX GB200/GB300 NLV72 systems.
This document describes the process for installing all of the software components required to enable full NVIDIA Mission Control functionality.
This document details the process for deploying the North-South Networking for the DGX GB200 NVL72 SuperPOD.

What’s included with your purchase?

Your purchase may include one or more of the following from the DGX software stack:

  • NVIDIA Mission Control
    • Recommended software for DGX B200/B300 and DGX GB300/GB200 systems.
    • As a part of integrated software delivery licensing, the following is included with NVIDIA Mission Control:
      • NVIDIA Base Command Manager (BCM) for cluster management and provisioning
      • NVIDIA Run:ai for AI workload management and GPU orchestration
      • NVIDIA Unified Fabric Manager (UFM) for managing InfiniBand scale-out computing environments. May require additional hardware appliances
      • NVIDIA NetQ for unified observability across NVLink and Ethernet, providing real-time visibility, monitoring, and troubleshooting for datacenter network fabrics and NVLink Switch telemetry.
    • NVIDIA Mission Control delivers additional capabilities not included with these other NVIDIA products. These additional capabilities are delivered through NVIDIA Base Command only to customers who purchased NVIDIA Mission Control. To stay up to date on current functionalities and what’s supported on your DGX system, see NVIDIA Mission Control documentation.
    • If you purchased NVIDIA Mission Control: BCM is included as a core component with full cluster management capabilities. Follow the Mission Control installation instructions below.
    • If you purchased NVIDIA AI Enterprise: BCM is included with NVIDIA AI Enterprise for basic cluster management. Installation and activation are handled through the NVIDIA AI Enterprise workflow. See the NVIDIA AI Enterprise section below.
    • Free BCM License (No Purchase Required): If you did not purchase a solution for managing your AI or HPC cluster, NVIDIA Base Command Manager is available for free for up to eight accelerators per system. This free-to-use offer does not include support. Learn more here.
  • NVIDIA AI Enterprise 
    • Recommended for all DGX systems and included in the DGX Software Bundle for DGX systems
    • Provides a suite of software optimized to streamline AI development and deployment. Learn more about it here.
  • DGX Software Bundle
    • Recommended with DGX H200 purchases
    • Includes licensing for NVIDIA AI Enterprise and NVIDIA Base Command (combines Base Command Manager, Magnum IO, UFM, and DGX OS into a single package.)
  • NVIDIA DGX OS 

    DGX OS is now included with your system purchase.

    • DGX OS provides a customized installation of Ubuntu Linux with system-specific optimizations and configurations, additional drivers, and diagnostic and monitoring tools. It provides a stable, fully tested, and supported OS to run AI, machine learning, and analytics applications on DGX Supercomputers. The software packages included in the DGX OS software stack are also available and installable on top of a vanilla Ubuntu distribution and Red Hat Enterprise Linux. Refer to NVIDIA Base OS for more details.
  • DGX System Software and Firmware
    • Includes firmware components for Compute
    • Includes specialized drivers, diagnostic tools, and monitoring capabilities
  • Magnum IO
    • Utilizes storage IO, network IO, in-network compute, and IO management to simplify and speed up data movement, access, and management for multi-GPU, multi-node systems. Learn more about it here
  • Base Command Manager Free License
    • If you did not purchase a solution for managing your AI or HPC cluster, NVIDIA Base Command Manager is available for free for up to eight accelerators per system. This free-to-use offer does not include support. Learn more here.

Software Download, Installation, and Activation :

Installing Software for DGX GB300/GB200 using NVIDIA Mission Control

  • DGX GB300/GB200 systems leverage NVIDIA Mission Control for software installation and activation.
  • Installation is managed through NVIDIA Mission Control workflows, specifically for DGX GB300/GB200 platforms. Note: Please do not use the DGX B300/B200 SuperPOD installation instructions for DGX GB300/GB200 systems, as the processes and tools differ.
  • For assistance with setup or installation, please contact the NVIDIA Installer Services team member for your account.

Components included with DGX GB300/GB200 Mission Control:

  • NVIDIA Base Command Manager (BCM): 
    • Generating License File: 
      Using the PAK ID on the entitlement email, generate your Base Command Manager Product Key from the  NVIDIA Licensing Portal (NLP).  Within the NLP, click the “Base Command Manager” section on the left navigation pane. Then on the Base Command Manager line item, click the 3 vertical dots under the ‘Actions’ column to expose a “Generate key” button. This key will be used to activate your license
    • Downloading and Installing BCM Software: 
      • Download your ISO for Base Command Manager software by visiting the Base Command Manager Download site.  You will need to fill in the following details :
        • Product key 
        • Linux version 
        • Base Command Manager version (if not the most current version) 
      • Should you need support when installing your software, please visit the NVIDIA Enterprise Support Portal. Once you’ve installed the software on your head node, use the 'request-license' command on the head node to activate the license for your cluster. Detailed instructions can be found in chapter 4 of the Installation Manual here.
  • NVIDIA Run:ai:
    • License Generation and Installation
    • Contact for Licenses: You’ll receive a token to access Run:ai installation artifacts. For any assistance, please contact your NVIDIA representative.
    • Software Download: Downloaded from NGC Catalog via NVIDIA Base Command Manager ‘ Run:ai set-up (cm-kubernetes-setup) wizard’ .
    • Installation: Refer to the updated NVIDIA Base Command Manager Run:ai documentation for Base Command Manager Wizard based installation procedure.
  • NVIDIA Unified Fabric Manager (UFM):  
    • Generating License File: 
      • Prepare a list of servers with the MAC address of each server on which you plan to install the UFM software
      • Go to NVIDIA’s NVIDIA Licensing Portal (NLP) and log in using your credentials
      • Click on the Network Entitlements tab. You'll see a list with the serial licenses of all your software products and software product license information and status 
      • Select the license you want to activate and click on the “Actions” button
      • In the MAC Address field, enter the MAC address of the delegated license-registered host. If applicable, in the HA MAC Address field, enter your High Availability (HA) server MAC address. If you have more than one NIC installed on a UFM Server, use any of the MAC addresses
      • Click on Generate License File to create the license key file for the software 
      • Click on Download License File and save it on your local computer
      • If you replace your NIC or UFM server, repeat the process of generating the license to set new MAC addresses. You can only regenerate a license two times. To regenerate the license after that, contact NVIDIA Sales Administration at enterprisesupport@nvidia.com
    • Downloading UFM Software: 
      • Go to NVIDIA’s NVIDIA Licensing Portal (NLP) and log in using your credentials
      • Click on Software Downloads, filter the product family to UFM, and select the relevant version of the software
      •  Click on Download
      • Save the file on your local drive
      • Click Close
  • NVIDIA NetQ
    • License Generation
      • Preparation:
        • Prepare a list of servers with MAC addresses for each server where NetQ software will be installed
      • Generate License:
        • Log into the NVIDIA Licensing Portal (NLP)
        • Click on the "Network Entitlements" tab
        • Review the list of serial licenses for your software products
        • Select the license you want to activate and click "Actions"
        • Enter the MAC address of the delegated license-registered host
        • If applicable, enter your High Availability (HA) server MAC address
        • Note: If multiple NICs are installed on the NetQ Server, use any MAC address
        • Click "Generate License File"
        • Click "Download License File" and save to your local computer
      • License Regeneration:
        • If you replace your NIC or NetQ server, repeat the license generation process
        • Limitation: You can only regenerate a license 2 times
        • For additional regenerations, contact NVIDIA Sales Administration at enterprisesupport@nvidia.com
    • Ethernet
      • License Generation:
        • No license file generation required at this time

    Software Download

    • Download Process:
      1. Log into the NVIDIA Licensing Portal (NLP)
      2. Click on "Software Downloads"
      3. Filter the product family to "NetQ"
      4. Select the relevant software version
      5. Click "Download"
      6. Save the file to your local drive
      7. Click "Close"

Installation Process for All Components: All software components listed above are installed and configured through the NVIDIA Mission Control workflow specifically designed for GB200/GB300 systems. The installation process includes:

  • License Generation: NVIDIA Installer Services will generate and provision all required licenses for your system configuration
  • Software Download: All software packages will be downloaded and prepared by NVIDIA Installer Services
  • Installation and Configuration: Complete installation and system-specific configuration will be performed by NVIDIA Installer Services
  • Integration Testing: End-to-end testing to ensure all components work together seamlessly
  • Documentation and Training: Handover documentation and system-specific training materials

Mission Control Grafana Visualizations

Mission Control- Autonomous Recovery Engine Software Stack: 

  • autonomous job recovery
  • autonomous hardware recovery
  • autonomous hardware recovery-Config files and Runbooks
  • NVIDIA Resiliency Extension (NVRx)

Note: NVIDIA Resiliency Extensions (NVRx) is not bundled with Mission Control. It must be installed separately. Please follow the installation instructions at: https://nvidia.github.io/nvidia-resiliency-ext/

Mission Control Kubernetes Security Policy Files:

Mission Control- Launchpad :

Mission Control- Air-Gap Tool for DGX B200/B300 and GB200/GB300 NVL72 Systems

Important: Do not attempt to manually install or configure these components using standard installation procedures, as NVIDIA GB300/GB200 NVL72 systems require specialized installation workflows managed by NVIDIA Installer Services.

To download, install and activate the software that comes with your purchase, follow these steps:

  • NVIDIA Base Command Manager (BCM): 
    • Generating License File: 
      • Using the PAK ID on the entitlement email, generate your Base Command Manager Product Key from the  NVIDIA Licensing Portal (NLP).  Within the NLP, click the “Base Command Manager” section on the left navigation pane. Then on the Base Command Manager line item, click the 3 vertical dots under the ‘Actions’ column to expose a “Generate key” button. This key will be used to activate your license
    • Downloading and Installing BCM Software: 
      • Download your ISO for Base Command Manager software by visiting the Base Command Manager Download site.  You will need to fill in the following details :
        • Product key 
        • Linux version 
        • Base Command Manager version (if not the most current version) 
      • Should you need support when installing your software, please visit the NVIDIA Enterprise Support Portal. Once you’ve installed the software on your head node, use the 'request-license' command on the head node to activate the license for your cluster. Detailed instructions can be found in chapter 4 of the Installation Manual here.
  • NVIDIA Run:ai:
    • Generating License File and Installation:
      • Please contact runai-order@nvidia.com for your run.AI licenses and installation
      • Download software from NGC using your enterprise credentials
      • Refer to the updated Run:ai documentation for NGC-based installation procedures
  • NVIDIA Unified Fabric Manager (UFM):  
    • Generating License File: 
      • Prepare a list of servers with the MAC address of each server on which you plan to install the UFM software
      • Go to NVIDIA’s NVIDIA Licensing Portal (NLP) and log in using your credentials
      • Click on the Network Entitlements tab. You'll see a list with the serial licenses of all your software products and software product license information and status 
      • Select the license you want to activate and click on the Actions button
      • In the MAC Address field, enter the MAC address of the delegated license-registered host. If applicable, in the HA MAC Address field, enter your High Availability (HA) server MAC address. If you have more than one NIC installed on a UFM Server, use any of the MAC addresses
      • Click Generate License File to create the license key file for the software 
      • Click Download License File and save it on your local computer
      • If you replace your NIC or UFM server, repeat the process of generating the license to set new MAC addresses. You can only regenerate a license two times. To regenerate the license after that, contact NVIDIA Sales Administration at enterprisesupport@nvidia.com
    • Downloading UFM Software: 
      • Go to NVIDIA’s NVIDIA Licensing Portal (NLP) and log in using your credentials
      • Click Software Downloads, filter the product family to UFM, and select the relevant version of the software
      • Click Download
      • Save the file to your local drive
      • Click Close
  • DGX OS
    • DGX OS is include with your DGX system purchase and you can download the DGX OS ISO from the NVIDIA Licensing Portal for re-imaging the system following these steps: 
      • Go to the Download Center 
      • Click [Server/Workstation] -> [DGX] and select All Downloads for your system
      • Click on the download link for the latest ISO release to go to the announcement
      • Download the ISO image that is referenced in the announcement and save it to your local disk
      • For Red Hat Enterprise Linux or vanilla Ubuntu deployments, including other cluster management systems, refer to the User Guides available from NVIDIA Base OS User Guides 
      • Keeping your DGX OS software and firmware up to date is the most important task for protecting your DGX systems. Security-related updates are available from the Ubuntu and NVIDIA repositories. Please refer to the User Guides for upgrade instructions
    • For Base Command Manager 10 and 11, DGX OS doesn't need to be downloaded separately. Just follow the instructions in theBase Command Manager deployment guides.
  • DGX System Software and Firmware 
    • NVIDIA DGX systems have several firmware components. It is important to keep these up to date to avoid security, software, or hardware issues. Please refer to the system firmware update guides listed in the NVIDIA DGX Systems Guides for your system.
  • NVIDIA AI Enterprise
    • Generating License File and Downloading NVIDIA AI Enterprise Software Components: 
The NVIDIA Enterprise Support and Services User Guide is designed to provide comprehensive information on how to effectively access, use, and manage NVIDIA’s Enterprise Support and Service offerings. This guide serves as a central reference for understanding the range of support options, service models, and best practices available to enterprise customers.

This document is intended for both prospective and existing NVIDIA enterprise customers, including IT administrators, technical decision‑makers, and support teams who rely on NVIDIA enterprise products and solutions. It outlines key support workflows, service entitlements, and guidance to help customers maximize value from NVIDIA Enterprise‑branded support services.

Please note that this User Guide is a non‑binding informational document. It is provided solely for reference and informational purposes and does not create or modify any contractual obligations between NVIDIA and its customers. Customers should refer to their official agreements for specific terms and conditions related to support and services.

The goal of this guide is to ensure clarity, consistency, and ease of use when engaging with NVIDIA Enterprise Support, enabling customers to resolve issues efficiently, understand service capabilities, and maintain optimal system performance.

Announcements :

NVIDIA Mission Control Kubernetes Artifacts Now Available via NGC

We’re pleased to announce that Kubernetes‑based NVIDIA Mission Control software artifacts are now available to your organization through the NVIDIA Container Registry (NGC) Catalog.

This enablement provides streamlined access to containerized and Helm‑based NVIDIA Mission Control components designed for Kubernetes environments.

  • Helm charts
  • Container images
  • Kubernetes configuration and policy resources
  • Supporting deployment artifacts

These components are available under the NVIDIA Mission Control Collections in NGC.

The following NVIDIA Mission Control components can now be downloaded and deployed via NGC:

  • Mission Control- autonomous job recovery
  • Mission Control- autonomous hardware recovery
  • Mission Control- grafana visualizations
  • Kubernetes security policies for cluster security hardening
  • Mission Control-LaunchPad
  • Domain power service – early preview

All components require valid NGC access tokens. These tokens are required during installation workflows, including NVIDIA Base Command Manager setup.

You can generate credentials using:

  • Service Keys: https://org.ngc.nvidia.com/service-keys
  • Personal API Keys: https://org.ngc.nvidia.com/setup/api-keys

Action May Be Required: Accessing NVIDIA Mission Control Collection on NGC

To access the NVIDIA Mission Control collections in NGC, your NGC organization must have a registered NVIDIA Mission Control entitlement.

To verify or enable access:

  1. Sign in to NGC: https://org.ngc.nvidia.com
  2. Confirm you are in the correct NGC organization (top‑right corner).
  3. Go to NGC Catalog and search for NVIDIA Mission Control.
  4. If the NVIDIA Mission Control collection is visible and accessible, no action is required.
  5. If the collection is not accessible, complete NVIDIA Mission Control entitlement registration to enable access.

Access to NVIDIA Mission Control Kubernetes artifacts is enabled automatically after entitlement registration.

NVIDIA Mission Control Kubernetes components are installed using documented workflows, including:

  • Installation via NVIDIA Base Command Manager Terminal User Interface (cm‑mission‑control‑setup)
  • Artifacts pulled automatically from NGC
  • Token‑based authentication during install

Refer to the NVIDIA Mission Control Software Installation Guide for component‑specific installation instructions.

For the latest installation guides, resources, manuals, and release information, visit the NVIDIA Mission Control Documentation Portal.

If you need assistance, please contact your NVIDIA representative, NVIDIA Infrastructure Services, or NVIDIA Support.

For NVIDIA Enterprise Support Launch Announcement, please go to: 

https://enterprise-support.nvidia.com/s/announcement/a4zVv000000JhZ7

Sincerely,

NVIDIA Mission Control Product Team

Access NVIDIA Mission Control Documentation here

This widget helps customers determine the appropriate Classless Inter-Domain Routing (CIDR) size required for their project.

It supports both DGX B300/B200 and NVIDIA GB300/GB200 NVL72 systems and provides subnet breakdowns based on the recommended prefix size for Out-of-Band (OOB), In-Band, and Storage networks.

Open the Network Calculation Widget