Skip to main content
country_code
Ctrl+K
NVIDIA DGX BasePOD: Deployment Guide Featuring NVIDIA DGX H200/H100 Systems - Home NVIDIA DGX BasePOD: Deployment Guide Featuring NVIDIA DGX H200/H100 Systems - Home

NVIDIA DGX BasePOD: Deployment Guide Featuring NVIDIA DGX H200/H100 Systems

NVIDIA DGX BasePOD: Deployment Guide Featuring NVIDIA DGX H200/H100 Systems - Home NVIDIA DGX BasePOD: Deployment Guide Featuring NVIDIA DGX H200/H100 Systems - Home

NVIDIA DGX BasePOD: Deployment Guide Featuring NVIDIA DGX H200/H100 Systems

Table of Contents

Overview

  • Introduction
  • Hardware Overview
  • Networking
  • Software Overview
  • NFS Storage

BCM/Network Deployment

  • BCM Introduction
  • Network Deployment
  • BCM Headnodes Pre-install Preparation
  • BCM Headnodes Installation
  • Cluster Bring Up

BCM HA and NFS

  • BCM HA
  • Setup shared NFS storage.
  • Testing BCM HA

Deploying Workload Manager

  • Deploy Kubernetes
  • Compute network/ IB Interfaces Configuration

Cluster Validation

  • Validate the GPU status/health
  • Validate the system topology/NVlink
  • Validate the GPU/RDMA access within the container
  • Validate the node level NCCL test with 8 GPUs
  • Validate the cluster level NCCL test with 4 nodes and 32 GPUs

Appendix

  • Site Survey
  • Switch Configurations
Is this page helpful?

Index

NVIDIA NVIDIA
Privacy Policy | Your Privacy Choices | Terms of Service | Accessibility | Corporate Policies | Product Security | Contact

Copyright © 2024-2025, NVIDIA Corporation.

Last updated on Nov 19, 2025.