> For clean Markdown of any page, append .md to the page URL.
> For a complete documentation index, see https://docs.nvidia.com/dsx/llms.txt.
> For full documentation content, see https://docs.nvidia.com/dsx/llms-full.txt.

# NVIDIA DSX Documentation

> NVIDIA DSX unifies design, simulation, operations, and ecosystem technologies to help build AI factories optimized for tokens per watt.

NVIDIA DSX<img src="https://files.buildwithfern.com/nvidia-dsx.docs.buildwithfern.com/dsx/6a7503dbd4f708992a14bdb0cbc419fd2880370b550afc696af95d3f93cfc17d/_dot_dot_/docs/assets/images/gtc26-dsx-demo-kv.png" alt="NVIDIA DSX" /><p>
  NVIDIA DSX unifies design, simulation, operations, and ecosystem technologies to help build AI factories optimized for tokens per watt.
</p><h2>
  DSX Sim
</h2><p>
  A suite of simulation technologies that together enable partners to design, validate, and operate AI factories before and after physical deployment.
</p><hr /><CardGroup cols={4}>
  <Card title="NVIDIA DSX Air" href="https://docs.nvidia.com/networking-ethernet-software/nvidia-air/">
    High-fidelity logical simulation of NVIDIA hardware and software infrastructure — GPUs, SuperNICs, DPUs, switches — plus validated integrations with storage, security, and orchestration partners.
  </Card>

  <Card title="NVIDIA Omniverse DSX Blueprint for AI Factories" href="https://docs.omniverse.nvidia.com/dsx/latest/index.html">
    An end-to-end framework to design, simulate, build, and operate gigawatt-scale AI factory digital twins.<br /><br />[Browse the DSX Blueprint Github repo](https://github.com/NVIDIA-Omniverse-blueprints/omniverse-dsx-blueprint-for-ai-factories) or get started on [build.nvidia.com](https://build.nvidia.com/nvidia/omniverse-dsx-blueprint-for-ai-factories)
  </Card>

  <Card title="AI Factory Digital Twin Pipeline Samples" href="https://nvidia-omniverse.github.io/aif-pipeline-samples/">
    Sample scripts and presets for creating DSX SimReady USD assets, covering CAD ingestion, optimization, validation, and metadata workflows for digital twin and AI factory applications.<br /><br />[Browse the AIF Pipeline Samples Github repo](https://github.com/NVIDIA-Omniverse/aif-pipeline-samples)
  </Card>
</CardGroup><h2>
  DSX OS
</h2><p>
  The operating layer within the broader DSX portfolio — open source, modular software for lifecycle management, runtime consistency, fleet health, and multi-tenant AI factory operations.
</p><hr /><CardGroup cols={4}>
  <Card title="NVIDIA Cloud Functions (NVCF)" href="https://docs.nvidia.com/nvcf">
    Unified API layer for scaling inference and simulation workloads across one or more Kubernetes clusters.
  </Card>

  <Card title="KAI Scheduler" href="https://github.com/NVIDIA/KAI-Scheduler">
    Scalable Kubernetes scheduler optimized for GPU resource allocation.
  </Card>

  <Card title="Grove" href="https://github.com/ai-dynamo/grove">
    Modular component of NVIDIA Dynamo — provides a Kubernetes API for defining and scaling multi-component AI inference workloads.
  </Card>

  <Card title="Dynamo" href="https://docs.nvidia.com/dynamo">
    Distributed inference-serving framework built to deploy models in multi-node environments.
  </Card>

  <Card title="NVIDIA Fleet Intelligence" href="https://docs.nvidia.com/fleet-intelligence/latest/index.html">
    Agent-based managed service offering continuous GPU health monitoring and predictive failure signals.
  </Card>

  <Card title="NVIDIA Infra Controller" href="https://github.com/NVIDIA/infra-controller-core">
    Bare-metal provisioning and secure lifecycle management for multi-tenant GPU infrastructure.
  </Card>

  <Card title="AI Cluster Runtime" href="https://github.com/NVIDIA/aicr">
    Canonical, continuously validated definition of the NVIDIA-accelerated Kubernetes runtime.
  </Card>

  <Card title="NVSentinel" href="https://github.com/NVIDIA/NVSentinel">
    Open-source, Kubernetes-native GPU monitoring and fault remediation.
  </Card>

  <Card title="NVIDIA DOCA Platform Framework (DPF)" href="https://github.com/NVIDIA/doca-platform">
    Orchestration system to build, deploy, and operate BlueField-accelerated infrastructure services.
  </Card>
</CardGroup><h2>
  DSX Reference Design
</h2><p>
  Generation-specific, validated AI factory architectures covering compute, networking, storage, facilities infrastructure, and hardware cluster design.
</p><Icon icon="regular lock" size="3" color="#76b900" /> (Requires NVIDIA Partner Network membership.)<hr /><CardGroup cols={4}>
  <Card title="NVIDIA Vera Rubin NVL72 Reference Design" href="https://partners.nvidia.com/DocumentDetails?DocID=1151654" icon="regular lock" iconSize={3} iconPosition="left">
    (NVOnline #1151654)
  </Card>

  <Card title="NVIDIA DSX - Vera Rubin Facilities Infrastructure Reference Design" href="https://partners.nvidia.com/DocumentDetails?DocID=1145739" icon="regular lock" iconSize={3} iconPosition="left">
    (NVOnline #1145739)
  </Card>

  <Card title="NVIDIA DSX Facilities Infrastructure Design Guide" href="https://partners.nvidia.com/DocumentDetails?DocID=1152370" icon="regular lock" iconSize={3} iconPosition="left">
    (NVOnline #1152370)
  </Card>
</CardGroup><h2>
  Resources
</h2><hr /><CardGroup cols={4}>
  <Card title="NVIDIA Cloud Partner Software Reference Guide" href="./guides/ncp-software-reference-guide/introduction">
    Infrastructure-native reference for building AI cloud services — software components, architecture patterns, and deployment guidance.
  </Card>

  <Card title="NVIDIA Inference Reference Architecture" href="./guides/inference-ra/introduction">
    Software architecture to help operators build performant, cost-effective inference solutions on NVIDIA platforms.
  </Card>

  <Card title="NVIDIA Requirements for AI Clouds" href="./guides/nvidia-requirements-for-ai-clouds/introduction">
    Primary requirements document covering the full stack of AI cloud infrastructure services — SLAs, security, networking, storage, and operations.
  </Card>

  <Card title="BESS Self-Qualification Guidelines" href="https://docs.nvidia.com/datacenter/dsx/BESS-Self-Qualification-Guidelines.html">
    Defines the partner-run qualification process for a Battery Energy Storage System (BESS) intended to support AI load buffering, demand response (DR), and low voltage ride-through (LVRT) in grid-connected and islanded on-site generation modes.
  </Card>

  <Card title="NVIDIA DSX AI Factory Marketplace" href="https://marketplace.nvidia.com/en-us/enterprise/dsx-infrastructure/">
    Products that have been validated to meet NVIDIA functional requirements for AI factory applications.
  </Card>

  <Card title="NVIDIA Exemplar Cloud" href="https://www.nvidia.com/en-us/data-center/ai-cloud-performance/">
    Improves performance per TCO, security, and reliability for cloud providers with hardware and software recipes, tools, capabilities, and references.<br /><br />[Browse the NVIDIA Performance Benchmarking recipes](https://github.com/NVIDIA/dgxc-benchmarking)
  </Card>

  <Card title="NVIDIA AI Cloud-Ready Validation Initiative" href="https://www.nvidia.com/en-us/data-center/isv-validation-program/">
    Qualifies and validates AI infrastructure and platform software for deployment on NVIDIA-accelerated cloud infrastructure.<br /><br />[Browse the ISV NCP Validation Suite](https://github.com/NVIDIA/ISV-NCP-Validation-Suite)
  </Card>
</CardGroup>