Copyright © 2026, NVIDIA Corporation.

DSX Documentation

NVIDIA DSX™

The NVIDIA DSX platform brings together reference designs, APIs, software libraries, and technologies for designing, simulating, building, and operating AI factories. Explore the features and capabilities of each DSX component, and view linked technical resources.

DSX Sim

A suite of simulation technologies that together enable partners to design, validate, and operate AI factories before and after physical deployment.


NVIDIA DSX Air
High-fidelity logical simulation of NVIDIA hardware and software infrastructure — GPUs, SuperNICs, DPUs, switches — plus validated integrations with storage, security, and orchestration partners.
NVIDIA Omniverse DSX Blueprint for AI Factories
An end-to-end framework to design, simulate, build, and operate gigawatt-scale AI factory digital twins.

Browse the DSX Blueprint GitHub repo or get started on build.nvidia.com
AI Factory Digital Twin Pipeline Samples
Sample scripts and presets for creating DSX SimReady USD assets, covering CAD ingestion, optimization, validation, and metadata workflows for digital twin and AI factory applications.

Browse the AIF Pipeline Samples GitHub repo

DSX OS

Open-source, modular software for building, operating, and scaling AI factory infrastructure, with composable components for lifecycle management, runtime consistency, health automation, resiliency, multi-tenant operations, AI platform services, and more.


NVIDIA Run:ai
Kubernetes-native platform for AI workload and GPU orchestration, designed to maximize accelerated infrastructure utilization.
NVIDIA Cloud Functions (NVCF)
Unified API layer for scaling inference and simulation workloads across one or more Kubernetes clusters.

Browse the NVCF GitHub repo
KAI Scheduler
Open-source, Kubernetes-native scheduler for AI workloads with topology-aware placement and resource allocation.
Grove
Modular component of NVIDIA Dynamo that provides a Kubernetes API for defining and scaling multi-component AI inference workloads.
Dynamo
Distributed inference-serving framework built to deploy models in multi-node environments.

Browse the Dynamo GitHub repo
NVIDIA Fleet Intelligence
Agent-based managed service offering real-time insights into GPU fleet health and integrity.

Browse the Fleet Intelligence Agent GitHub repo
NVIDIA Infra Controller (NICo™)
Bare-metal provisioning and secure lifecycle management for multi-tenant GPU infrastructure.

Browse the NICo GitHub repo
NVIDIA Switch Infrastructure - Config Manager
Open-source network automation and configuration management platform for large-scale datacenter operations.

Browse the Config Manager GitHub repo
AI Cluster Runtime
Canonical, continuously validated definition of the NVIDIA-accelerated Kubernetes runtime.

Browse the AICR GitHub repo
NVSentinel
Open-source, Kubernetes-native GPU monitoring and fault remediation.

Browse the NVSentinel GitHub repo
NVIDIA DOCA Platform Framework (DPF)
Orchestration system to build, deploy, and operate BlueField-accelerated infrastructure services.

DSX MaxLPS & Flex

DSX MaxLPS is a suite of technologies that maximizes AI factory compute throughput and tokens per watt within a fixed power budget by applying intelligent optimizations and dynamically enforcing power policies at the GPU, rack, and workload levels.

DSX Flex orchestrates renewable and hybrid power across utility supply, on-site renewables, and storage. It receives grid signals (load shedding, demand response, pricing events) and adapts AI workloads dynamically.

(NVOnline access required. Contact your NVIDIA representative for details.)
NVIDIA Domain Power Service
Dynamic power allocation software that continuously monitors GPU and rack-level power consumption, reallocating it where needed based on defined power budgets to unlock stranded capacity and optimize overall utilization.
Power Management for NVIDIA Vera Rubin Data Center Systems
(NVOnline #1153760)
Workload Power Profiles
Learn more about workload-aware optimized power profiles for improving AI factory energy efficiency and performance.
NvGrid
Provides grid integration capabilities that enable AI factory power management in response to utility grid signals.
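
The idea of reallocating power within a fixed budget to unlock stranded capacity, as described for the Domain Power Service above, can be illustrated with a toy sketch. This is not the Domain Power Service algorithm or API: the `Rack` type, the `reallocate` function, and the `headroom` and `near_cap` thresholds are all hypothetical, and the sketch assumes the current caps already fit within the budget.

```python
from dataclasses import dataclass


@dataclass
class Rack:
    name: str      # hypothetical rack identifier
    cap_w: float   # currently enforced power cap, in watts
    draw_w: float  # measured power draw, in watts


def reallocate(racks, budget_w, headroom=0.10, near_cap=0.95):
    """Toy policy: reclaim unused cap from idle racks, give it to busy ones.

    Assumes sum of current caps <= budget_w. All thresholds are illustrative.
    """
    new_caps = {}
    constrained = []
    for r in racks:
        if r.draw_w >= near_cap * r.cap_w:
            # Rack is running close to its cap: a candidate for more power.
            constrained.append(r)
            new_caps[r.name] = r.cap_w
        else:
            # Shrink the cap to measured draw plus headroom, never raising it.
            new_caps[r.name] = min(r.cap_w, r.draw_w * (1 + headroom))

    # Whatever the budget still allows is the reclaimed (stranded) capacity.
    spare = budget_w - sum(new_caps.values())
    if constrained and spare > 0:
        share = spare / len(constrained)
        for r in constrained:
            new_caps[r.name] += share
    return new_caps


if __name__ == "__main__":
    racks = [Rack("rack-a", 100.0, 98.0), Rack("rack-b", 100.0, 50.0)]
    print(reallocate(racks, budget_w=200.0))
```

In this example, the lightly loaded rack gives up the part of its cap it is not using, and the rack running near its cap receives it, while the total stays within the 200 W budget. A production policy would also need per-rack hardware limits, rate limiting, and safety margins, none of which are modeled here.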

DSX Exchange

The IT/OT integration and communications hub, coordinating compute, network, energy, power, and cooling plant signals in the AI factory.


DSX Event Bus
NATS-based event bus for AI factory communications and operations.

Browse the DSX Exchange GitHub repo
AsyncAPI Schemas
Event bus schemas for DSX data exchange services.

Browse the AsyncAPI Schemas GitHub repo
BMS Companion Guide
This guide is for System Integrators and BMS contractors who will configure a Building Management System to publish facility data to the DSX Event Bus. It covers the concepts, topic structure, metadata fields, and implementation steps needed to complete a BMS integration.
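
Because the DSX Event Bus is NATS-based, subscribers can be expected to select messages by subject. The sketch below reimplements the standard NATS subject-matching rules in plain Python for illustration: subjects are dot-separated tokens, `*` matches exactly one token, and `>` matches one or more trailing tokens. The subject names used here are hypothetical and are not the DSX topic structure, which is defined in the BMS Companion Guide and the AsyncAPI schemas.

```python
def subject_matches(pattern: str, subject: str) -> bool:
    """Return True if a NATS-style subscription pattern matches a subject."""
    p_tokens = pattern.split(".")
    s_tokens = subject.split(".")
    for i, tok in enumerate(p_tokens):
        if tok == ">":
            # '>' is only valid as the last token and needs at least
            # one remaining subject token to match.
            return i == len(p_tokens) - 1 and len(s_tokens) > i
        if i >= len(s_tokens):
            return False  # pattern is longer than the subject
        if tok != "*" and tok != s_tokens[i]:
            return False  # literal token mismatch
    # Without a trailing '>', the token counts must be equal.
    return len(p_tokens) == len(s_tokens)


if __name__ == "__main__":
    # Hypothetical subjects, for illustration only.
    subject = "site1.bms.chiller.supply_temp"
    for pattern in ("site1.bms.>", "site1.*.chiller.supply_temp", "site1.bms.*"):
        print(pattern, subject_matches(pattern, subject))
```

In a real integration, a NATS client library performs this matching on the server side; the function above only shows how a wildcard subscription relates to the subjects a BMS publishes.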

DSX Reference Designs

Generation-specific, validated AI factory architectures covering compute, networking, storage, facilities infrastructure, and hardware cluster design.

(NVOnline access required. Contact your NVIDIA representative for details.)
NVIDIA Vera Rubin NVL72 Reference Design
(NVOnline #1151654)
NVIDIA DSX - Vera Rubin Facilities Infrastructure Reference Design
(NVOnline #1145739)
NVIDIA DSX Facilities Infrastructure Design Guide
(NVOnline #1152370)
BESS Self-Qualification Guidelines
Defines the partner-run qualification process for a Battery Energy Storage System (BESS) intended to support AI load buffering, demand response (DR), and low voltage ride-through (LVRT) in grid-connected and islanded on-site generation modes.
NVIDIA DSX AI Factory Marketplace
Products that have been validated to meet NVIDIA functional requirements for AI factory applications.

NVIDIA Cloud Partner Resources


NVIDIA Cloud Partner Software Reference Guide
Infrastructure-native reference for building AI cloud services — software components, architecture patterns, and deployment guidance.
NVIDIA Inference Reference Architecture
Software architecture to help operators build performant, cost-effective inference solutions on NVIDIA platforms.
NVIDIA Requirements for AI Clouds
Primary requirements document covering the full stack of AI cloud infrastructure services — SLAs, security, networking, storage, and operations.
NVIDIA Exemplar Cloud
Improves performance per TCO, security, and reliability for cloud providers with hardware and software recipes, tools, capabilities, and references.

Browse the NVIDIA Performance Benchmarking recipes
NVIDIA AI Cloud-Ready Validation Initiative
Qualifies and validates AI infrastructure and platform software for deployment on NVIDIA-accelerated cloud infrastructure.

Browse the ISV NCP Validation Suite