Build AI factories that scale. Turn your data center into a high-performance AI factory with NVIDIA Enterprise Reference Architectures.
This whitepaper introduces NVIDIA Enterprise Reference Architectures (Enterprise RAs), which provide recommendations for building AI factories for enterprise-class deployments ranging from 32 to 256 GPUs. These architectures simplify the deployment of AI infrastructure, reduce complexity, and accelerate time to value.
The NVIDIA RTX PRO AI Factory supports a range of enterprise workloads, including agentic AI inference, physical and industrial AI, visual computing, and high-performance computing for data analytics and simulation. This document outlines the hardware components that define this scalable, modular architecture, including guidance on the scalable unit (SU) design and the specifics of the Ethernet fabric topologies.
The NVIDIA HGX AI Factory supports a range of enterprise workloads, including AI inference, AI training and fine-tuning, and large-scale GPU-accelerated data analytics. This document outlines the hardware components that define a scalable, modular architecture, including SU-based design guidance and the specifics of the underlying network fabric topologies used to interconnect the cluster.
This paper presents the necessary components, including integrations from our ecosystem partners, automation tools, and deployment strategies. Enterprise partners can use this design to integrate accelerated computing, high-performance networking, and AI software to build single-tenant, enterprise-ready AI factories.
This paper provides an example infrastructure stack build geared toward OEMs and NVIDIA partners who intend to build systems ready for single-tenant, production-grade AI workloads. While the hardware components of the infrastructure stack can be modular, the software components remain consistent across workloads such as inference, fine-tuning, and retrieval-augmented generation (RAG).
This paper guides enterprises on how to pack more inference models onto a given set of NVIDIA GPUs using NVIDIA Run:ai through intelligent scheduling, fractional GPUs, and dynamic resource management. We also explore the performance impact of using the Run:ai scheduler with fractional GPUs for NVIDIA NIM LLMs.
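To make the fractional-GPU idea concrete, here is a minimal sketch that submits a Kubernetes pod requesting a quarter of a GPU from the Run:ai scheduler, using the official Kubernetes Python client. The annotation key (`gpu-fraction`), scheduler name (`runai-scheduler`), and the NIM container image shown are assumptions based on Run:ai's documented Kubernetes integration; verify them against your installed Run:ai and NIM versions.

```python
# Minimal sketch: request a fractional GPU for a NIM container via Run:ai.
# The annotation key, scheduler name, and image below are assumptions;
# check them against your deployed Run:ai and NIM versions.
from kubernetes import client, config

config.load_kube_config()  # use the current kubeconfig context

pod = client.V1Pod(
    metadata=client.V1ObjectMeta(
        name="nim-llm-fractional",
        annotations={"gpu-fraction": "0.25"},  # ask Run:ai for 1/4 of a GPU
    ),
    spec=client.V1PodSpec(
        scheduler_name="runai-scheduler",  # hand placement to the Run:ai scheduler
        containers=[
            client.V1Container(
                name="nim-llm",
                # hypothetical NIM image name, for illustration only
                image="nvcr.io/nim/meta/llama-3.1-8b-instruct:latest",
            )
        ],
    ),
)

client.CoreV1Api().create_namespaced_pod(namespace="default", body=pod)
```

With requests like this, the scheduler can co-locate several models on one physical GPU, which is the packing behavior the paper measures.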
In this paper, we look at the NVIDIA AI-Q Research Agent blueprint, an agentic system that generates detailed reports from both internal and external data. We walk through how to deploy and scale it, and provide sizing guidance.
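As a hedged illustration of the scaling step, the sketch below resizes a Kubernetes Deployment hosting the blueprint's services to a replica count chosen from sizing guidance. The deployment name and namespace are hypothetical placeholders, and the actual blueprint may instead be scaled through Helm values or a different resource layout.

```python
# Minimal sketch: scale out a Deployment hosting the blueprint's services.
# "aiq-research-agent" and "ai-blueprints" are hypothetical placeholders;
# the real blueprint may be scaled through Helm values instead.
from kubernetes import client, config

config.load_kube_config()

client.AppsV1Api().patch_namespaced_deployment_scale(
    name="aiq-research-agent",       # hypothetical deployment name
    namespace="ai-blueprints",       # hypothetical namespace
    body={"spec": {"replicas": 3}},  # target replica count from sizing guidance
)
```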