Introduction

Version  Date            Description of Change
2.1      Feb 26, 2026    Initial version
2.2      April 10, 2026  Update to v2.2

Purpose and Intent

This document serves three main purposes:

  1. Setting requirements for NVIDIA Cloud Partners (NCPs) delivering GPU capacity to NVIDIA
    This is the primary requirements document from NVIDIA to any NCP providing NVIDIA GPU/AI compute and software services. These requirements cover the full stack of AI cloud infrastructure services and operations needed to run NVIDIA DGX Cloud, expanding on the NVIDIA hardware reference design.
  2. Providing a reference set of requirements for the industry
    NVIDIA is publishing this document openly so that NCPs, GPU datacenter operators, and AI practitioners can use it as a reference for the capabilities a large GPU consumer requires.
  3. Defining NVIDIA’s service delivery expectations
    NVIDIA expects services to be delivered as Generally Available to all — not bespoke implementations built for NVIDIA alone. NVIDIA expects operational excellence, transparent communication, and proactive engagement from all partners.

NVIDIA will consider additional services that an NCP offers or plans to offer beyond what is described here.