NVIDIA AI Enterprise Documentation
This documentation covers the NVIDIA AI Enterprise Infrastructure Layer software: GPU and network drivers, Kubernetes operators, NVIDIA vGPU for Compute, and NVIDIA Run:ai for AI workload management.
For application-layer software (NIM, NeMo, Omniverse, domain SDKs), refer to Application Software. For enterprise support services, refer to Support.
Quick Start
New to NVIDIA AI Enterprise? Start with the Quick Start Guide to deploy your first AI workload
Upgrading from 7.4 or earlier? Follow the Upgrade from 7.4 to 7.5 checklist; for highlights, refer to What's New in 7.5 in the following section
Planning a deployment? Refer to Planning & Deployment on the NVIDIA AI Enterprise Docs Hub for reference architectures and sizing guidance
Setting up vGPU? Refer to Installing NVIDIA vGPU for Compute and NVIDIA vGPU for Compute Licensing
What is New in NVIDIA AI Enterprise Infra 7.5
Latest Release Highlights
NVIDIA Run:ai SaaS Now Included: In addition to NVIDIA Run:ai self-hosted, the NVIDIA-managed NVIDIA Run:ai SaaS offering is now included in the NVIDIA AI Enterprise license under the same enterprise SLA. Refer to the NVIDIA Run:ai SaaS Documentation for the NVIDIA-managed cloud-service option, or choose the deployment that fits your environment.
NVIDIA Run:ai 2.25: Updated from 2.24 in 7.4. The same 2.25 release applies to both NVIDIA Run:ai self-hosted and NVIDIA Run:ai SaaS. Refer to the NVIDIA Run:ai release notes for scheduling, GPU-utilization, and platform updates in this version.
NVIDIA Data Center GPU Driver 580.159.03: Maintenance update within the R580 production driver branch (from 580.126.09 in 7.4). NVIDIA Fabric Manager also updates to 580.159.03 in lockstep with the driver. Refer to the 580.159.03 release notes for fixes and platform-support details.
NVIDIA vGPU Software 19.5: NVIDIA Virtual GPU Manager and the NVIDIA vGPU for Compute Guest Driver are both updated from 19.4 to 19.5, refreshing the full vGPU stack in a coordinated release.
NVIDIA DOCA 3.3.0: NVIDIA DOCA Driver for Networking is updated from 3.2.0 to 3.3.0 and NVIDIA DOCA Microservices is updated from 3.2.1 to 3.3.0, advancing the full DOCA stack for NVIDIA BlueField DPUs and SuperNICs in a coordinated release.
Kubernetes Operator Major Updates: NVIDIA GPU Operator 26.3.1 (from 25.10.1 in 7.4) and NVIDIA Network Operator 26.1.1 (from 25.10.0 in 7.4) are both major-version bumps. NVIDIA NIM Operator 3.1.0 (from 3.0.2) and NVIDIA Container Toolkit 1.19.0 (from 1.18.1) are also updated. NVIDIA DPU Operator (DPF) 25.10.1 carries forward unchanged from 7.4.
NVIDIA Base Command Manager 11.32.1: Updated from 11.31.0 in 7.4. Refer to the BCM 11.32.1 release notes for cluster-management and provisioning updates.
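When planning an upgrade, it can help to see the 7.4 to 7.5 version changes side by side. The following is a minimal sketch, not an official NVIDIA tool: the version strings are copied from the highlights above, while the names `RELEASES`, `parse_version`, and `changed_components` are hypothetical helpers introduced only for illustration.

```python
# Component versions copied from the 7.4 and 7.5 highlights in this document.
# The dictionary layout and helper names are illustrative assumptions.
RELEASES = {
    "7.4": {
        "NVIDIA Run:ai": "2.24",
        "NVIDIA Data Center GPU Driver": "580.126.09",
        "NVIDIA vGPU Software": "19.4",
        "NVIDIA DOCA Driver for Networking": "3.2.0",
        "NVIDIA DOCA Microservices": "3.2.1",
        "NVIDIA GPU Operator": "25.10.1",
        "NVIDIA Network Operator": "25.10.0",
        "NVIDIA NIM Operator": "3.0.2",
        "NVIDIA Container Toolkit": "1.18.1",
        "NVIDIA DPU Operator (DPF)": "25.10.1",
        "NVIDIA Base Command Manager": "11.31.0",
    },
    "7.5": {
        "NVIDIA Run:ai": "2.25",
        "NVIDIA Data Center GPU Driver": "580.159.03",
        "NVIDIA vGPU Software": "19.5",
        "NVIDIA DOCA Driver for Networking": "3.3.0",
        "NVIDIA DOCA Microservices": "3.3.0",
        "NVIDIA GPU Operator": "26.3.1",
        "NVIDIA Network Operator": "26.1.1",
        "NVIDIA NIM Operator": "3.1.0",
        "NVIDIA Container Toolkit": "1.19.0",
        "NVIDIA DPU Operator (DPF)": "25.10.1",
        "NVIDIA Base Command Manager": "11.32.1",
    },
}


def parse_version(text):
    """Turn '580.159.03' into (580, 159, 3) so versions compare numerically."""
    return tuple(int(part) for part in text.split("."))


def changed_components(old, new):
    """Return {name: (old_version, new_version)} for components whose version differs."""
    return {
        name: (old[name], new[name])
        for name in new
        if name in old and parse_version(old[name]) != parse_version(new[name])
    }


changes = changed_components(RELEASES["7.4"], RELEASES["7.5"])
for name, (before, after) in changes.items():
    print(f"{name}: {before} -> {after}")
```

Components that carry forward unchanged, such as the NVIDIA DPU Operator (DPF), do not appear in the result. To compare against a live host, the installed driver version can be read with `nvidia-smi --query-gpu=driver_version --format=csv,noheader` and passed through the same `parse_version` helper.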
Previous Releases
Release 7.4 Highlights
Blackwell Architecture Support: NVIDIA Data Center GPU Driver 580.126.09 adds support for the latest Blackwell GPU architecture
vGPU for Compute Updates: Enhancements and bug fixes based on vGPU Software 19.4
Updated Kubernetes Operators: GPU Operator 25.10.1, Network Operator 25.10.0, DPU Operator 25.10.1, and NIM Operator 3.0.2 deliver improved lifecycle automation and streamlined deployment for GPU workloads
Run:ai Updates: NVIDIA Run:ai 2.24 provides AI workload and GPU orchestration capabilities for self-hosted deployments
DOCA Ecosystem Updates: DOCA Driver 3.2.0 and DOCA Microservices 3.2.1 provide enhanced networking performance and infrastructure acceleration for data-intensive workloads
Enterprise Management: Base Command Manager 11.31.0 offers refined cluster provisioning and workload orchestration for large-scale AI infrastructure
Fabric Manager Support: NVIDIA Fabric Manager is supported in GPU Passthrough and vGPU for Compute deployment modes
Interactive Support Matrix: Web-based support matrix tool for exploring infrastructure compatibility across releases 7.0-7.4 with progressive filtering, cross-version comparison, and dynamic search capabilities
Lifecycle and Compatibility Explorer: Interactive tool for verifying cross-stack compatibility between infrastructure components, with query modes for browsing by branch, release, component, or full stack validation
Archived Releases (7.0-7.3)
Earlier NVIDIA AI Enterprise 7.x releases with key component versions:
7.3 Release Notes: NVIDIA Data Center GPU Driver 580.105.08 (adds DGX B300, HGX B300, and GB300 NVL72 support), vGPU Software 19.3, DOCA-OFED 3.1.0, Base Command Manager 11.25.08
7.2 Release Notes: NVIDIA Data Center GPU Driver 580.95.05, vGPU Software 19.2 (RTX PRO 6000 Blackwell Server Edition support on VMware vSphere), DOCA-OFED 3.1.0, Base Command Manager 11.25.08
7.1 Release Notes: NVIDIA Data Center GPU Driver 580.82.07, vGPU Software 19.1, DOCA-OFED 3.1.0, Base Command Manager 11.25.05
7.0 Release Notes: Initial R580 driver branch (580.65.06) and vGPU Software 19.0 with HGX B200 and RTX PRO 6000 Blackwell Server Edition support, DOCA-OFED 3.0.0, Base Command Manager 11.25.05