NVIDIA AI Enterprise Documentation#

This documentation covers the NVIDIA AI Enterprise Infrastructure Layer software β€” GPU and network drivers, Kubernetes operators, NVIDIA vGPU for Compute, and NVIDIA Run:ai for AI workload management.

For application-layer software (NIM, NeMo, Omniverse, domain SDKs), refer to Application Software. For enterprise support services, refer to Support.

Quick Start#

πŸ†• What is New in NVIDIA AI Enterprise Infra 7.5#

Latest Release Highlights

  • NVIDIA Run:ai SaaS Now Included β€” In addition to NVIDIA Run:ai self-hosted, the NVIDIA-managed NVIDIA Run:ai SaaS offering is now included in the NVIDIA AI Enterprise license under the same enterprise SLA. Refer to the NVIDIA Run:ai SaaS Documentation for the NVIDIA-managed cloud-service option, or choose the deployment that fits your environment.

  • NVIDIA Run:ai 2.25 β€” Updated from 2.24 in 7.4. The same 2.25 release applies to both NVIDIA Run:ai self-hosted and NVIDIA Run:ai SaaS. Refer to the NVIDIA Run:ai release notes for scheduling, GPU-utilization, and platform updates in this version.

  • NVIDIA Data Center GPU Driver 580.159.03 β€” Maintenance update within the R580 production driver branch (from 580.126.09 in 7.4). NVIDIA Fabric Manager also updates to 580.159.03 in lockstep with the driver. Refer to the 580.159.03 release notes for fixes and platform-support details.

  • NVIDIA vGPU Software 19.5 β€” NVIDIA Virtual GPU Manager and the NVIDIA vGPU for Compute Guest Driver are both updated from 19.4 to 19.5, refreshing the full vGPU stack in a coordinated release.

  • NVIDIA DOCA 3.3.0 β€” NVIDIA DOCA Driver for Networking is updated from 3.2.0 to 3.3.0 and NVIDIA DOCA Microservices is updated from 3.2.1 to 3.3.0, advancing the full DOCA stack for NVIDIA BlueField DPUs and SuperNICs in a coordinated release.

  • Kubernetes Operator Major Updates β€” NVIDIA GPU Operator 26.3.1 (from 25.10.1 in 7.4) and NVIDIA Network Operator 26.1.1 (from 25.10.0 in 7.4) are both major-version bumps. NVIDIA NIM Operator 3.1.0 (from 3.0.2) and NVIDIA Container Toolkit 1.19.0 (from 1.18.1) also updated. NVIDIA DPU Operator (DPF) 25.10.1 carries forward unchanged from 7.4.

  • NVIDIA Base Command Manager 11.32.1 β€” Updated from 11.31.0 in 7.4. Refer to the BCM 11.32.1 release notes for cluster-management and provisioning updates.

View Full 7.5 Release Notes

Previous Releases#

πŸ“‹ Release 7.4 Highlights
  • Blackwell Architecture Support β€” NVIDIA Data Center GPU Driver 580.126.09 adds support for the latest Blackwell GPU architecture

  • vGPU for Compute Updates β€” Enhancements and bug fixes based on vGPU Software 19.4

  • Updated Kubernetes Operators β€” GPU Operator 25.10.1, Network Operator 25.10.0, DPU Operator 25.10.1, and NIM Operator 3.0.2 deliver improved lifecycle automation and streamlined deployment for GPU workloads

  • Run:ai Updates β€” NVIDIA Run:ai 2.24 provides AI workload and GPU orchestration capabilities for self-hosted deployments

  • DOCA Ecosystem Updates β€” DOCA Driver 3.2.0 and DOCA Microservices 3.2.1 provide enhanced networking performance and infrastructure acceleration for data-intensive workloads

  • Enterprise Management β€” Base Command Manager 11.31.0 offers refined cluster provisioning and workload orchestration for large-scale AI infrastructure

  • Fabric Manager Support β€” NVIDIA Fabric Manager supported in GPU Passthrough and vGPU for Compute deployment modes

  • Interactive Support Matrix β€” Web-based support matrix tool for exploring infrastructure compatibility across releases 7.0-7.4 with progressive filtering, cross-version comparison, and dynamic search capabilities

  • Lifecycle and Compatibility Explorer β€” Interactive tool for verifying cross-stack compatibility between infrastructure components, with query modes for browsing by branch, release, component, or full stack validation

View Full 7.4 Release Notes

πŸ“¦ Archived Releases (7.0–7.3)

Earlier NVIDIA AI Enterprise 7.x releases with key component versions:

  • 7.3 Release Notes β€” NVIDIA Data Center GPU Driver 580.105.08 (adds DGX B300, HGX B300, and GB300 NVL72 support), vGPU Software 19.3, DOCA-OFED 3.1.0, Base Command Manager 11.25.08

  • 7.2 Release Notes β€” NVIDIA Data Center GPU Driver 580.95.05, vGPU Software 19.2 (RTX PRO 6000 Blackwell Server Edition support on VMware vSphere), DOCA-OFED 3.1.0, Base Command Manager 11.25.08

  • 7.1 Release Notes β€” NVIDIA Data Center GPU Driver 580.82.07, vGPU Software 19.1, DOCA-OFED 3.1.0, Base Command Manager 11.25.05

  • 7.0 Release Notes β€” Initial R580 driver branch (580.65.06) and vGPU Software 19.0 with HGX B200 and RTX PRO 6000 Blackwell Server Edition support, DOCA-OFED 3.0.0, Base Command Manager 11.25.05