NVIDIA AI Enterprise Documentation#
This documentation covers the NVIDIA AI Enterprise Infrastructure Layer software β GPU and network drivers, Kubernetes operators, NVIDIA vGPU for Compute, and NVIDIA Run:ai for AI workload management.
For application-layer software (NIM, NeMo, Omniverse, domain SDKs), refer to Application Software. For enterprise support services, refer to Support.
Quick Start#
π New to NVIDIA AI Enterprise? β Start with the Quick Start Guide to deploy your first AI workload
β¬οΈ Upgrading from 7.3 or earlier? β Refer to Whatβs New in 7.4 in the following section
π§ Need help with a specific task? β Jump to the Deployment Guide
π What is New in NVIDIA AI Enterprise Infra 7.4#
Latest Release Highlights
Blackwell Architecture Support - NVIDIA GPU Data Center Driver 580.126.09 adds support for the latest Blackwell GPU architecture
vGPU for Compute Updates - Enhancements and bug fixes based on vGPU Software 19.4
Updated Kubernetes Operators - GPU Operator 25.10.1, Network Operator 25.10.0, DPU Operator 25.10.1, and NIM Operator 3.0.2 deliver improved lifecycle automation and streamlined deployment for GPU workloads
Run:ai Updates - NVIDIA Run:ai 2.24 provides AI workload and GPU orchestration capabilities for self-hosted deployments
DOCA Ecosystem Updates - DOCA Driver 3.2.0 and DOCA Microservices 3.2.1 provide enhanced networking performance and infrastructure acceleration for data-intensive workloads
Enterprise Management - Base Command Manager 11.31.0 offers refined cluster provisioning and workload orchestration for large-scale AI infrastructure
Fabric Manager Support - NVIDIA Fabric Manager supported in GPU Passthrough and vGPU for Compute deployment modes
Interactive Support Matrix - New web-based support matrix tool for exploring infrastructure compatibility across releases 7.0-7.4 with progressive filtering, cross-version comparison, and dynamic search capabilities
Lifecycle and Compatibility Explorer - New interactive tool for verifying cross-stack compatibility between infrastructure components, with query modes for browsing by branch, release, component, or full stack validation