NVIDIA AI Enterprise Documentation#

This documentation covers the NVIDIA AI Enterprise Infrastructure Layer software: GPU and network drivers, Kubernetes operators, NVIDIA vGPU for Compute, and NVIDIA Run:ai for AI workload management.

For application-layer software (NIM, NeMo, Omniverse, domain SDKs), refer to Application Software. For enterprise support services, refer to Support.

Quick Start#

  • 🆕 New to NVIDIA AI Enterprise? → Start with the Quick Start Guide to deploy your first AI workload

  • ⬆️ Upgrading from 7.3 or earlier? → Refer to What’s New in 7.4 in the following section

  • 🔧 Need help with a specific task? → Jump to the Deployment Guide

🆕 What is New in NVIDIA AI Enterprise Infra 7.4#

Latest Release Highlights

  • Blackwell Architecture Support - NVIDIA GPU Data Center Driver 580.126.09 adds support for the latest Blackwell GPU architecture

  • vGPU for Compute Updates - Enhancements and bug fixes based on vGPU Software 19.4

  • Updated Kubernetes Operators - GPU Operator 25.10.1, Network Operator 25.10.0, DPU Operator 25.10.1, and NIM Operator 3.0.2 deliver improved lifecycle automation and streamlined deployment for GPU workloads

  • Run:ai Updates - NVIDIA Run:ai 2.24 provides AI workload and GPU orchestration capabilities for self-hosted deployments

  • DOCA Ecosystem Updates - DOCA Driver 3.2.0 and DOCA Microservices 3.2.1 provide enhanced networking performance and infrastructure acceleration for data-intensive workloads

  • Enterprise Management - Base Command Manager 11.31.0 offers refined cluster provisioning and workload orchestration for large-scale AI infrastructure

  • Fabric Manager Support - NVIDIA Fabric Manager is supported in GPU Passthrough and vGPU for Compute deployment modes

  • Interactive Support Matrix - New web-based support matrix tool for exploring infrastructure compatibility across releases 7.0-7.4 with progressive filtering, cross-version comparison, and dynamic search capabilities

  • Lifecycle and Compatibility Explorer - New interactive tool for verifying cross-stack compatibility between infrastructure components, with query modes for browsing by branch, release, component, or full stack validation

View Full 7.4 Release Notes