AI Enterprise Infrastructure Software#

The NVIDIA AI Enterprise Infrastructure software encompasses all necessary components for managing and optimizing infrastructure along with AI workloads. The NVIDIA Kubernetes Operators facilitate a standardized management of NVIDIA GPUs, AI models, and network resources within Kubernetes environments. The following table outlines the components and versions of the NVIDIA AI Enterprise Infrastructure software.

Government Ready software resources are available through NGC.

Component

Software

Notes

GPU Driver

NVIDIA Linux Driver

Supported by the GPU Operator

GPU Management

NVIDIA GPU Operator

Simplifies the deployment of NVIDIA AI Enterprise by automating the management of all NVIDIA software components needed to provision GPUs in Kubernetes (drivers, toolkit, DCGM).

AI Inference Management

NVIDIA NIM Operator

Simplifies deployment and lifecycle management of NIM and NeMo microservices at scale by providing custom resources for model caching, autoscaling, and cluster-level management of AI inference pipelines

Network Management (Hardware)

NVIDIA Network Operator

Simplifies the provisioning and management of NVIDIA networking resources in a Kubernetes cluster (NVIDIA NICs, integrates with NetQ.)