NVIDIA AI Enterprise and NVIDIA vGPU for Compute#

NVIDIA vGPU for Compute lets multiple virtual machines share one physical GPU for AI training, fine-tuning, and inference. Use Quick Start below for the recommended reading order.

About NVIDIA vGPU for Compute

NVIDIA AI Enterprise is NVIDIA’s supported software stack for production AI (tools, libraries, and frameworks) in cloud and data center environments. It spans the application layer and the infrastructure layer.

NVIDIA vGPU for Compute is licensed only through NVIDIA AI Enterprise. It partitions GPU capacity across VMs so several workloads can run on fewer physical GPUs. For cloud service providers (CSPs) and IT teams, supported operations include Suspend/Resume, Live Migration, and Warm Updates in addition to GPU compute in guests.

Documentation Structure#

The following sections move from concepts and features to install, license, configure, and reference:

Quick Start#

New to NVIDIA vGPU for Compute? Follow this recommended path:

  1. Understand the Basics - Overview for concepts, architecture, and vGPU configuration modes; Features for capability detail.

  2. Install Components - Installation for NGC CLI, Virtual GPU Manager, and guest drivers.

  3. Configure Licensing - Licensing to connect VMs to NVIDIA License System.

  4. Configure vGPU - Configuration for MIG-backed vGPU and profile selection.

  5. Reference vGPU Types - Reference for vGPU type tables by GPU.

Common Tasks#

Installation and Setup

Licensing

Configuration

Additional Resources#