NVIDIA AI Enterprise and NVIDIA vGPU for Compute#

NVIDIA vGPU for Compute enables multiple virtual machines to share a single physical GPU while providing compute capabilities for AI model training, fine-tuning, and inference workloads.

About NVIDIA vGPU for Compute

NVIDIA AI Enterprise is a cloud-native suite of software tools, libraries, and frameworks for production AI deployments. It delivers optimized performance, security, and stability with enterprise-grade support. NVIDIA AI Enterprise consists of two primary layers: the application layer and the infrastructure layer.

NVIDIA vGPU for Compute is licensed exclusively through NVIDIA AI Enterprise. NVIDIA vGPU for Compute distributes GPU resources efficiently across multiple VMs, optimizes utilization, and lowers overall hardware costs. It offers advanced monitoring and management capabilities including Suspend/Resume, Live Migration, and Warm Updates for Cloud Service Providers (CSPs) and organizations that require scalable, cost-effective GPU acceleration.

Documentation Structure#

This documentation is organized into the following sections:

Quick Start#

New to NVIDIA vGPU for Compute? Follow this recommended path:

  1. Understand the Basics - Start with Overview to learn key concepts, architecture, and vGPU configuration modes. For detailed feature information, refer to Features.

  2. Install Components - Follow Installation to set up NGC CLI, Virtual GPU Manager, and Guest Drivers.

  3. Configure Licensing - Set up Licensing to connect VMs to NVIDIA License System.

  4. Configure vGPU - Use Configuration to set up MIG-backed vGPU and select vGPU profiles.

  5. Reference vGPU Types - Consult Reference for complete vGPU type tables for your GPU.

Common Tasks#

Installation and Setup

Licensing

Configuration

Additional Resources#