NVIDIA vGPU Types Reference#

This reference provides complete vGPU type specifications for all supported NVIDIA GPU architectures.

Quick Navigation by GPU Architecture

🆕 Blackwell Architecture

Latest generation GPUs

Hopper Architecture

High-performance AI training and inference

Ada Lovelace Architecture

Advanced ray tracing and AI

Ampere Architecture

Proven AI and HPC performance

Turing Architecture

First-generation ray tracing

Understanding vGPU Types

vGPU types define the GPU resources allocated to virtual machines. Each type specifies:

vGPU Configuration Modes

Table 60 vGPU Configuration Comparison#
Mode	Isolation	Use Case	Supported Architectures
Time-Sliced	Temporal	General-purpose, cost-effective	All architectures
MIG-Backed	Spatial (hardware)	Multi-tenant, guaranteed performance	Ampere, Hopper, Blackwell
Time-Sliced MIG-Backed	Spatial + Temporal	Maximum density with isolation	Blackwell (RTX PRO 4500, RTX PRO 6000)

For detailed configuration guidance, refer to vGPU Configuration.

Frequently Asked Questions#

Q. What are the differences between NVIDIA vGPU for Compute and GPU passthrough?

Both are supported ways to use NVIDIA GPUs with NVIDIA AI Enterprise in a virtualized environment.

Table 61 vGPU for Compute vs GPU Passthrough#
	vGPU for Compute	GPU Passthrough
GPU sharing	One physical GPU shared by multiple VMs	One physical GPU dedicated to one VM
Memory Isolation	Time sliced vGPU: Strong hardware based memory isolation enforced by IOMMU, configured through hypervisor MIG backed vGPU: Dedicated L2 cache, memory controllers, and DRAM address buses per MIG instance provide strong hardware level spatial isolation	Complete. A single VM has exclusive GPU access
Fault isolation	Faults in one VM do not propagate to others	Complete. GPU fault or VM crash affects only that VM and its dedicated GPU
Live migration	Supported on compatible hypervisors and vGPU types	Not supported for the GPU device
Suspend/resume	Supported on compatible hypervisors and vGPU types	Not supported for the GPU device
Framebuffer per VM	Fraction of physical GPU memory (configured per vGPU profile)	Full physical GPU memory
Scheduling	Hypervisor-managed (Best Effort, Equal Share, or Fixed Share)	N/A - VM has exclusive GPU access
Heterogeneous profiles	Supported - mixed framebuffer sizes on one GPU	N/A - single VM per GPU
Best for	Multi-tenant, shared infrastructure, density optimization	Single-VM workloads requiring full GPU performance

Reference Pages by Architecture