Heterogeneous vGPU#

Heterogeneous vGPU allows a single physical GPU to host multiple vGPU profiles with different framebuffer sizes at the same time. Use it when VMs need different GPU memory footprints on one physical GPU so guests can take smaller profiles instead of all sharing one large profile.

When a GPU is configured for heterogeneous vGPU, its behavior during events like a host reboot, NVIDIA Virtual GPU Manager reload, or GPU reset varies by hypervisor.

Note

Heterogeneous vGPU configuration only supports the Best Effort and Equal Share schedulers.

Heterogeneous vGPU is supported on Volta and later GPUs. For additional information and operational instructions across different hypervisors, refer to the Heterogeneous vGPU documentation.

Platform Support for Heterogeneous vGPUs#

Table 20 Platform Support for Heterogeneous vGPUs#

Hypervisor Platform

NVIDIA AI Enterprise Infra Release

Documentation

Red Hat Enterprise Linux with KVM

  • NVIDIA AI Enterprise Infra 8.x

  • NVIDIA AI Enterprise Infra 7.x

Configuring a GPU for Heterogeneous vGPU on RHEL KVM

Canonical Ubuntu with KVM

  • NVIDIA AI Enterprise Infra 8.x

  • NVIDIA AI Enterprise Infra 7.x

Configuring a GPU for Heterogeneous vGPU on Linux KVM

VMware vSphere

  • NVIDIA AI Enterprise Infra 8.x

  • NVIDIA AI Enterprise Infra 7.x

Configuring a GPU for Heterogeneous vGPU on VMware vSphere