Multi-vGPU#

Multi-vGPU attaches several vGPU devices to one VM. Devices may be time-sliced or MIG-backed and can sit on different physical GPUs—you are not limited to slicing one physical GPU across many VMs.

That layout suits training and inference that need multiple GPUs inside one guest: each vGPU is dedicated to that VM, so workloads in the VM do not compete with other VMs on the same physical GPU for those devices (for example, a VM with two A100-class vGPUs versus one).

For peer-to-peer NVLink transfers between vGPUs in the same VM, see Peer-to-Peer CUDA Transfers. For the shared hypervisor support matrix that covers both Multi-vGPU and P2P, see Multi-vGPU and P2P.

vGPU Support for Multi-vGPU#

You can assign multiple vGPUs with differing amounts of frame buffer to a single VM, provided the board type and the series of all the vGPUs are the same. For example, you can assign an A40-48C vGPU and an A40-16C time-sliced vGPUs to the same VM. You can also assign an A100-4-20C vGPU and one A100-2-10C vGPU to a VM, both on MIG instances from an A100 board. However, you cannot assign an A30-8C vGPU and an A16-8C vGPU to the same VM.

Table 24 vGPU Support for Multi-vGPU on the NVIDIA Blackwell Architecture#

Board

vGPU [1]

NVIDIA HGX B300 279 GB

Generic Linux with KVM hypervisors [2], Red Hat Enterprise Linux KVM, and Ubuntu: - All NVIDIA vGPU for Compute

NVIDIA HGX B200 180 GB

Generic Linux with KVM hypervisors [2], Red Hat Enterprise Linux KVM, and Ubuntu: - All NVIDIA vGPU for Compute

NVIDIA RTX PRO 6000 Blackwell Server Edition 96 GB

  • Generic Linux with KVM hypervisors [2], Red Hat Enterprise Linux KVM, and Ubuntu:
    • All NVIDIA vGPU for Compute

  • Since VMware vSphere 8.0:
    • All NVIDIA vGPU for Compute

NVIDIA RTX PRO 4500 Blackwell Server Edition 32 GB

  • Generic Linux with KVM hypervisors [2], Red Hat Enterprise Linux KVM, and Ubuntu:
    • All NVIDIA vGPU for Compute

  • Since VMware vSphere 8.0:
    • All NVIDIA vGPU for Compute

Table 25 vGPU Support for Multi-vGPU on the NVIDIA Hopper GPU Architecture#

Board

vGPU [1]

NVIDIA H800 PCIe 94 GB (H800 NVL)

All NVIDIA vGPU for Compute

NVIDIA H800 PCIe 80 GB

All NVIDIA vGPU for Compute

NVIDIA H800 SXM5 80 GB

NVIDIA vGPU for Compute

NVIDIA H200 PCIe 141 GB (H200 NVL)

All NVIDIA vGPU for Compute

NVIDIA H200 SXM5 141 GB

NVIDIA vGPU for Compute

NVIDIA H100 PCIe 94 GB (H100 NVL)

All NVIDIA vGPU for Compute

NVIDIA H100 SXM5 94 GB

NVIDIA vGPU for Compute

NVIDIA H100 PCIe 80 GB

All NVIDIA vGPU for Compute

NVIDIA H100 SXM5 80 GB

NVIDIA vGPU for Compute

NVIDIA H100 SXM5 64 GB

NVIDIA vGPU for Compute

NVIDIA H20 SXM5 141 GB

NVIDIA vGPU for Compute

NVIDIA H20 SXM5 96 GB

NVIDIA vGPU for Compute

Table 26 vGPU Support for Multi-vGPU on the NVIDIA Ada Lovelace Architecture#

Board

vGPU

NVIDIA L40

  • Generic Linux with KVM hypervisors [2], Red Hat Enterprise Linux KVM, and Ubuntu:
    • All NVIDIA vGPU for Compute

  • Since VMware vSphere 8.0:
    • All NVIDIA vGPU for Compute

NVIDIA L40S

  • Generic Linux with KVM hypervisors [2], Red Hat Enterprise Linux KVM, and Ubuntu:
    • All NVIDIA vGPU for Compute

  • Since VMware vSphere 8.0:
    • All NVIDIA vGPU for Compute

NVIDIA L20

  • Generic Linux with KVM hypervisors [2], Red Hat Enterprise Linux KVM, and Ubuntu:
    • All NVIDIA vGPU for Compute

  • Since VMware vSphere 8.0:
    • All NVIDIA vGPU for Compute

NVIDIA L4

  • Generic Linux with KVM hypervisors [2], Red Hat Enterprise Linux KVM, and Ubuntu:
    • All NVIDIA vGPU for Compute

  • Since VMware vSphere 8.0:
    • All NVIDIA vGPU for Compute

NVIDIA L2

  • Generic Linux with KVM hypervisors [2], Red Hat Enterprise Linux KVM, and Ubuntu:
    • All NVIDIA vGPU for Compute

  • Since VMware vSphere 8.0:
    • All NVIDIA vGPU for Compute

NVIDIA RTX 6000 Ada

  • Generic Linux with KVM hypervisors [2], Red Hat Enterprise Linux KVM, and Ubuntu:
    • All NVIDIA vGPU for Compute

  • Since VMware vSphere 8.0:
    • All NVIDIA vGPU for Compute

NVIDIA RTX 5880 Ada

  • Generic Linux with KVM hypervisors [2], Red Hat Enterprise Linux KVM, and Ubuntu:
    • All NVIDIA vGPU for Compute

  • Since VMware vSphere 8.0:
    • All NVIDIA vGPU for Compute

NVIDIA RTX 5000 Ada

  • Generic Linux with KVM hypervisors [2], Red Hat Enterprise Linux KVM, and Ubuntu:
    • All NVIDIA vGPU for Compute

  • Since VMware vSphere 8.0:
    • All NVIDIA vGPU for Compute

Table 27 vGPU Support for Multi-vGPU on the NVIDIA Ampere GPU Architecture#

Board

vGPU [1]

  • NVIDIA A800 PCIe 80 GB

  • NVIDIA A800 PCIe 80 GB liquid-cooled

  • NVIDIA AX800

  • Generic Linux with KVM hypervisors [2], Red Hat Enterprise Linux KVM, and Ubuntu:
    • All NVIDIA vGPU for Compute

  • Since VMware vSphere 8.0:
    • All NVIDIA vGPU for Compute

NVIDIA A800 PCIe 40 GB active-cooled

  • Generic Linux with KVM hypervisors [2], Red Hat Enterprise Linux KVM, and Ubuntu:
    • All NVIDIA vGPU for Compute

  • Since VMware vSphere 8.0:
    • All NVIDIA vGPU for Compute

NVIDIA A800 HGX 80 GB

  • Since VMware vSphere 8.0:
    • All NVIDIA vGPU for Compute

  • NVIDIA A100 PCIe 80 GB

  • NVIDIA A100 PCIe 80 GB liquid-cooled

  • NVIDIA A100X

  • Generic Linux with KVM hypervisors [2], Red Hat Enterprise Linux KVM, and Ubuntu:
    • All NVIDIA vGPU for Compute

  • Since VMware vSphere 8.0:
    • All NVIDIA vGPU for Compute

NVIDIA A100 HGX 80 GB

  • Since VMware vSphere 8.0:
    • All NVIDIA vGPU for Compute

NVIDIA A100 PCIe 40 GB

  • Generic Linux with KVM hypervisors [2], Red Hat Enterprise Linux KVM, and Ubuntu:
    • All NVIDIA vGPU for Compute

  • Since VMware vSphere 8.0:
    • All NVIDIA vGPU for Compute

NVIDIA A100 HGX 40 GB

  • Since VMware vSphere 8.0:
    • All NVIDIA vGPU for Compute

NVIDIA A40

  • Generic Linux with KVM hypervisors [2], Red Hat Enterprise Linux KVM, and Ubuntu:
    • All NVIDIA vGPU for Compute

  • Since VMware vSphere 8.0:
    • All NVIDIA vGPU for Compute

  • NVIDIA A30

  • NVIDIA A30X

  • NVIDIA A30 liquid-cooled

  • Generic Linux with KVM hypervisors [2], Red Hat Enterprise Linux KVM, and Ubuntu:
    • All NVIDIA vGPU for Compute

  • Since VMware vSphere 8.0:
    • All NVIDIA vGPU for Compute

NVIDIA A16

  • Generic Linux with KVM hypervisors [2], Red Hat Enterprise Linux KVM, and Ubuntu:
    • All NVIDIA vGPU for Compute

  • Since VMware vSphere 8.0:
    • All NVIDIA vGPU for Compute

NVIDIA A10

  • Generic Linux with KVM hypervisors [2], Red Hat Enterprise Linux KVM, and Ubuntu:
    • All NVIDIA vGPU for Compute

  • Since VMware vSphere 8.0:
    • All NVIDIA vGPU for Compute

NVIDIA RTX A6000

  • Generic Linux with KVM hypervisors [2], Red Hat Enterprise Linux KVM, and Ubuntu:
    • All NVIDIA vGPU for Compute

  • Since VMware vSphere 8.0:
    • All NVIDIA vGPU for Compute

NVIDIA RTX A5500

  • Generic Linux with KVM hypervisors [2], Red Hat Enterprise Linux KVM, and Ubuntu:
    • All NVIDIA vGPU for Compute

  • Since VMware vSphere 8.0:
    • All NVIDIA vGPU for Compute

NVIDIA RTX A5000

  • Generic Linux with KVM hypervisors [2], Red Hat Enterprise Linux KVM, and Ubuntu:
    • All NVIDIA vGPU for Compute

  • Since VMware vSphere 8.0:
    • All NVIDIA vGPU for Compute

Table 28 vGPU Support for Multi-vGPU on the NVIDIA Turing GPU Architecture#

Board

vGPU

Tesla T4

  • Generic Linux with KVM hypervisors [2], Red Hat Enterprise Linux KVM, and Ubuntu:
    • All NVIDIA vGPU for Compute

  • Since VMware vSphere 8.0:
    • All NVIDIA vGPU for Compute