Multi-vGPU#
Multi-vGPU attaches several vGPU devices to one VM. Devices may be time-sliced or MIG-backed and can sit on different physical GPUs—you are not limited to slicing one physical GPU across many VMs.
That layout suits training and inference that need multiple GPUs inside one guest: each vGPU is dedicated to that VM, so workloads in the VM do not compete with other VMs on the same physical GPU for those devices (for example, a VM with two A100-class vGPUs versus one).
For peer-to-peer NVLink transfers between vGPUs in the same VM, see Peer-to-Peer CUDA Transfers. For the shared hypervisor support matrix that covers both Multi-vGPU and P2P, see Multi-vGPU and P2P.
vGPU Support for Multi-vGPU#
You can assign multiple vGPUs with differing amounts of frame buffer to a single VM, provided the board type and the series of all the vGPUs are the same. For example, you can assign an A40-48C vGPU and an A40-16C time-sliced vGPUs to the same VM. You can also assign an A100-4-20C vGPU and one A100-2-10C vGPU to a VM, both on MIG instances from an A100 board. However, you cannot assign an A30-8C vGPU and an A16-8C vGPU to the same VM.
Board |
vGPU [1] |
|---|---|
NVIDIA HGX B300 279 GB |
Generic Linux with KVM hypervisors [2], Red Hat Enterprise Linux KVM, and Ubuntu: - All NVIDIA vGPU for Compute |
NVIDIA HGX B200 180 GB |
Generic Linux with KVM hypervisors [2], Red Hat Enterprise Linux KVM, and Ubuntu: - All NVIDIA vGPU for Compute |
NVIDIA RTX PRO 6000 Blackwell Server Edition 96 GB |
|
NVIDIA RTX PRO 4500 Blackwell Server Edition 32 GB |
|
Board |
vGPU [1] |
|---|---|
NVIDIA H800 PCIe 94 GB (H800 NVL) |
All NVIDIA vGPU for Compute |
NVIDIA H800 PCIe 80 GB |
All NVIDIA vGPU for Compute |
NVIDIA H800 SXM5 80 GB |
NVIDIA vGPU for Compute |
NVIDIA H200 PCIe 141 GB (H200 NVL) |
All NVIDIA vGPU for Compute |
NVIDIA H200 SXM5 141 GB |
NVIDIA vGPU for Compute |
NVIDIA H100 PCIe 94 GB (H100 NVL) |
All NVIDIA vGPU for Compute |
NVIDIA H100 SXM5 94 GB |
NVIDIA vGPU for Compute |
NVIDIA H100 PCIe 80 GB |
All NVIDIA vGPU for Compute |
NVIDIA H100 SXM5 80 GB |
NVIDIA vGPU for Compute |
NVIDIA H100 SXM5 64 GB |
NVIDIA vGPU for Compute |
NVIDIA H20 SXM5 141 GB |
NVIDIA vGPU for Compute |
NVIDIA H20 SXM5 96 GB |
NVIDIA vGPU for Compute |
Board |
vGPU |
|---|---|
NVIDIA L40 |
|
NVIDIA L40S |
|
NVIDIA L20 |
|
NVIDIA L4 |
|
NVIDIA L2 |
|
NVIDIA RTX 6000 Ada |
|
NVIDIA RTX 5880 Ada |
|
NVIDIA RTX 5000 Ada |
|
Board |
vGPU [1] |
|---|---|
|
|
NVIDIA A800 PCIe 40 GB active-cooled |
|
NVIDIA A800 HGX 80 GB |
|
|
|
NVIDIA A100 HGX 80 GB |
|
NVIDIA A100 PCIe 40 GB |
|
NVIDIA A100 HGX 40 GB |
|
NVIDIA A40 |
|
|
|
NVIDIA A16 |
|
NVIDIA A10 |
|
NVIDIA RTX A6000 |
|
NVIDIA RTX A5500 |
|
NVIDIA RTX A5000 |
|