Platform Support#
NVIDIA GPU Operator Versioning#
NVIDIA GPU Operator is versioned following the calendar versioning convention.
The version follows the pattern YY.MM.PP
, such as 23.6.0, 23.6.1, and 23.9.0.
The first two fields, YY.MM
identify a major version and indicates when the major version was initially released.
The third field, PP
, identifies the patch version of the major version.
Patch releases typically include critical bug and CVE fixes, but can include minor features.
NVIDIA GPU Operator Life Cycle#
When a major version of NVIDIA GPU Operator is released, the previous major version enters maintenance support and only receives patch release updates for critical bug and CVE fixes. All prior major versions enter end-of-life (EOL) and are no longer supported and do not receive patch release updates.
The product life cycle and versioning are subject to change in the future.
Note
Upgrades are only supported within a major release or to the next major release.
GPU Operator Version |
Status |
---|---|
25.3.x |
Generally Available |
24.9.x |
Maintenance |
24.6.x and lower |
EOL |
GPU Operator Component Matrix#
The following table shows the operands and default operand versions that correspond to a GPU Operator version.
When post-release testing confirms support for newer versions of operands, these updates are identified as recommended updates to a GPU Operator version. Refer to Upgrading the NVIDIA GPU Operator for more information.
Component |
Version |
---|---|
NVIDIA GPU Operator |
v25.3.0 |
NVIDIA GPU Driver |
570.124.06 (default, recommended),
|
NVIDIA Driver Manager for Kubernetes |
|
NVIDIA Container Toolkit |
|
NVIDIA Kubernetes Device Plugin |
|
DCGM Exporter |
|
Node Feature Discovery |
|
NVIDIA GPU Feature Discovery
for Kubernetes
|
|
NVIDIA MIG Manager for Kubernetes |
|
DCGM |
|
Validator for NVIDIA GPU Operator |
v25.3.0 |
NVIDIA KubeVirt GPU Device Plugin |
|
NVIDIA vGPU Device Manager |
|
NVIDIA GDS Driver 1 |
|
NVIDIA Kata Manager for Kubernetes |
|
NVIDIA Confidential Computing
Manager for Kubernetes
|
v0.1.1 |
NVIDIA GDRCopy Driver |
1 This release of the GDS driver requires that you use the NVIDIA Open GPU Kernel module driver for the GPUs. Refer to GPUDirect RDMA and GPUDirect Storage for more information.
Note
Driver version could be different with NVIDIA vGPU, as it depends on the driver version downloaded from the NVIDIA vGPU Software Portal.
The GPU Operator is supported on all active NVIDIA data center production drivers. Refer to Supported Drivers and CUDA Toolkit Versions for more information.
Supported NVIDIA Data Center GPUs and Systems#
The following NVIDIA data center GPUs are supported on x86 based platforms:
Product |
Architecture |
---|---|
NVIDIA GH200 1 |
NVIDIA Grace Hopper |
1
NVIDIA GH200 systems require the NVIDIA Open GPU Kernel module driver.
You can install the open kernel modules by specifying the driver.useOpenKernelModules=true
argument to the helm
command.
Refer to Common Chart Customization Options for more information.
Product |
Architecture |
---|---|
NVIDIA H800 |
NVIDIA Hopper |
NVIDIA H200,
NVIDIA H200 NVL
|
NVIDIA Hopper |
NVIDIA HGX H200 |
NVIDIA Hopper and NVSwitch |
NVIDIA DGX H100 |
NVIDIA Hopper and NVSwitch |
NVIDIA HGX H100 |
NVIDIA Hopper and NVSwitch |
NVIDIA H100,
NVIDIA H100 NVL
|
NVIDIA Hopper |
NVIDIA H20 |
NVIDIA Hopper |
NVIDIA L20 |
NVIDIA Ada |
NVIDIA L40,
NVIDIA L40S
|
NVIDIA Ada |
NVIDIA L4 |
NVIDIA Ada |
NVIDIA DGX A100 |
A100 and NVSwitch |
NVIDIA HGX A100 |
A100 and NVSwitch |
NVIDIA A800 |
NVIDIA Ampere |
NVIDIA A100 |
NVIDIA Ampere |
NVIDIA A100X |
NVIDIA Ampere |
NVIDIA A40 |
NVIDIA Ampere |
NVIDIA A30 |
NVIDIA Ampere |
NVIDIA A30X |
NVIDIA Ampere |
NVIDIA A16 |
NVIDIA Ampere |
NVIDIA A10 |
NVIDIA Ampere |
NVIDIA A2 |
NVIDIA Ampere |
Note
The GPU Operator supports DGX A100 with DGX OS 5.1+ and Red Hat OpenShift using Red Hat Core OS. For installation instructions, see Pre-Installed NVIDIA GPU Drivers and NVIDIA Container Toolkit for DGX OS 5.1+ and Introduction to NVIDIA GPU Operator on OpenShift for Red Hat OpenShift.
Product |
Architecture |
---|---|
NVIDIA T4 |
Turing |
NVIDIA V100 |
Volta |
NVIDIA P100 |
Pascal |
NVIDIA P40 |
Pascal |
NVIDIA P4 |
Pascal |
Product |
Architecture |
---|---|
NVIDIA RTX A6000 |
NVIDIA Ampere /Ada |
NVIDIA RTX A5000 |
NVIDIA Ampere |
NVIDIA RTX A4500 |
NVIDIA Ampere |
NVIDIA RTX A4000 |
NVIDIA Ampere |
NVIDIA Quadro RTX 8000 |
Turing |
NVIDIA Quadro RTX 6000 |
Turing |
NVIDIA Quadro RTX 5000 |
Turing |
NVIDIA Quadro RTX 4000 |
Turing |
NVIDIA T1000 |
Turing |
NVIDIA T600 |
Turing |
NVIDIA T400 |
Turing |
Product |
Architecture |
---|---|
NVIDIA HGX B200 |
NVIDIA Blackwell |
NVIDIA HGX GB200 NVL |
NVIDIA Blackwell |
Supported ARM Based Platforms#
The following NVIDIA data center GPUs are supported:
Product |
Architecture |
---|---|
NVIDIA A100X |
Ampere |
NVIDIA A30X |
Ampere |
NVIDIA IGX Orin |
Ampere |
AWS EC2 G5g instances |
Turing |
In addition to the products specified in the preceding table, any ARM based system that meets the following requirements is supported:
NVIDIA GPUs connected to the PCI bus.
A supported operating system such as Ubuntu or Red Hat Enterprise Linux.
Note
The GPU Operator only supports platforms using discrete GPUs. NVIDIA Jetson, or other embedded products with integrated GPUs, are not supported.
NVIDIA IGX Orin, a platform with an integrated GPU, is supported as long as the discrete GPU is the device being used.
Supported Deployment Options#
The GPU Operator has been validated in the following scenarios:
Deployment Options |
---|
Bare Metal |
Virtual machines with GPU Passthrough |
Virtual machines with NVIDIA vGPU based products |
Note
GPU Operator is supported with NVIDIA vGPU 12.0+.
Supported Operating Systems and Kubernetes Platforms#
The GPU Operator has been validated in the following scenarios:
Operating
System
|
Kubernetes 1 |
Red Hat
OpenShift
|
VMware vSphere
with Tanzu
|
Rancher Kubernetes
Engine 2
|
HPE Ezmeral
Runtime
Enterprise
|
Canonical
MicroK8s
|
Nutanix
NKP
|
---|---|---|---|---|---|---|---|
Ubuntu 20.04 LTS 2 |
1.29—1.32 |
7.0 U3c, 8.0 U2, 8.0 U3 |
1.29—1.32 |
2.12, 2.13 |
|||
Ubuntu 22.04 LTS 2 |
1.29—1.32 |
8.0 U2, 8.0 U3 |
1.29—1.32 |
1.26 |
2.12, 2.13 |
||
Ubuntu 24.04 LTS |
1.29—1.32 |
||||||
Red Hat Core OS |
4.12—4.18
|
||||||
Red Hat
Enterprise
Linux 8.8,
8.10
|
1.29—1.32 |
1.29—1.32 |
|||||
Red Hat
Enterprise
Linux 8.4, 8.5
|
5.5 |
1 The Kubernetes community only supports the last three minor releases. Older releases may be supported through enterprise distributions of Kubernetes such as Red Hat OpenShift.
2
For Ubuntu 22.04 LTS, kernel versions 6.8 (non-precompiled driver containers only) 6.5 and 5.15 are LTS ESM kernels.
For Ubuntu 20.04 LTS, kernel versions 5.4 and 5.15 are LTS ESM kernels.
The GPU Driver containers support these Linux kernels.
Refer to the Kernel release schedule on Canonical’s
Ubuntu kernel lifecycle and enablement stack page for more information.
NVIDIA recommends disabling automatic updates for the Linux kernel that are performed
by the unattended-upgrades
package to prevent an upgrade to an unsupported kernel version.
Note
Red Hat OpenShift Container Platform is supported on AWS, Azure, GCP, and OCI (Oracle) Virtual Machine or Bare Metal instances with T4, V100, L4, L40s, A10, A100, H100, and H200.
Operating
System
|
Amazon EKS
Kubernetes
|
Google GKE
Kubernetes
|
Microsoft Azure
Kubernetes Service
|
---|---|---|---|
Ubuntu 20.04 LTS |
1.25—1.28 |
1.25—1.28 |
1.25—1.28 |
Ubuntu 22.04 LTS |
1.25—1.28 |
1.25—1.28 |
1.25—1.28 |
Operating
System
|
Kubernetes |
Red Hat
OpenShift
|
VMware vSphere
with Tanzu
|
Rancher Kubernetes
Engine 2
|
Nutanix
NKP
|
---|---|---|---|---|---|
Ubuntu 20.04 LTS |
1.29–1.32 |
7.0 U3c, 8.0 U2, 8.0 U3 |
1.23—1.25 |
2.12, 2.13 |
|
Ubuntu 22.04 LTS |
1.29–1.32 |
8.0 U2, 8.0 U3 |
2.12, 2.13 |
||
Ubuntu 24.04 LTS |
1.29–1.32 |
||||
Red Hat Core OS |
4.12—4.18 |
||||
Red Hat
Enterprise
Linux 8.4,
8.6—8.10
|
1.29—1.32 |
1.29—1.32 |
Supported Precompiled Drivers#
The GPU Operator has been validated with the following precomplied drivers. See the Precompiled Driver Containers page for more on using precompiled drivers.
Operating System |
Kernel Flavor |
Kernel Version |
CUDA Driver Branch |
---|---|---|---|
Ubuntu 22.04 |
Generic, NVIDIA, Azure AWS, Oracle |
5.15 |
R535, R550, R570 |
Ubuntu 24.04 |
Generic, NVIDIA, Azure AWS, Oracle |
6.8 |
R550, R570 |
Supported Container Runtimes#
The GPU Operator has been validated in the following scenarios:
Operating System |
Containerd 1.6 - 2.0 |
CRI-O |
---|---|---|
Ubuntu 20.04 LTS |
Yes |
Yes |
Ubuntu 22.04 LTS |
Yes |
Yes |
Ubuntu 24.04 LTS |
Yes |
Yes |
Red Hat Core OS (RHCOS) |
No |
Yes |
Red Hat Enterprise Linux 8 |
Yes |
Yes |
Support for KubeVirt and OpenShift Virtualization#
Red Hat OpenShift Virtualization is based on KubeVirt.
Operating System |
Kubernetes |
KubeVirt |
OpenShift Virtualization |
||
---|---|---|---|---|---|
GPU
Passthrough
|
vGPU |
GPU
Passthrough
|
vGPU |
||
Ubuntu 24.04 LTS |
1.23—1.29 |
0.36+ |
0.59.1+ |
||
Ubuntu 20.04 LTS |
1.23—1.29 |
0.36+ |
0.59.1+ |
||
Ubuntu 22.04 LTS |
1.23—1.29 |
0.36+ |
0.59.1+ |
||
Red Hat Core OS |
4.12—4.18 |
4.13—4.18 |
You can run GPU passthrough and NVIDIA vGPU in the same cluster as long as you use a software version that meets both requirements.
NVIDIA vGPU is incompatible with KubeVirt v0.58.0, v0.58.1, and v0.59.0, as well
as OpenShift Virtualization 4.12.0—4.12.2.
Starting with KubeVirt v0.58.2 and v0.59.1, and OpenShift Virtualization 4.12.3 and 4.13,
you must set the DisableMDEVConfiguration
feature gate.
Refer to GPU Operator with KubeVirt or NVIDIA GPU Operator with OpenShift Virtualization.
KubeVirt and OpenShift Virtualization with NVIDIA vGPU is supported on the following devices:
H200NVL
H100
GA10x: A100, A40, RTX A6000, RTX A5500, RTX A5000, A30, A16, A10, A2.
The A10G and A10M GPUs are excluded.
AD10x: L40, RTX 6000 Ada, L4.
The L40G GPU is excluded.
Note that HGX platforms are not supported.
Support for GPUDirect RDMA#
Supported operating systems and NVIDIA GPU Drivers with GPUDirect RDMA.
Ubuntu 24.04 LTS with Network Operator 25.1.0.
Ubuntu 20.04 and 22.04 LTS with Network Operator 24.10.0.
Red Hat OpenShift 4.12 and higher with Network Operator 23.10.0
For information about configuring GPUDirect RDMA, refer to GPUDirect RDMA and GPUDirect Storage.
Support for GPUDirect Storage#
Supported operating systems and NVIDIA GPU Drivers with GPUDirect Storage.
-Ubuntu 24.04 LTS Network Operator 25.1.0 - Ubuntu 20.04 and 22.04 LTS with Network Operator 24.10.0 - Red Hat OpenShift Container Platform 4.12 and higher
Note
Version v2.17.5 and higher of the NVIDIA GPUDirect Storage kernel driver, nvidia-fs
,
requires the NVIDIA Open GPU Kernel module driver.
You can install the open kernel modules by specifying the driver.kernelModuleType=auto
if you are using driver container version 570.86.15, 570.124.06 or later.
Or use driver.kernelModuleType=open
if you are using a different driver version or branch.
argument to the helm
command.
Refer to Common Chart Customization Options for more information.
Not supported with secure boot. Supported storage types are local NVMe and remote NFS.
Additional Supported Container Management Tools#
Helm v3
Red Hat Operator Lifecycle Manager (OLM)