Platform Support#

NVIDIA GPU Operator Versioning#

NVIDIA GPU Operator is versioned following the calendar versioning convention.

The version follows the pattern YY.MM.PP, such as 23.6.0, 23.6.1, and 23.9.0. The first two fields, YY.MM, identify a major version and indicate when that major version was initially released. The third field, PP, identifies the patch version of the major version. Patch releases typically include critical bug and CVE fixes, but can include minor features.
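
As an illustration of the pattern, the following sketch splits a version string into its major version and patch fields. The names `OperatorVersion` and `parse_operator_version` are hypothetical and are not part of any NVIDIA tooling.

```python
import re
from typing import NamedTuple


class OperatorVersion(NamedTuple):
    year: int   # YY field
    month: int  # MM field
    patch: int  # PP field

    @property
    def major(self) -> str:
        """Return the YY.MM portion that identifies the major version."""
        return f"{self.year}.{self.month}"


def parse_operator_version(text: str) -> OperatorVersion:
    """Parse a YY.MM.PP string such as '23.6.1'; a leading 'v' is tolerated."""
    match = re.fullmatch(r"v?(\d{2})\.(\d{1,2})\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"not a YY.MM.PP version: {text!r}")
    year, month, patch = (int(part) for part in match.groups())
    return OperatorVersion(year, month, patch)
```

With this sketch, `parse_operator_version("23.6.1").major` yields the string `23.6`, and 23.6.0 and 23.6.1 compare as two patch releases of the same major version.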

NVIDIA GPU Operator Life Cycle#

When a major version of NVIDIA GPU Operator is released, the previous major version enters maintenance support and receives patch releases only for critical bug and CVE fixes. All earlier major versions enter end-of-life (EOL): they are no longer supported and do not receive patch releases.

The product life cycle and versioning are subject to change in the future.

Note

  • Upgrades are only supported within a major release or to the next major release.
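
The upgrade rule in this note can be sketched as a small check. The ordered list `KNOWN_MAJORS` and both function names are assumptions made for illustration; a real check would need the full, current release history.

```python
# Hypothetical, ordered list of major versions (oldest first); extend as needed.
KNOWN_MAJORS = ["24.6", "24.9", "25.3"]


def split_version(version: str) -> tuple[str, int]:
    """Split 'YY.MM.PP' into its major version 'YY.MM' and integer patch."""
    year, month, patch = version.lstrip("v").split(".")
    return f"{year}.{month}", int(patch)


def is_supported_upgrade(current: str, target: str) -> bool:
    """True when target is a later patch of the same major or the next major."""
    current_major, current_patch = split_version(current)
    target_major, target_patch = split_version(target)
    # list.index raises ValueError for a major version missing from KNOWN_MAJORS.
    step = KNOWN_MAJORS.index(target_major) - KNOWN_MAJORS.index(current_major)
    if step == 0:
        return target_patch > current_patch
    return step == 1
```

Under these assumptions, moving from 24.9.0 to 25.3.0 passes the check, while moving from 24.6.0 directly to 25.3.0 does not because it skips a major release.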

Support Status for Releases#

| GPU Operator Version | Status |
| --- | --- |
| 25.3.x | Generally Available |
| 24.9.x | Maintenance |
| 24.6.x and lower | EOL |
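
The status values follow from the life cycle rule described earlier: the newest major version is generally available, the one before it is in maintenance, and everything older is EOL. A minimal sketch of that mapping, with a hypothetical function name and a caller-supplied release list, might look like this:

```python
def support_status(major: str, released_majors: list[str]) -> str:
    """Classify a major version given all released majors, oldest first."""
    position = released_majors.index(major)  # ValueError if the major is unknown
    newest = len(released_majors) - 1
    if position == newest:
        return "Generally Available"
    if position == newest - 1:
        return "Maintenance"
    return "EOL"


# Example release history; only these three majors are taken from this page.
releases = ["24.6", "24.9", "25.3"]
statuses = {major: support_status(major, releases) for major in releases}
```

When a new major version is appended to the list, every existing entry shifts one step toward EOL without further changes to the function.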

GPU Operator Component Matrix#

The following table shows the operands and default operand versions that correspond to a GPU Operator version.

When post-release testing confirms support for newer versions of operands, these updates are identified as recommended updates to a GPU Operator version. Refer to Upgrading the NVIDIA GPU Operator for more information.

| Component | Version |
| --- | --- |
| NVIDIA GPU Operator | v25.3.0 |
| NVIDIA GPU Driver | 570.124.06 (default, recommended), |
| NVIDIA Driver Manager for Kubernetes | v0.8.0 |
| NVIDIA Container Toolkit | 1.17.5 |
| NVIDIA Kubernetes Device Plugin | 0.17.1 |
| DCGM Exporter | 4.1.1-4.0.4 |
| Node Feature Discovery | v0.17.2 |
| NVIDIA GPU Feature Discovery for Kubernetes | 0.17.1 |
| NVIDIA MIG Manager for Kubernetes | 0.12.1 |
| DCGM | 4.1.1-2 |
| Validator for NVIDIA GPU Operator | v25.3.0 |
| NVIDIA KubeVirt GPU Device Plugin | v1.3.1 |
| NVIDIA vGPU Device Manager | v0.3.0 |
| NVIDIA GDS Driver 1 | 2.20.5 |
| NVIDIA Kata Manager for Kubernetes | v0.2.3 |
| NVIDIA Confidential Computing Manager for Kubernetes | v0.1.1 |
| NVIDIA GDRCopy Driver | v2.4.4 |

1 This release of the GDS driver requires that you use the NVIDIA Open GPU Kernel module driver for the GPUs. Refer to GPUDirect RDMA and GPUDirect Storage for more information.

Supported NVIDIA Data Center GPUs and Systems#

The following NVIDIA data center GPUs are supported on x86 based platforms:

| Product | Architecture |
| --- | --- |
| NVIDIA GH200 1 | NVIDIA Grace Hopper |

1 NVIDIA GH200 systems require the NVIDIA Open GPU Kernel module driver. You can install the open kernel modules by specifying the driver.useOpenKernelModules=true argument to the helm command. Refer to Common Chart Customization Options for more information.

| Product | Architecture |
| --- | --- |
| NVIDIA H800 | NVIDIA Hopper |
| NVIDIA H200, NVIDIA H200 NVL | NVIDIA Hopper |
| NVIDIA HGX H200 | NVIDIA Hopper and NVSwitch |
| NVIDIA DGX H100 | NVIDIA Hopper and NVSwitch |
| NVIDIA HGX H100 | NVIDIA Hopper and NVSwitch |
| NVIDIA H100, NVIDIA H100 NVL | NVIDIA Hopper |
| NVIDIA H20 | NVIDIA Hopper |
| NVIDIA L20 | NVIDIA Ada |
| NVIDIA L40, NVIDIA L40S | NVIDIA Ada |
| NVIDIA L4 | NVIDIA Ada |
| NVIDIA DGX A100 | A100 and NVSwitch |
| NVIDIA HGX A100 | A100 and NVSwitch |
| NVIDIA A800 | NVIDIA Ampere |
| NVIDIA A100 | NVIDIA Ampere |
| NVIDIA A100X | NVIDIA Ampere |
| NVIDIA A40 | NVIDIA Ampere |
| NVIDIA A30 | NVIDIA Ampere |
| NVIDIA A30X | NVIDIA Ampere |
| NVIDIA A16 | NVIDIA Ampere |
| NVIDIA A10 | NVIDIA Ampere |
| NVIDIA A2 | NVIDIA Ampere |

| Product | Architecture |
| --- | --- |
| NVIDIA T4 | Turing |
| NVIDIA V100 | Volta |
| NVIDIA P100 | Pascal |
| NVIDIA P40 | Pascal |
| NVIDIA P4 | Pascal |

| Product | Architecture |
| --- | --- |
| NVIDIA RTX A6000 | NVIDIA Ampere / Ada |
| NVIDIA RTX A5000 | NVIDIA Ampere |
| NVIDIA RTX A4500 | NVIDIA Ampere |
| NVIDIA RTX A4000 | NVIDIA Ampere |
| NVIDIA Quadro RTX 8000 | Turing |
| NVIDIA Quadro RTX 6000 | Turing |
| NVIDIA Quadro RTX 5000 | Turing |
| NVIDIA Quadro RTX 4000 | Turing |
| NVIDIA T1000 | Turing |
| NVIDIA T600 | Turing |
| NVIDIA T400 | Turing |

| Product | Architecture |
| --- | --- |
| NVIDIA HGX B200 | NVIDIA Blackwell |
| NVIDIA HGX GB200 NVL | NVIDIA Blackwell |

Supported ARM Based Platforms#

The following NVIDIA data center GPUs are supported:

| Product | Architecture |
| --- | --- |
| NVIDIA A100X | Ampere |
| NVIDIA A30X | Ampere |
| NVIDIA IGX Orin | Ampere |
| AWS EC2 G5g instances | Turing |

In addition to the products specified in the preceding table, any ARM based system that meets the following requirements is supported:

Note

The GPU Operator only supports platforms using discrete GPUs. NVIDIA Jetson, or other embedded products with integrated GPUs, are not supported.

NVIDIA IGX Orin, a platform with an integrated GPU, is supported as long as the discrete GPU is the device being used.

Supported Deployment Options#

The GPU Operator has been validated in the following scenarios:

Deployment Options

Bare Metal

Virtual machines with GPU Passthrough

Virtual machines with NVIDIA vGPU based products

Note

GPU Operator is supported with NVIDIA vGPU 12.0+.

Supported Operating Systems and Kubernetes Platforms#

The GPU Operator has been validated in the following scenarios:

Operating System

Kubernetes 1

Red Hat OpenShift

VMware vSphere with Tanzu

Rancher Kubernetes Engine 2

HPE Ezmeral Runtime Enterprise

Canonical MicroK8s

Nutanix NKP

Ubuntu 20.04 LTS 2

1.29—1.32

7.0 U3c, 8.0 U2, 8.0 U3

1.29—1.32

2.12, 2.13

Ubuntu 22.04 LTS 2

1.29—1.32

8.0 U2, 8.0 U3

1.29—1.32

1.26

2.12, 2.13

Ubuntu 24.04 LTS

1.29—1.32

Red Hat Core OS

4.12–4.18

Red Hat Enterprise Linux 8.8, 8.10

1.29—1.32

1.29—1.32

Red Hat Enterprise Linux 8.4, 8.5

5.5

1 The Kubernetes community only supports the last three minor releases. Older releases may be supported through enterprise distributions of Kubernetes such as Red Hat OpenShift.

2 For Ubuntu 22.04 LTS, kernel versions 6.8 (non-precompiled driver containers only), 6.5, and 5.15 are LTS ESM kernels. For Ubuntu 20.04 LTS, kernel versions 5.4 and 5.15 are LTS ESM kernels. The GPU Driver containers support these Linux kernels. Refer to the Kernel release schedule on Canonical’s Ubuntu kernel lifecycle and enablement stack page for more information. To prevent an upgrade to an unsupported kernel version, NVIDIA recommends disabling the automatic Linux kernel updates that the unattended-upgrades package performs.
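
Because an automatic kernel update can move a node onto an unsupported kernel, a quick check against the kernel series named in this footnote can be useful. The sketch below is illustrative only; the dictionary and function names are hypothetical, and it ignores the restriction that 6.8 on Ubuntu 22.04 applies to non-precompiled driver containers only.

```python
# Kernel series taken from the footnote above, keyed by Ubuntu release.
SUPPORTED_KERNELS = {
    "20.04": {"5.4", "5.15"},
    "22.04": {"5.15", "6.5", "6.8"},
}


def kernel_series(release: str) -> str:
    """Reduce a `uname -r` style string such as '5.15.0-105-generic' to '5.15'."""
    major, minor = release.split(".")[:2]
    return f"{major}.{minor}"


def is_supported_kernel(ubuntu_release: str, kernel_release: str) -> bool:
    """Check a kernel against the series listed for the given Ubuntu release."""
    allowed = SUPPORTED_KERNELS.get(ubuntu_release, set())
    return kernel_series(kernel_release) in allowed
```

On a running node, the kernel string could come from `platform.release()` in the Python standard library, which reports the same value as `uname -r` on Linux.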

Note

Red Hat OpenShift Container Platform is supported on AWS, Azure, GCP, and OCI (Oracle) Virtual Machine or Bare Metal instances with T4, V100, L4, L40S, A10, A100, H100, and H200 GPUs.

| Operating System | Amazon EKS Kubernetes | Google GKE Kubernetes | Microsoft Azure Kubernetes Service |
| --- | --- | --- | --- |
| Ubuntu 20.04 LTS | 1.25–1.28 | 1.25–1.28 | 1.25–1.28 |
| Ubuntu 22.04 LTS | 1.25–1.28 | 1.25–1.28 | 1.25–1.28 |

Operating System

Kubernetes

Red Hat OpenShift

VMware vSphere with Tanzu

Rancher Kubernetes Engine 2

Nutanix NKP

Ubuntu 20.04 LTS

1.29–1.32

7.0 U3c, 8.0 U2, 8.0 U3

1.23—1.25

2.12, 2.13

Ubuntu 22.04 LTS

1.29–1.32

8.0 U2, 8.0 U3

2.12, 2.13

Ubuntu 24.04 LTS

1.29–1.32

Red Hat Core OS

4.12—4.18

Red Hat Enterprise Linux 8.4, 8.6–8.10

1.29—1.32

1.29—1.32

Supported Precompiled Drivers#

The GPU Operator has been validated with the following precompiled drivers. Refer to the Precompiled Driver Containers page for more information about using precompiled drivers.

| Operating System | Kernel Flavor | Kernel Version | CUDA Driver Branch |
| --- | --- | --- | --- |
| Ubuntu 22.04 | Generic, NVIDIA, Azure, AWS, Oracle | 5.15 | R535, R550, R570 |
| Ubuntu 24.04 | Generic, NVIDIA, Azure, AWS, Oracle | 6.8 | R550, R570 |
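
The validated combinations can be encoded as a lookup so that a deployment script can refuse an unvalidated pairing early. This is a sketch only: the `PRECOMPILED` dictionary and the function name are hypothetical, and kernel flavor is left out for brevity.

```python
# Rows copied from the table above: Ubuntu release -> (kernel series, driver branches).
PRECOMPILED = {
    "22.04": ("5.15", {"R535", "R550", "R570"}),
    "24.04": ("6.8", {"R550", "R570"}),
}


def has_precompiled_driver(ubuntu_release: str, kernel_series: str, branch: str) -> bool:
    """True when the combination appears in the validated table."""
    row = PRECOMPILED.get(ubuntu_release)
    if row is None:
        return False
    kernel, branches = row
    return kernel_series == kernel and branch in branches
```

For example, the R535 branch is listed for Ubuntu 22.04 with kernel 5.15 but not for Ubuntu 24.04, so the lookup accepts the former and rejects the latter.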

Supported Container Runtimes#

The GPU Operator has been validated in the following scenarios:

| Operating System | Containerd 1.6 - 2.0 | CRI-O |
| --- | --- | --- |
| Ubuntu 20.04 LTS | Yes | Yes |
| Ubuntu 22.04 LTS | Yes | Yes |
| Ubuntu 24.04 LTS | Yes | Yes |
| Red Hat Core OS (RHCOS) | No | Yes |
| Red Hat Enterprise Linux 8 | Yes | Yes |

Support for KubeVirt and OpenShift Virtualization#

Red Hat OpenShift Virtualization is based on KubeVirt.

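| Operating System | Kubernetes | KubeVirt GPU Passthrough | KubeVirt vGPU | OpenShift Virtualization GPU Passthrough | OpenShift Virtualization vGPU |
| --- | --- | --- | --- | --- | --- |
| Ubuntu 24.04 LTS | 1.23–1.29 | 0.36+ | 0.59.1+ | | |
| Ubuntu 20.04 LTS | 1.23–1.29 | 0.36+ | 0.59.1+ | | |
| Ubuntu 22.04 LTS | 1.23–1.29 | 0.36+ | 0.59.1+ | | |
| Red Hat Core OS | | | | 4.12–4.18 | 4.13–4.18 |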

You can run GPU passthrough and NVIDIA vGPU in the same cluster as long as you use a software version that meets both requirements.

NVIDIA vGPU is incompatible with KubeVirt v0.58.0, v0.58.1, and v0.59.0, as well as OpenShift Virtualization 4.12.0—4.12.2. Starting with KubeVirt v0.58.2 and v0.59.1, and OpenShift Virtualization 4.12.3 and 4.13, you must set the DisableMDEVConfiguration feature gate. Refer to GPU Operator with KubeVirt or NVIDIA GPU Operator with OpenShift Virtualization.
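
The KubeVirt part of this rule can be expressed as two small checks. The sketch below covers only the KubeVirt versions named in the paragraph above, not OpenShift Virtualization, and all function names are hypothetical.

```python
# KubeVirt releases that the paragraph above lists as incompatible with NVIDIA vGPU.
INCOMPATIBLE_KUBEVIRT = {(0, 58, 0), (0, 58, 1), (0, 59, 0)}


def parse_kubevirt_version(version: str) -> tuple[int, int, int]:
    """Turn 'v0.59.1' or '0.59.1' into (0, 59, 1)."""
    major, minor, patch = version.lstrip("v").split(".")
    return int(major), int(minor), int(patch)


def is_listed_incompatible(kubevirt_version: str) -> bool:
    """True for the three releases explicitly named as incompatible."""
    return parse_kubevirt_version(kubevirt_version) in INCOMPATIBLE_KUBEVIRT


def needs_disable_mdev_gate(kubevirt_version: str) -> bool:
    """True from v0.58.2 on the 0.58 line and from v0.59.1 onward."""
    version = parse_kubevirt_version(kubevirt_version)
    if version[:2] == (0, 58):
        return version >= (0, 58, 2)
    return version >= (0, 59, 1)
```

The first check only reports the explicitly listed releases; it does not replace the minimum versions given in the support table.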

KubeVirt and OpenShift Virtualization with NVIDIA vGPU are supported on the following devices:

  • H200 NVL

  • H100

  • GA10x: A100, A40, RTX A6000, RTX A5500, RTX A5000, A30, A16, A10, A2.

    The A10G and A10M GPUs are excluded.

  • AD10x: L40, RTX 6000 Ada, L4.

    The L40G GPU is excluded.

Note that HGX platforms are not supported.

Support for GPUDirect RDMA#

GPUDirect RDMA is supported on the following operating systems:

  • Ubuntu 24.04 LTS with Network Operator 25.1.0.

  • Ubuntu 20.04 and 22.04 LTS with Network Operator 24.10.0.

  • Red Hat OpenShift 4.12 and higher with Network Operator 23.10.0.

For information about configuring GPUDirect RDMA, refer to GPUDirect RDMA and GPUDirect Storage.

Support for GPUDirect Storage#

GPUDirect Storage is supported on the following operating systems:

  • Ubuntu 24.04 LTS with Network Operator 25.1.0.

  • Ubuntu 20.04 and 22.04 LTS with Network Operator 24.10.0.

  • Red Hat OpenShift Container Platform 4.12 and higher.

Note

Version v2.17.5 and higher of the NVIDIA GPUDirect Storage kernel driver, nvidia-fs, requires the NVIDIA Open GPU Kernel module driver. You can install the open kernel modules by specifying the driver.kernelModuleType=auto argument to the helm command if you are using driver container version 570.86.15, 570.124.06, or later. If you are using a different driver version or branch, specify driver.kernelModuleType=open instead. Refer to Common Chart Customization Options for more information.

GPUDirect Storage is not supported with secure boot. Supported storage types are local NVMe and remote NFS.
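
The choice between the two driver.kernelModuleType values can be sketched as a version comparison. This example assumes one particular reading of the note, namely that any driver version comparing at or above 570.86.15 takes auto and everything else takes open; that reading, and both function names, are assumptions rather than documented behavior.

```python
def parse_driver_version(version: str) -> tuple[int, ...]:
    """Turn '570.124.06' into (570, 124, 6) for numeric comparison."""
    return tuple(int(part) for part in version.split("."))


def kernel_module_type(driver_version: str) -> str:
    """Pick a driver.kernelModuleType value under the assumption stated above."""
    if parse_driver_version(driver_version) >= (570, 86, 15):
        return "auto"
    return "open"


# The result is intended for a --set driver.kernelModuleType=<value> helm argument.
helm_set_argument = f"driver.kernelModuleType={kernel_module_type('570.124.06')}"
```

Confirm the value against Common Chart Customization Options before relying on it, because the cutoff used here is an interpretation.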

Additional Supported Container Management Tools#

  • Helm v3

  • Red Hat Operator Lifecycle Manager (OLM)