NVIDIA DRA Driver for GPUs#

GPU allocation#

Compared to traditional GPU allocation using coarse-grained count-based requests, the GPU allocation side of this driver enables fine-grained control and powerful features long desired by the community, such as:

  1. Controlled sharing of individual GPUs between multiple pods and/or containers.

  2. GPU selection via complex constraints expressed via CEL.

  3. Dynamic partitioning.

To learn more about this part of the driver and about what we are planning to build in the future, have a look at these release notes.

While the GPU allocation features of this driver can be tried out, they are not yet officially supported. Hence, the GPU kubelet plugin is currently disabled by default in the Helm chart installation.

For documentation on how to use and test the current set of GPU allocation features, please head over to the demo section of the driver’s README and to its quickstart directory.

Note

This part of the NVIDIA DRA Driver for GPUs is in Technology Preview. It is not yet supported in production environments and not yet functionally complete. Generally spoken, Technology Preview features provide early access to upcoming product features, enabling users to test functionality and provide feedback during the development process. Technology Preview releases may not have full documentation, and testing is limited.