Release Notes
This document describes the new features, improvements, fixed and known issues for the NVIDIA Container Toolkit.
NVIDIA Container Toolkit 1.12.1
This release of the NVIDIA Container Toolkit v1.12.1 is primarily a bugfix release.
Packaging Changes
Fixed a bug in the uninstall scriptlet on RPM-based systems that would issue an error due to a missing nvidia-container-runtime-hook symlink. This did not prevent the uninstallation of the package.
Removed fedora35 as a packaging target. Use the centos8 packages instead.
Fixes and Features
Fixed a bug when running containers using a generated CDI specification or when NVIDIA_DRIVER_CAPABILITIES includes graphics or display or is set to all. Now, containers no longer fail with an error message indicating a missing /dev/dri or /dev/nvidia-caps path.
Added support for detecting and injecting multiple GSP firmware files as required by the 525.x versions of the NVIDIA GPU drivers.
Fixed an issue that caused the nvidia-ctk path to be blank in generated CDI specifications.
Fixed missing NVML symbols for nvidia-ctk on some platforms. For more information, see issue #49.
specific to libnvidia-container
Added support for detecting and injecting multiple GSP firmware files as required by the 525.x versions of the NVIDIA GPU drivers.
specific to container-toolkit container images
Updated CUDA base images to 12.1.0.
NVIDIA Container Toolkit 1.12.0
This release of the NVIDIA Container Toolkit v1.12.0 adds the following major features:
Improved support for headless Vulkan applications in containerized environments.
Tooling to generate Container Device Interface (CDI) specifications for GPU devices. The use of CDI is now the recommended mechanism for using GPUs in podman.
The following packages are included:
nvidia-container-toolkit 1.12.0
libnvidia-container-tools 1.12.0
libnvidia-container1 1.12.0
The following container-toolkit containers are included:
nvcr.io/nvidia/k8s/container-toolkit:v1.12.0-centos7
nvcr.io/nvidia/k8s/container-toolkit:v1.12.0-ubi8
nvcr.io/nvidia/k8s/container-toolkit:v1.12.0-ubuntu18.04
nvcr.io/nvidia/k8s/container-toolkit:v1.12.0-ubuntu20.04 (also as nvcr.io/nvidia/k8s/container-toolkit:v1.12.0)
The following packages have also been updated to depend on nvidia-container-toolkit of at least 1.12.0:
nvidia-container-runtime 3.12.0
nvidia-docker2 2.12.0
Note
This will be the last release that updates the nvidia-container-runtime and nvidia-docker2 packages. All required functionality is now included in the nvidia-container-toolkit package. This includes a utility to configure the Docker daemon to use the NVIDIA Container Runtime.
Packaging Changes
The nvidia-container-toolkit package was updated to allow upgrades from pre-v1.11.0 versions of the package without removing the nvidia-container-runtime-hook executable.
On certain distributions, full mirrors have been removed. The links to the .list and .repo files for Debian and RPM-based distributions respectively have been maintained to ensure that the official installation instructions continue to function. This change serves to further optimize the size of our package repository.
Fixes and Features
Add nvidia-ctk cdi generate command to generate CDI specifications for available NVIDIA devices. The generated CDI specification can be used to provide access to NVIDIA devices in CDI-enabled container engines such as podman, especially in the rootless case.
Add full support for headless Vulkan applications in containerized environments when NVIDIA_DRIVER_CAPABILITIES includes graphics or display. This includes the injection of Vulkan ICD loaders as well as direct rendering devices.
Improve the logging of errors in the NVIDIA Container Runtime.
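As a rough sketch of the CDI workflow described in the list above, the following generates a specification and then requests all GPUs from podman by their CDI name. The --output flag, the /etc/cdi/nvidia.yaml location, the ubuntu image, and the nvidia.com/gpu=all device name are assumptions based on common usage, not on these notes. Check nvidia-ctk cdi generate --help on the installed version before relying on them.

```shell
#!/bin/sh
# Sketch only: the flag name, spec path, image and device name are assumptions.
CDI_SPEC="${CDI_SPEC:-/etc/cdi/nvidia.yaml}"   # assumed spec location
CDI_DEVICE="nvidia.com/gpu=all"                # assumed CDI device name

if command -v nvidia-ctk >/dev/null 2>&1 && command -v podman >/dev/null 2>&1; then
    # Write a CDI specification for the NVIDIA devices found on this host.
    sudo nvidia-ctk cdi generate --output="$CDI_SPEC"
    # Ask podman to inject the devices named in the generated specification.
    podman run --rm --device "$CDI_DEVICE" ubuntu nvidia-smi -L
else
    echo "nvidia-ctk or podman not found; nothing to do"
fi
```

The guard keeps the script harmless on hosts without the toolkit. Only the generation step uses sudo, because the assumed spec path is system-wide; the podman invocation itself runs as the calling user.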
specific to libnvidia-container
Include the NVVM compiler library in the set of injected compute libraries
Skip the creation of files that are already mounted to allow paths such as /var/run to be mounted into containers.
Add nvcubins.bin to DriverStore components under WSL2
specific to container-toolkit container images
Update CUDA base images to 12.0.1
Known Issues
When running a container using CDI or if NVIDIA_DRIVER_CAPABILITIES includes graphics or display, an error may be raised citing missing /dev/dri and/or /dev/nvidia-caps paths in the container if the selected device does not have such nodes associated with it.
$ docker run -it --runtime=nvidia -e NVIDIA_VISIBLE_DEVICES=nvidia.com/gpu=0 nvidia/cuda:11.0.3-base-ubuntu20.04 nvidia-smi -L
docker: Error response from daemon: failed to create shim task: OCI runtime create failed: runc create failed: unable to start container process: error during container init: error running hook #1: error running hook: exit status 1, stdout: , stderr: chmod: cannot access '/var/lib/docker/overlay2/9069fafcb6e39ccf704fa47b52ca92a1d48ca5ccfedd381f407456fb6cd3f9f0/merged/dev/dri': No such file or directory: unknown.
ERRO[0000] error waiting for container: context canceled
This issue has been addressed in the v1.12.1 release.
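To decide whether a host still needs this fix, one option is to compare the installed toolkit version with 1.12.1. The sketch below assumes a sort that supports -V (as GNU coreutils does). The function name version_ge and the sample versions are illustrative only.

```shell
#!/bin/sh
# Hypothetical helper: succeeds when version $1 is greater than or equal to $2.
# Relies on `sort -V` (version sort), which GNU coreutils provides.
version_ge() {
    [ "$(printf '%s\n' "$1" "$2" | sort -V | head -n 1)" = "$2" ]
}

# Sample versions only; substitute the version reported by the package manager.
for installed in 1.12.0 1.12.1; do
    if version_ge "$installed" 1.12.1; then
        echo "$installed: includes the fix"
    else
        echo "$installed: upgrade to v1.12.1 or later"
    fi
done
```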
NVIDIA Container Toolkit 1.11.0
This release of the NVIDIA Container Toolkit v1.11.0 is primarily targeted at adding support for injection of GPUDirect Storage and MOFED devices into containerized environments.
The following packages are included:
nvidia-container-toolkit 1.11.0
libnvidia-container-tools 1.11.0
libnvidia-container1 1.11.0
The following container-toolkit containers are included:
nvcr.io/nvidia/k8s/container-toolkit:v1.11.0-centos7
nvcr.io/nvidia/k8s/container-toolkit:v1.11.0-ubi8
nvcr.io/nvidia/k8s/container-toolkit:v1.11.0-ubuntu18.04
nvcr.io/nvidia/k8s/container-toolkit:v1.11.0-ubuntu20.04 (also as nvcr.io/nvidia/k8s/container-toolkit:v1.11.0)
The following packages have also been updated to depend on nvidia-container-toolkit of at least 1.11.0:
nvidia-container-runtime 3.11.0
Note that this release does not include an update to nvidia-docker2 and is compatible with nvidia-docker2 2.11.0.
Packaging Changes
An nvidia-container-toolkit-base package has been introduced that allows for the higher-level components to be installed in cases where the NVIDIA Container Runtime Hook, NVIDIA Container CLI, and NVIDIA Container Library are not required. This includes Tegra-based systems where the CSV mode of the NVIDIA Container Runtime is used.
The package repository includes support for Fedora 35 packages.
The package repository includes support for RHEL 8.6. This redirects to the Centos 8 packages.
Mirrors for older distributions have been removed to limit the size of the package repository.
Fixes and Features
Fix bug in CSV mode where libraries listed as sym entries in mount specification are not added to the LDCache.
Rename the nvidia-container-toolkit executable to nvidia-container-runtime-hook to better indicate intent. A symlink named nvidia-container-toolkit is created that points to the nvidia-container-runtime-hook executable.
Inject platform files into container on Tegra-based systems to allow for future support of these systems in the GPU Device Plugin.
Add cdi mode to NVIDIA Container Runtime
Add discovery of GPUDirect Storage (nvidia-fs*) devices if the NVIDIA_GDS environment variable of the container is set to enabled
Add discovery of MOFED Infiniband devices if the NVIDIA_MOFED environment variable of the container is set to enabled
Add nvidia-ctk runtime configure command to configure the Docker config file (e.g. /etc/docker/daemon.json) for use with the NVIDIA Container Runtime.
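The last three items above can be combined into a short sketch: configure Docker for the NVIDIA Container Runtime, then start a container that opts in to GPUDirect Storage and MOFED device discovery. The systemctl restart step, the image, and the ls /dev probe are assumptions. The runtime name nvidia follows the docker example elsewhere in these notes. Running nvidia-ctk runtime configure without flags is assumed to target the default Docker config file.

```shell
#!/bin/sh
# Sketch only: the restart step, image and probe command are assumptions.
IMAGE="nvidia/cuda:11.0.3-base-ubuntu20.04"
DEVICE_ENV="-e NVIDIA_VISIBLE_DEVICES=all -e NVIDIA_GDS=enabled -e NVIDIA_MOFED=enabled"

if command -v nvidia-ctk >/dev/null 2>&1 && command -v docker >/dev/null 2>&1; then
    # Register the NVIDIA Container Runtime in the Docker config file.
    sudo nvidia-ctk runtime configure
    # Assumes a systemd-managed Docker daemon.
    sudo systemctl restart docker
    # $DEVICE_ENV is intentionally unquoted so it splits into separate arguments.
    docker run --rm --runtime=nvidia $DEVICE_ENV "$IMAGE" ls /dev
else
    echo "nvidia-ctk or docker not found; nothing to do"
fi
```

If discovery succeeds, the nvidia-fs* and Infiniband device nodes present on the host would be expected to appear in the listing. On hosts without such devices the listing is unchanged.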
specific to libnvidia-container
Fix bug where LDCache was not updated when the --no-pivot-root option was specified
Preload libgcc_s.so.1 on arm64 systems
specific to container-toolkit container images
Update CUDA base images to 11.7.1
Allow accept-nvidia-visible-devices-* config options to be set by toolkit container
Known Issues
When upgrading from an earlier version of the NVIDIA Container Toolkit on RPM-based systems, a package manager such as yum may remove the installed /usr/bin/nvidia-container-runtime-hook executable due to the post-uninstall hooks defined in the older package version. To avoid this problem, either remove the older version of the nvidia-container-toolkit before installing v1.11.0 or reinstall the v1.11.0 package if the /usr/bin/nvidia-container-runtime-hook file is missing. For systems where the v1.11.0 version of the package has already been installed and left in an unusable state, running yum reinstall -y nvidia-container-toolkit-1.11.0-1 should address this issue.
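A minimal sketch of how the check described above might be scripted follows. The function name check_hook and its messages are illustrative only. The hook path and the reinstall command are the ones given in the note.

```shell
#!/bin/sh
# Sketch only: check_hook and its output format are hypothetical.
HOOK="${HOOK:-/usr/bin/nvidia-container-runtime-hook}"

# Report whether the hook executable exists; print the suggested fix if not.
check_hook() {
    if [ -x "$1" ]; then
        echo "ok: $1 is present"
    else
        echo "missing: $1"
        echo "suggested fix: yum reinstall -y nvidia-container-toolkit-1.11.0-1"
        return 1
    fi
}

check_hook "$HOOK" || true
```

The script only reports. It deliberately leaves the reinstall to the operator, since that step needs root privileges and changes installed packages.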
The container-toolkit:v1.11.0 images have been released with the following known HIGH Vulnerability CVEs. These are from the base images and are not in libraries used by the components included in the container image as part of the NVIDIA Container Toolkit:
nvcr.io/nvidia/k8s/container-toolkit:v1.11.0-centos7:
  systemd - CVE-2022-2526
  systemd-libs - CVE-2022-2526
nvcr.io/nvidia/k8s/container-toolkit:v1.11.0-ubi8:
  systemd - CVE-2022-2526
  systemd-libs - CVE-2022-2526
  systemd-pam - CVE-2022-2526
nvcr.io/nvidia/k8s/container-toolkit:v1.11.0-ubuntu18.04:
  libsystemd0 - CVE-2022-2526
  libudev1 - CVE-2022-2526
NVIDIA Container Toolkit 1.10.0
This release of the NVIDIA Container Toolkit v1.10.0 is primarily targeted at improving support for Tegra-based systems.
It sees the introduction of a new mode of operation for the NVIDIA Container Runtime that makes modifications to the incoming OCI runtime
specification directly instead of relying on the NVIDIA Container CLI.
The following packages are included:
nvidia-container-toolkit 1.10.0
libnvidia-container-tools 1.10.0
libnvidia-container1 1.10.0
The following container-toolkit containers are included:
nvcr.io/nvidia/k8s/container-toolkit:v1.10.0-centos7
nvcr.io/nvidia/k8s/container-toolkit:v1.10.0-ubi8
nvcr.io/nvidia/k8s/container-toolkit:v1.10.0-ubuntu18.04
nvcr.io/nvidia/k8s/container-toolkit:v1.10.0-ubuntu20.04 (also as nvcr.io/nvidia/k8s/container-toolkit:v1.10.0)
The following packages have also been updated to depend on nvidia-container-toolkit of at least 1.10.0:
nvidia-container-runtime 3.10.0
nvidia-docker2 2.11.0
Packaging Changes
The package repository includes support for Ubuntu 22.04. This redirects to the Ubuntu 18.04 packages.
The package repository includes support for RHEL 9.0. This redirects to the Centos 8 packages.
The package repository includes support for OpenSUSE 15.2 and 15.3. These redirect to the OpenSUSE 15.1 packages.
The nvidia-docker2 Debian packages were updated to allow installation with moby-engine instead of requiring docker-ce, docker-ee, or docker.io.
Fixes and Features
Add nvidia-ctk CLI to provide utilities for interacting with the NVIDIA Container Toolkit
Add a new mode to the NVIDIA Container Runtime targeted at Tegra-based systems using CSV-file based mount specifications.
Use default config instead of raising an error if config file cannot be found
Switch to debug logging to reduce log verbosity
Support logging to logs requested in command line
Allow low-level runtime path to be set explicitly as nvidia-container-runtime.runtimes option
Fix failure to locate low-level runtime if PATH envvar is unset
Add --version flag to all CLIs
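With the --version flag in place, the installed components can be checked in one pass. The list of executable names below is an assumption about which CLIs the item covers; adjust it to match the packages installed on the host.

```shell
#!/bin/sh
# Sketch only: the set of CLI names is an assumption.
for cli in nvidia-ctk nvidia-container-runtime nvidia-container-cli; do
    if command -v "$cli" >/dev/null 2>&1; then
        # Print the version reported by the installed executable.
        "$cli" --version
    else
        echo "$cli: not installed"
    fi
done
```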
specific to libnvidia-container
Bump libtirpc to 1.3.2
Fix bug when running host ldconfig using glibc compiled with a non-standard prefix
Add libcudadebugger.so to list of compute libraries
[WSL2] Fix segmentation fault on WSL2 systems with no adapters present (e.g. /dev/dxg missing)
Ignore pending MIG mode when checking if a device is MIG enabled
[WSL2] Fix bug where /dev/dxg is not mounted when NVIDIA_DRIVER_CAPABILITIES does not include “compute”
specific to container-toolkit container images
Fix a bug in applying runtime configuration to containerd when version 1 config files are used
Update base images to CUDA 11.7.0
Multi-arch images for Ubuntu 18.04 are no longer available. (For multi-arch support for the container toolkit images at least Ubuntu 20.04 is required)
Centos 8 images are no longer available since the OS is considered EOL and no CUDA base image updates are available
Images are no longer published to Docker Hub and the NGC images should be used instead
Known Issues
The container-toolkit:v1.10.0 images have been released with the following known HIGH Vulnerability CVEs. These are from the base images and are not in libraries used by the components included in the container image as part of the NVIDIA Container Toolkit:
nvcr.io/nvidia/k8s/container-toolkit:v1.10.0-centos7:
  xz - CVE-2022-1271
  xz-libs - CVE-2022-1271
nvcr.io/nvidia/k8s/container-toolkit:v1.10.0-ubi8:
  xz-libs - CVE-2022-1271
NVIDIA Container Toolkit 1.9.0
This release of the NVIDIA Container Toolkit v1.9.0 is primarily targeted at adding multi-arch support for the container-toolkit images.
It also includes enhancements for use on Tegra-systems and some notable bugfixes.
The following packages are included:
nvidia-container-toolkit 1.9.0
libnvidia-container-tools 1.9.0
libnvidia-container1 1.9.0
The following container-toolkit containers are included (note these are also available on Docker Hub as nvidia/container-toolkit):
nvcr.io/nvidia/k8s/container-toolkit:v1.9.0-centos7
nvcr.io/nvidia/k8s/container-toolkit:v1.9.0-centos8
nvcr.io/nvidia/k8s/container-toolkit:v1.9.0-ubi8
nvcr.io/nvidia/k8s/container-toolkit:v1.9.0 and nvcr.io/nvidia/k8s/container-toolkit:v1.9.0-ubuntu18.04
The following packages have also been updated to depend on nvidia-container-toolkit of at least 1.9.0:
nvidia-container-runtime 3.9.0
nvidia-docker2 2.10.0
Fixes and Features
specific to libnvidia-container
Add additional check for Tegra in /sys/.../family file in CLI
Update jetpack-specific CLI option to only load Base CSV files by default
Fix bug (from v1.8.0) when mounting GSP firmware into containers without /lib to /usr/lib symlinks
Update nvml.h to CUDA 11.6.1 nvML_DEV 11.6.55
Update switch statement to include new brands from latest nvml.h
Process all --require flags on Jetson platforms
Fix long-standing issue with running ldconfig on Debian systems
specific to container-toolkit container images
Publish an nvcr.io/nvidia/k8s/container-toolkit:v1.9.0-ubuntu20.04 image based on nvidia/cuda:11.6.0-base-ubuntu20.04
The following images are available as multi-arch images including support for linux/amd64 and linux/arm64 platforms:
  nvcr.io/nvidia/k8s/container-toolkit:v1.9.0-centos8
  nvcr.io/nvidia/k8s/container-toolkit:v1.9.0-ubi8
  nvcr.io/nvidia/k8s/container-toolkit:v1.9.0-ubuntu18.04 (and nvcr.io/nvidia/k8s/container-toolkit:v1.9.0)
  nvcr.io/nvidia/k8s/container-toolkit:v1.9.0-ubuntu20.04
Known Issues
The container-toolkit:v1.9.0 images have been released with the following known HIGH Vulnerability CVEs. These are from the base images and are not in libraries used by the components included in the container image as part of the NVIDIA Container Toolkit:
nvcr.io/nvidia/k8s/container-toolkit:v1.9.0-centos7:
  expat - CVE-2022-25235
  expat - CVE-2022-25236
  expat - CVE-2022-25315
nvcr.io/nvidia/k8s/container-toolkit:v1.9.0-centos8:
  cyrus-sasl-lib - CVE-2022-24407
  openssl, openssl-libs - CVE-2022-0778
  expat - CVE-2022-25235
  expat - CVE-2022-25236
  expat - CVE-2022-25315
nvcr.io/nvidia/k8s/container-toolkit:v1.9.0-ubi8:
  openssl-libs - CVE-2022-0778
NVIDIA Container Toolkit 1.8.1
This version of the NVIDIA Container Toolkit is a bugfix release and fixes an issue with cgroup support found in NVIDIA Container Toolkit 1.8.0.
The following packages are included:
nvidia-container-toolkit 1.8.1
libnvidia-container-tools 1.8.1
libnvidia-container1 1.8.1
The following container-toolkit containers are included (note these are also available on Docker Hub as nvidia/container-toolkit):
nvcr.io/nvidia/k8s/container-toolkit:v1.8.1-centos7
nvcr.io/nvidia/k8s/container-toolkit:v1.8.1-centos8
nvcr.io/nvidia/k8s/container-toolkit:v1.8.1-ubi8
nvcr.io/nvidia/k8s/container-toolkit:v1.8.1 and nvcr.io/nvidia/k8s/container-toolkit:v1.8.1-ubuntu18.04
The following packages have also been updated to depend on nvidia-container-toolkit of at least 1.8.1:
nvidia-container-runtime 3.8.1
nvidia-docker2 2.9.1
Fixes and Features
specific to libnvidia-container
Fix bug in determining cgroup root when running in nested containers
Fix permission issue when determining cgroup version under certain conditions
NVIDIA Container Toolkit 1.8.0
This version of the NVIDIA Container Toolkit adds cgroupv2 support and removes packaging support for Amazon Linux 1.
The following packages are included:
nvidia-container-toolkit 1.8.0
libnvidia-container-tools 1.8.0
libnvidia-container1 1.8.0
The following container-toolkit containers are included (note these are also available on Docker Hub as nvidia/container-toolkit):
nvcr.io/nvidia/k8s/container-toolkit:v1.8.0-centos7
nvcr.io/nvidia/k8s/container-toolkit:v1.8.0-centos8
nvcr.io/nvidia/k8s/container-toolkit:v1.8.0-ubi8
nvcr.io/nvidia/k8s/container-toolkit:v1.8.0 and nvcr.io/nvidia/k8s/container-toolkit:v1.8.0-ubuntu18.04
The following packages have also been updated to depend on nvidia-container-toolkit of at least 1.8.0:
nvidia-container-runtime 3.8.0
nvidia-docker2 2.9.0
Packaging Changes
Packages for Amazon Linux 1 are no longer built or published
The container-toolkit container is built and released from the same repository as the NVIDIA Container Toolkit packages.
Fixes and Features
specific to libnvidia-container
Add cgroupv2 support
Fix a bug where the GSP firmware path was mounted with write permissions instead of read-only
Include the GSP firmware path (if present) in the output of the nvidia-container-cli list command
Add support for injecting PKS libraries into a container
NVIDIA Container Toolkit 1.7.0
This version of the NVIDIA Container Toolkit allows up to date packages to be installed on Jetson devices. The following packages are included:
nvidia-container-toolkit 1.7.0
libnvidia-container-tools 1.7.0
libnvidia-container1 1.7.0
The following packages have also been updated to depend on nvidia-container-toolkit of at least 1.7.0:
nvidia-container-runtime 3.7.0
nvidia-docker2 2.8.0
Packaging Changes
On Ubuntu arm64 distributions the libnvidia-container-tools package depends on both libnvidia-container0 and libnvidia-container1 to support Jetson devices
Fixes and Features
Add a supported-driver-capabilities config option to allow for a subset of all driver capabilities to be specified
Make the fixes from v1.6.0 that address an incompatibility with recent docker.io and containerd.io updates on Ubuntu installations (see NVIDIA/nvidia-container-runtime#157) available on Jetson devices.
specific to libnvidia-container
Filter command line options based on libnvidia-container library version
Include libnvidia-container version in CLI version output
Allow for nvidia-container-cli to load libnvidia-container.so.0 dynamically on Jetson platforms
NVIDIA Container Toolkit 1.6.0
This version of the NVIDIA Container Toolkit moves to unify the packaging of the components of the NVIDIA container stack. The following packages are included:
nvidia-container-toolkit 1.6.0
libnvidia-container-tools 1.6.0
libnvidia-container1 1.6.0
The following packages have also been updated to depend on nvidia-container-toolkit of at least 1.6.0:
nvidia-container-runtime 3.6.0
nvidia-docker2 2.7.0
Note
All the above packages are published to the libnvidia-container repository.
Note
As of version 2.7.0 the nvidia-docker2 package depends directly on nvidia-container-toolkit.
This means that the nvidia-container-runtime package is no longer required and may be uninstalled as part of the upgrade process.
Packaging Changes
The nvidia-container-toolkit package now provides the nvidia-container-runtime executable
The nvidia-docker2 package now depends directly on the nvidia-container-toolkit
The nvidia-container-runtime package is now an architecture-independent meta-package serving only to define a dependency on the nvidia-container-toolkit for workflows that require this
Added packages for Amazon Linux 2 on AARCH64 platforms for all components
Fixes and Features
Move OCI and command line checks for the NVIDIA Container Runtime to an internal Go package (oci)
Update OCI runtime specification dependency to opencontainers/runtime-spec@a3c33d6 to fix compatibility with docker when overriding clone3 syscall return value [fixes NVIDIA/nvidia-container-runtime#157]
Use relative path to OCI specification file (config.json) if bundle path is not specified as an argument to the nvidia-container-runtime
specific to libnvidia-container
Bump nvidia-modprobe dependency to 495.44 in the NVIDIA Container Library to allow for non-root monitoring of MIG devices
Fix bug that led to unexpected mount error when /proc/driver/nvidia does not exist on the host
Known Issues
Dependency errors when installing older versions of nvidia-container-runtime on Debian-based systems
With the release of the 1.6.0 and 3.6.0 versions of the nvidia-container-toolkit and
nvidia-container-runtime packages, respectively, some files were reorganized and the package
dependencies updated accordingly. (See case 10 in the Debian Package Transition documentation).
Due to these new constraints a package manager may not correctly resolve the required version of nvidia-container-toolkit when
pinning to versions of the nvidia-container-runtime prior to 3.6.0.
This means that if a command such as:
sudo apt-get install nvidia-container-runtime=3.5.0-1
is used to install a specific version of the nvidia-container-runtime package, this may fail with the following error message:
Some packages could not be installed. This may mean that you have
requested an impossible situation or if you are using the unstable
distribution that some required packages have not yet been created
or been moved out of Incoming.
The following information may help to resolve the situation:
The following packages have unmet dependencies:
nvidia-container-runtime : Depends: nvidia-container-toolkit (>= 1.5.0) but it is not going to be installed
Depends: nvidia-container-toolkit (< 2.0.0) but it is not going to be installed
E: Unable to correct problems, you have held broken packages.
To address this, the version of the nvidia-container-toolkit package should be specified explicitly to be at most 1.5.1:
sudo apt-get install \
nvidia-container-runtime=3.5.0-1 \
nvidia-container-toolkit=1.5.1-1
In general, it is suggested that all components of the NVIDIA container stack be pinned to their required versions.
For the nvidia-container-runtime 3.5.0 these are:
nvidia-container-toolkit 1.5.1
libnvidia-container-tools 1.5.1
libnvidia-container1 1.5.1
To pin all the package versions above, run:
sudo apt-get install \
nvidia-container-runtime=3.5.0-1 \
nvidia-container-toolkit=1.5.1-1 \
libnvidia-container-tools=1.5.1-1 \
libnvidia-container1=1.5.1-1
Toolkit Container 1.7.0
Known issues
The container-toolkit:1.7.0-ubuntu18.04 image contains CVE-2021-3711. This CVE affects libssl1.1 and openssl included in the Ubuntu-based CUDA 11.4.1 base image. The components of the NVIDIA Container Toolkit included in the container do not use libssl1.1 or openssl, and as such this is considered low risk if the container is used as intended; that is, to install and configure the NVIDIA Container Toolkit in the context of the NVIDIA GPU Operator.