Production Branch (PB)

A Production Branch (PB) contains production-ready AI frameworks and SDK branches to provide API stability and a secure environment for building mission-critical AI applications.

Learn more about NVIDIA AI Enterprise release branches in the NVIDIA AI Enterprise Release Branches document.

Production Branch - October 2025 (PB 25h2)

PB Collection on NGC	Production Branch - October 2025 (PB 25h2)
Government Ready PB Collection	Production Branch Government Ready - October 2025 (PB 25h2)
First planned release [3]	October 2025
Last planned release	June 2026
Planned End of Life (EOL)	July 2026
Government Ready Versions	STIG-hardened, FIPS-enabled versions are available for some products. Refer to the table below.
Compatible Infrastructure Release	Use the latest NVIDIA AI Enterprise Infra Release 7 on the NGC Catalog.
Support Matrix	NVIDIA AI Enterprise Infrastructure Support Matrix 7 [4]

Products Included in Production Branch - October 2025 (PB 25h2)
Name	Version	Government Ready Versions	Product Documentation
CUDA Deep Learning	25.08	Yes for x86	Release 25.08
Multi-LLM NIM	1.14	Yes for x86	Release 1.14
NVIDIA Holoscan SDK	3.3	No	Release 3.3
NVIDIA TensorRT	25.08-py	Yes for x86	TensorRT Release 25.08
NVIDIA Triton Inference Server	25.08-py3, 25.08-py3-sdk, 25.08-py3-vllm, 25.08-py3-trtllm	Yes for x86	Triton Inference Server 25.08
NVIDIA TAO	6.0-python, 6.0-deploy, 6.0-data-services	Yes for x86	TAO 6.0
PyTorch	25.08-py	Yes for x86	PyTorch Release 25.08

New Features - Government Ready

This release introduces Government Ready, a new security baseline for most x86 container images. This designation indicates that the software:

Meets software security requirements for use in FedRAMP High or equivalent sovereign use cases.
Matches the functionality of NVIDIA software without the Government Ready designation.

Technical Implementation

Security Technical Implementation Guides (STIGs) are configuration standards, developed by the U.S. Department of Defense (DoD), that define cybersecurity requirements for specific products. STIGs provide a standardized methodology for securely installing and maintaining DoD information assurance (IA) and IA-enabled devices and systems, and their detailed technical configuration guidance helps organizations harden systems against security vulnerabilities.

FIPS 140-3 is the U.S. government computer security standard used to approve cryptographic modules; it superseded FIPS 140-2 for new submissions as of April 1, 2022. Modules are validated against the standard by the Cryptographic Module Validation Program (CMVP), whose goal is to promote the use of validated cryptographic modules and to give federal agencies a security metric to use when procuring equipment that contains them. These standards ensure that cryptographic implementations meet rigorous security requirements for government and regulated environments.

Our containers are built on top of Canonical’s Ubuntu 24.04 STIG-hardened base image and include FIPS versions of common cryptography libraries, such as OpenSSL. You can deploy them the same way as standard containers. To use FIPS mode, your host machine must run a FIPS-enabled Linux kernel.

If you run into problems integrating your application with FIPS-enabled libraries, check each library's documentation to see whether FIPS mode can be toggled. For example, with OpenSSL you can set OPENSSL_FORCE_FIPS_MODE=0 to disable FIPS mode for testing.
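
As a minimal sketch of the OpenSSL toggle described above, the wrapper below sets OPENSSL_FORCE_FIPS_MODE=0 for a single command only, so the rest of the shell session keeps its normal settings. The function name run_with_openssl_fips_disabled is hypothetical, and whether the variable is honored depends on the OpenSSL build inside the container. Treat it as a testing aid, not a production configuration.

```shell
# Hypothetical helper: run one command with OpenSSL FIPS mode forced off.
# "env" places the variable only in the environment of the launched command,
# so the calling shell is left unchanged.
run_with_openssl_fips_disabled() {
    env OPENSSL_FORCE_FIPS_MODE=0 "$@"
}

# Example usage (the application name is a placeholder):
#   run_with_openssl_fips_disabled python3 my_app.py
```

If you prefer to set the variable when the container starts, most container runtimes accept an environment option at launch, for example docker run -e OPENSSL_FORCE_FIPS_MODE=0 followed by your image name.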

Verifying FIPS Mode on Your Host System

To verify that your host machine is running in FIPS mode, check the /proc/sys/crypto/fips_enabled file and ensure it is set to 1. If it is set to 0, the FIPS modules will not run in FIPS mode. If the file is missing, the FIPS kernel is not installed. You can verify this with the shell command:

cat /proc/sys/crypto/fips_enabled

Additionally, you can check your kernel version with uname -a to confirm that you’re running a FIPS-enabled kernel. Refer to Canonical’s FIPS documentation for an example of setting up a FIPS kernel. Any Linux distribution with a FIPS-enabled kernel should provide a similar verification method through the /proc/sys/crypto/fips_enabled flag.
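
The three cases described above (flag set to 1, flag set to 0, file missing) can be sketched as a small shell function. The function name fips_status, the status words it prints, and the optional path argument are illustrative assumptions and are not part of any NVIDIA or Canonical tooling.

```shell
# Print the host FIPS state based on the kernel flag file.
# An optional first argument overrides the path, which is handy for testing.
fips_status() {
    flag_file="${1:-/proc/sys/crypto/fips_enabled}"
    if [ ! -r "$flag_file" ]; then
        # No readable flag file: the FIPS kernel is not installed.
        echo "missing"
        return 0
    fi
    value=$(cat "$flag_file")
    case "$value" in
        1) echo "enabled" ;;    # kernel is running in FIPS mode
        *) echo "disabled" ;;   # flag present but FIPS mode is off
    esac
}

# Report the state of the current host.
fips_status
```

A result of enabled means FIPS-enabled libraries in the container can run in FIPS mode. Either of the other two results means the host kernel needs attention first.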

Learn more about NVIDIA’s hardened image in AI Software for Regulated Environments.

Production Branch - May 2025 (PB 25h1)

PB Collection on NGC	Production Branch - May 2025 (PB 25h1)
First planned release	May 2025
Last planned release	December 2025
Planned End of Life (EOL)	January 2026
Compatible Infrastructure Release	Use the latest NVIDIA AI Enterprise Infra Release 6 on the NGC Catalog.
Support Matrix	NVIDIA AI Enterprise Infrastructure Support Matrix 6

Products Included in Production Branch - May 2025 (PB 25h1)
Name	Version	Product Documentation
NVIDIA NIM Llama-3.1-8b-instruct [2]	1.10	Release 1.10
NVIDIA NIM Llama-3.1-70b-instruct [2]	1.10	Release 1.10
NVIDIA DeepStream SDK [2]	N/A	DEEPSTREAM SDK
NVIDIA Holoscan SDK [2]	3.3.0	Holoscan SDK v3.3
NVIDIA Morpheus	25.02-runtime	NVIDIA Morpheus (25.02)
NVIDIA TensorRT	25.03-py	TensorRT Release 25.03
NVIDIA Triton Inference Server	25.03-py3, 25.03-py3-sdk, 25.03-py3-min, 25.03-py3-vllm, 25.03-py3-trtllm	Triton Inference Server 25.03
NVIDIA NIM Retrieval QA E5 Embedding v5 [2]	1.8.0	Release 1.8.0
PyTorch	25.03-py	PyTorch Release 25.03
RAPIDS	25.02-runtime	RAPIDS Documentation
RAPIDS Accelerator for Apache Spark	25.02.1	Release v25.02.1

Production Branch - October 2024 (PB 24h2) - EOL

Important

This branch is end-of-life (EOL).

Production Branch - October 2024 (PB 24h2)
PB Collection on NGC	Production Branch - October 2024 (PB 24h2)
First planned release	October 2024
Last planned release	June 2025
Planned End of Life (EOL)	July 2025
Compatible Infrastructure Release	Use the latest NVIDIA AI Enterprise Infra Release 6 or NVIDIA AI Enterprise Infra Release 5 on the NGC Catalog.
Support Matrix	NVIDIA AI Enterprise Infrastructure Support Matrix 6, NVIDIA AI Enterprise Infrastructure Support Matrix 5

Products Included in Production Branch - October 2024 (PB 24h2)
Name	Version	Product Documentation
Deep Graph Library (DGL)	24.08-py3	DGL Release 24.08
NVIDIA NIM Llama-3.1-8b-instruct [1]	1.3	Release 1.3.0
NVIDIA NIM Llama-3.1-70b-instruct [1]	1.3	Release 1.3.0
NVIDIA DeepStream SDK	7.1-triton-x86	DEEPSTREAM SDK 7.1 FOR NVIDIA DGPU/X86 (NVAIE)
NVIDIA Holoscan SDK	24.08	Holoscan SDK v2.6
NVIDIA Morpheus	24.06-runtime	NVIDIA Morpheus (24.06)
NVIDIA TensorRT	24.08-py3	TensorRT Release 24.08
NVIDIA Triton Inference Server	24.08-py3, 24.08-py3-sdk, 24.08-py3-min	Triton Inference Server Release 24.08
NVIDIA NIM Retrieval QA E5 Embedding v5 [1]	1.2	Release 1.2.0
PyTorch	24.08-py3	PyTorch Release 24.08
PyTorch Geometric (PyG)	24.08-py3	PyG Release 24.08
RAPIDS	24.06-runtime	RAPIDS Documentation
RAPIDS Accelerator for Apache Spark	24.06.02	Release v24.06.1
TensorFlow 2	24.08-tf2-py3	TensorFlow Release 24.08

Production Branch - May 2024 (PB 24h1) - EOL

Important

This branch is end-of-life (EOL).

Production Branch - May 2024 (PB 24h1)
PB Collection on NGC	Production Branch - May 2024 (PB 24h1)
First planned release	May 2024
Last planned release	December 2024
Planned End of Life (EOL)	January 2025
Compatible Infrastructure Release	Use the latest NVIDIA AI Enterprise Infra Release 5 as a supported configuration.
Support Matrix	NVIDIA AI Enterprise Infrastructure Support Matrix 5

Products Included in Production Branch - May 2024 (PB 24h1)
Name	Version	Product Documentation
NVIDIA MONAI Toolkit	24.03-py3	Project MONAI
NVIDIA Morpheus	24.02-runtime	Morpheus Documentation
NVIDIA TensorRT	24.03-py3	TensorRT Release 24.03
NVIDIA Triton Inference Server	24.03-py3, 24.03-py3-sdk, 24.03-py3-min	Triton Inference Server Release 24.03
PyTorch	24.03-py3	PyTorch Release 24.03
RAPIDS	24.02-runtime	RAPIDS Documentation
RAPIDS Accelerator for Apache Spark	24.02	Release v24.02.0
TensorFlow 2	24.03-tf2-py3	TensorFlow Release 24.03

Production Branch - October 2023 (PB 23h2) - EOL

Important

This branch is end-of-life (EOL).

Production Branch - October 2023 (PB 23h2)
PB Collection on NGC	Production Branch - October 2023 (PB 23h2)
First planned release	October 2023
Last planned release	June 2024
Planned End of Life (EOL)	July 2024
Compatible Infrastructure Release	Use the latest NVIDIA AI Enterprise Infra Release 5 as a supported configuration.

Products Included in Production Branch - October 2023 (PB 23h2)
Name	Version	Product Documentation
NVIDIA Holoscan SDK	23.10	Holoscan SDK v1.0.3
NVIDIA TensorRT	23.08-py3	TensorRT Release 23.08
NVIDIA Triton Inference Server	23.08-py3, 23.08-py3-sdk, 23.08-py3-min, 23.08-pyt-python-py3, 23.08-tf2-python-py3	Triton Inference Server Release 23.08
PyTorch	23.08-py3	PyTorch Release 23.08
RAPIDS	23.06-runtime	RAPIDS Documentation
TensorFlow 2	23.08-tf2-py3	TensorFlow Release 23.08

Footnotes