Production Branch (PB)#

A Production Branch (PB) contains production-ready branches of AI frameworks and SDKs. It provides API stability and a secure environment for building mission-critical AI applications.

Learn more about NVIDIA AI Enterprise release branches in the NVIDIA AI Enterprise Release Branches document.

Production Branch - October 2025 (PB 25h2)#

| Item | Details |
| --- | --- |
| PB Collection on NGC | Production Branch - October 2025 (PB 25h2) |
| Government Ready PB Collection | Production Branch Government Ready - October 2025 (PB 25h2) |
| First planned release [3] | October 2025 |
| Last planned release | June 2026 |
| Planned End of Life (EOL) | July 2026 |
| Government Ready Versions | STIG hardened, FIPS enabled versions are available for some products. Refer to the table below. |
| Compatible Infrastructure Release | Use the latest NVIDIA AI Enterprise Infra Release 7 on the NGC Catalog. |
| Support Matrix | NVIDIA AI Enterprise Infrastructure Support Matrix 7 [4] |

Products Included in Production Branch - October 2025 (PB 25h2)#

| Name | Version | Government Ready Versions | Product Documentation |
| --- | --- | --- | --- |
| CUDA Deep Learning | 25.08 | Yes for x86 | Release 25.08 |
| Multi-LLM NIM | 1.14 | Yes for x86 | Release 1.14 |
| NVIDIA Holoscan SDK | 3.3 | No | Release 3.3 |
| NVIDIA TensorRT | 25.08-py | Yes for x86 | TensorRT Release 25.08 |
| NVIDIA Triton Inference Server | 25.08-py3, 25.08-py3-sdk, 25.08-py3-vllm, 25.08-py3-trtllm | Yes for x86 | Triton Inference Server 25.08 |
| NVIDIA TAO | 6.0-python, 6.0-deploy, 6.0-data-services | Yes for x86 | TAO 6.0 |
| PyTorch | 25.08-py | Yes for x86 | PyTorch Release 25.08 |

New Features - Government Ready

This release introduces a significant new baseline for security, Government Ready, for most x86 container images. This designation indicates that the software:

  • Meets software security requirements for use within FedRAMP High or equivalent sovereign use cases.

  • Matches the functionality of NVIDIA software that does not carry the Government Ready designation.

Technical Implementation

Security Technical Implementation Guides (STIGs) are configuration standards consisting of cybersecurity requirements for specific products developed by the U.S. Department of Defense. STIGs provide a methodology for standardized secure installation and maintenance of DOD IA and IA-enabled devices and systems, helping organizations harden their systems against security vulnerabilities through detailed technical configuration guidance.

FIPS 140-3 is the U.S. government computer security standard used to approve cryptographic modules. It superseded FIPS 140-2 for new submissions as of April 1, 2022. Modules are validated against the standard through the Cryptographic Module Validation Program (CMVP), whose goal is to promote the use of validated cryptographic modules and to give Federal agencies a security metric to use when procuring equipment that contains them. These standards ensure that cryptographic implementations meet rigorous security requirements for government and regulated environments.

Our containers have been built on top of Canonical’s Ubuntu 24.04 STIG hardened base image, and they include FIPS versions of common cryptography libraries, like OpenSSL. These containers can be deployed the same as normal containers. To make use of FIPS mode, your host machine must have a FIPS-enabled Linux kernel.

If you run into problems integrating your application with FIPS-enabled libraries, check each library's documentation to see whether FIPS mode can be toggled. For example, with OpenSSL you can set the environment variable OPENSSL_FORCE_FIPS_MODE=0 to disable FIPS mode if needed for testing.
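As a rough illustration of the testing workaround described above, the following Python sketch starts a command with OPENSSL_FORCE_FIPS_MODE=0 set only for that child process. The helper names env_without_fips and run_for_testing are hypothetical and not part of any NVIDIA tooling, and whether the variable is honored depends on the OpenSSL build in your image, so treat this as a sketch rather than a supported procedure.

```python
import os
import subprocess


def env_without_fips(base_env=None):
    """Return a copy of the environment with OpenSSL FIPS mode forced off.

    Hypothetical helper: it only sets the variable named in the text above
    and leaves every other variable unchanged.
    """
    env = dict(os.environ if base_env is None else base_env)
    env["OPENSSL_FORCE_FIPS_MODE"] = "0"
    return env


def run_for_testing(cmd):
    """Run cmd (a list of arguments) with the override applied to the child only."""
    # check=False: the caller inspects returncode instead of catching an exception.
    return subprocess.run(cmd, env=env_without_fips(), check=False)
```

For instance, run_for_testing(["python3", "my_app.py"]) would launch a hypothetical my_app.py with the override in place while leaving the parent shell's environment untouched. Remove the override once testing is complete.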

Verifying FIPS Mode on Your Host System

To verify that your host machine is running in FIPS mode, check the /proc/sys/crypto/fips_enabled file and ensure it is set to 1. If it is set to 0, the FIPS modules will not run in FIPS mode. If the file is missing, the FIPS kernel is not installed. You can verify this with the shell command:

cat /proc/sys/crypto/fips_enabled

Additionally, you can check your kernel version using uname -a to confirm you’re running a FIPS-enabled kernel. Refer to Canonical’s FIPS documentation as an example of setting up a FIPS kernel. Any Linux distribution with a FIPS-enabled kernel should provide similar verification methods through the /proc/sys/crypto/fips_enabled flag.
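The three outcomes described above (flag set to 1, flag set to 0, file missing) can be checked in a script. The following is a minimal Python sketch; the function name fips_status and its return strings are assumptions made for illustration, and only the /proc/sys/crypto/fips_enabled path comes from the text above.

```python
from pathlib import Path

# Kernel flag file named in the documentation above.
FIPS_FLAG = Path("/proc/sys/crypto/fips_enabled")


def fips_status(flag_path=FIPS_FLAG):
    """Classify the host's FIPS state from the kernel flag file.

    Returns "enabled" when the file contains 1, "disabled" for any other
    content, and "missing" when the file does not exist (no FIPS kernel).
    """
    try:
        value = Path(flag_path).read_text().strip()
    except FileNotFoundError:
        return "missing"
    return "enabled" if value == "1" else "disabled"


if __name__ == "__main__":
    print(f"FIPS status: {fips_status()}")
```

A deployment script could call fips_status() before starting a Government Ready container and stop with a clear message when the result is anything other than "enabled".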

Learn more about NVIDIA’s hardened image in the AI Software for Regulated Environments.

Production Branch - May 2025 (PB 25h1)#

| Item | Details |
| --- | --- |
| PB Collection on NGC | Production Branch - May 2025 (PB 25h1) |
| First planned release | May 2025 |
| Last planned release | December 2025 |
| Planned End of Life (EOL) | January 2026 |
| Compatible Infrastructure Release | Use the latest NVIDIA AI Enterprise Infra Release 6 on the NGC Catalog. |
| Support Matrix | NVIDIA AI Enterprise Infrastructure Support Matrix 6 |

Products Included in Production Branch - May 2025 (PB 25h1)#

| Name | Version | Product Documentation |
| --- | --- | --- |
| NVIDIA NIM Llama-3.1-8b-instruct [2] | 1.10 | Release 1.10 |
| NVIDIA NIM Llama-3.1-70b-instruct [2] | 1.10 | Release 1.10 |
| NVIDIA DeepStream SDK [2] | N/A | DEEPSTREAM SDK |
| NVIDIA Holoscan SDK [2] | 3.3.0 | Holoscan SDK v3.3 |
| NVIDIA Morpheus | 25.02-runtime | NVIDIA Morpheus (25.02) |
| NVIDIA TensorRT | 25.03-py | TensorRT Release 25.03 |
| NVIDIA Triton Inference Server | 25.03-py3, 25.03-py3-sdk, 25.03-py3-min, 25.03-py3-vllm, 25.03-py3-trtllm | Triton Inference Server 25.03 |
| NVIDIA NIM Retrieval QA E5 Embedding v5 [2] | 1.8.0 | Release 1.8.0 |
| PyTorch | 25.03-py | PyTorch Release 25.03 |
| RAPIDS | 25.02-runtime | RAPIDS Documentation |
| RAPIDS Accelerator for Apache Spark | 25.02.1 | Release v25.02.1 |

Production Branch - October 2024 (PB 24h2) - EOL#

Important

This branch is end-of-life (EOL).

| Item | Details |
| --- | --- |
| PB Collection on NGC | Production Branch - October 2024 (PB 24h2) |
| First planned release | October 2024 |
| Last planned release | June 2025 |
| Planned End of Life (EOL) | July 2025 |
| Compatible Infrastructure Release | Use the latest NVIDIA AI Enterprise Infra Release 6 or NVIDIA AI Enterprise Infra Release 5 on the NGC Catalog. |
| Support Matrix | |

Products Included in Production Branch - October 2024 (PB 24h2)#

| Name | Version | Product Documentation |
| --- | --- | --- |
| Deep Graph Library (DGL) | 24.08-py3 | DGL Release 24.08 |
| NVIDIA NIM Llama-3.1-8b-instruct [1] | 1.3 | Release 1.3.0 |
| NVIDIA NIM Llama-3.1-70b-instruct [1] | 1.3 | Release 1.3.0 |
| NVIDIA DeepStream SDK | 7.1-triton-x86 | DEEPSTREAM SDK 7.1 FOR NVIDIA DGPU/X86 (NVAIE) |
| NVIDIA Holoscan SDK | 24.08 | Holoscan SDK v2.6 |
| NVIDIA Morpheus | 24.06-runtime | NVIDIA Morpheus (24.06) |
| NVIDIA TensorRT | 24.08-py3 | TensorRT Release 24.08 |
| NVIDIA Triton Inference Server | 24.08-py3, 24.08-py3-sdk, 24.08-py3-min | Triton Inference Server Release 24.08 |
| NVIDIA NIM Retrieval QA E5 Embedding v5 [1] | 1.2 | Release 1.2.0 |
| PyTorch | 24.08-py3 | PyTorch Release 24.08 |
| PyTorch Geometric (PyG) | 24.08-py3 | PyG Release 24.08 |
| RAPIDS | 24.06-runtime | RAPIDS Documentation |
| RAPIDS Accelerator for Apache Spark | 24.06.02 | Release v24.06.1 |
| TensorFlow 2 | 24.08-tf2-py3 | TensorFlow Release 24.08 |

Production Branch - May 2024 (PB 24h1) - EOL#

Important

This branch is end-of-life (EOL).

| Item | Details |
| --- | --- |
| PB Collection on NGC | Production Branch - May 2024 (PB 24h1) |
| First planned release | May 2024 |
| Last planned release | December 2024 |
| Planned End of Life (EOL) | January 2025 |
| Compatible Infrastructure Release | Use the latest NVIDIA AI Enterprise Infra Release 5 as a supported configuration. |
| Support Matrix | NVIDIA AI Enterprise Infrastructure Support Matrix 5 |

Products Included in Production Branch - May 2024 (PB 24h1)#

| Name | Version | Product Documentation |
| --- | --- | --- |
| NVIDIA MONAI Toolkit | 24.03-py3 | Project MONAI |
| NVIDIA Morpheus | 24.02-runtime | Morpheus Documentation |
| NVIDIA TensorRT | 24.03-py3 | TensorRT Release 24.03 |
| NVIDIA Triton Inference Server | 24.03-py3, 24.03-py3-sdk, 24.03-py3-min | Triton Inference Server Release 24.03 |
| PyTorch | 24.03-py3 | PyTorch Release 24.03 |
| RAPIDS | 24.02-runtime | RAPIDS Documentation |
| RAPIDS Accelerator for Apache Spark | 24.02 | Release v24.02.0 |
| TensorFlow 2 | 24.03-tf2-py3 | TensorFlow Release 24.03 |

Production Branch - October 2023 (PB 23h2) - EOL#

Important

This branch is end-of-life (EOL).

| Item | Details |
| --- | --- |
| PB Collection on NGC | Production Branch - October 2023 (PB 23h2) |
| First planned release | October 2023 |
| Last planned release | June 2024 |
| Planned End of Life (EOL) | July 2024 |
| Compatible Infrastructure Release | Use the latest NVIDIA AI Enterprise Infra Release 5 as a supported configuration. |

Products Included in Production Branch - October 2023 (PB 23h2)#

| Name | Version | Product Documentation |
| --- | --- | --- |
| NVIDIA Holoscan SDK | 23.10 | Holoscan SDK v1.0.3 |
| NVIDIA TensorRT | 23.08-py3 | TensorRT Release 23.08 |
| NVIDIA Triton Inference Server | 23.08-py3, 23.08-py3-sdk, 23.08-py3-min, 23.08-pyt-python-py3, 23.08-tf2-python-py3 | Triton Inference Server Release 23.08 |
| PyTorch | 23.08-py3 | PyTorch Release 23.08 |
| RAPIDS | 23.06-runtime | RAPIDS Documentation |
| TensorFlow 2 | 23.08-tf2-py3 | TensorFlow Release 23.08 |

Footnotes