Production Branch (PB)

A Production Branch (PB) contains production-ready AI frameworks and SDK branches to provide API stability and a secure environment for building mission-critical AI applications.

For lifecycle policy and support timelines, refer to the Production Branch (PB) in the Lifecycle Policy. Learn more about NVIDIA AI Enterprise release branches in the NVIDIA AI Enterprise Release Branches document.

Production Branch - October 2025 (PB 25h2)

The following table provides release information and support timelines for Production Branch - October 2025 (PB 25h2):

| Item | Details |
| --- | --- |
| PB Collection on NGC | Production Branch - October 2025 (PB 25h2) |
| Government Ready PB Collection | Production Branch Government Ready - October 2025 (PB 25h2) |
| First planned release [3] | October 2025 |
| Last planned release | June 2026 |
| Planned End of Life (EOL) | July 2026 |
| Government Ready Versions | STIG-hardened, FIPS-enabled versions are available for some products. Refer to the table below. |
| Compatible Infrastructure Release | Use the latest NVIDIA AI Enterprise Infra Release 7 on the NGC Catalog. |
| Support Matrix | NVIDIA AI Enterprise Infrastructure Support Matrix 7 [4] |
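The support timelines on this page can be turned into a simple status check. The following is a minimal sketch, not NVIDIA's lifecycle tooling: the dates are copied from the tables on this page, each month is pinned to its first day (an assumption, since only month and year are published), and the function name and status labels are hypothetical.

```python
from datetime import date

# (first planned release, last planned release, planned EOL) per branch,
# copied from the tables on this page. Pinning each month to day 1 is an
# assumption; the page publishes only month and year.
BRANCHES = {
    "PB 25h2": (date(2025, 10, 1), date(2026, 6, 1), date(2026, 7, 1)),
    "PB 25h1": (date(2025, 5, 1), date(2025, 12, 1), date(2026, 1, 1)),
    "PB 24h2": (date(2024, 10, 1), date(2025, 6, 1), date(2025, 7, 1)),
    "PB 24h1": (date(2024, 5, 1), date(2024, 12, 1), date(2025, 1, 1)),
    "PB 23h2": (date(2023, 10, 1), date(2024, 6, 1), date(2024, 7, 1)),
}


def branch_status(branch: str, today: date) -> str:
    """Classify a branch with hypothetical labels:
    'planned', 'active', 'final-release', or 'eol'."""
    first, last, eol = BRANCHES[branch]
    if today < first:
        return "planned"
    if today >= eol:
        return "eol"
    if today >= last:
        return "final-release"
    return "active"
```

For example, `branch_status("PB 24h2", date(2025, 8, 1))` returns `"eol"`, which matches the July 2025 end-of-life stated for that branch.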

The following table lists all products included in Production Branch - October 2025 (PB 25h2) with their versions, Government Ready availability, and product documentation links:

Products Included in Production Branch - October 2025 (PB 25h2)

| Name | Version | Government Ready Versions | Product Documentation |
| --- | --- | --- | --- |
| CUDA Deep Learning | 25.08 | Yes for x86 | Release 25.08 |
| Multi-LLM NIM | 1.14 | Yes for x86 | Release 1.14.0 |
| NVIDIA NIM Llama-3.1-70b-instruct | 1.14 | Yes for x86 | Release 1.14.0-pb5.1 |
| NVIDIA NIM Llama-3.3-nemotron-super-49b-v1.5 [5] | 1.14 | Yes for x86 | Release 1.14.0-pb5.1 |
| NVIDIA Holoscan SDK | 3.3 | No | Release 3.3.0 |
| NVIDIA Retrieval QA Llama 3.2 1B Embedding v2 [5] | 1.11.0 | Yes for x86 | Release 1.11.0 (PB) |
| NVIDIA Retrieval QA Llama 3.2 1B Reranking v2 [5] | 1.9.0 | Yes for x86 | Release 1.9.0 (PB) |
| NVIDIA TensorRT | 25.08-py | Yes for x86 | Release 25.08 |
| NVIDIA Triton Inference Server | 25.08-py3, 25.08-py3-sdk, 25.08-py3-vllm, 25.08-py3-trtllm | Yes for x86 | Release 25.08 |
| NVIDIA TAO | 6.0-python, 6.0-deploy, 6.0-data-services | Yes for x86 | Release 6.0 |
| PyTorch | 25.08-py | Yes for x86 | Release 25.08 |
| RAPIDS [5] | 25.08.02-runtime | Yes for x86 | RAPIDS Documentation |
| RAPIDS Accelerator for Apache Spark [5] | v25.10 | No | Release v25.10.0 |
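Several version strings in the table above combine a release number with a variant suffix, for example `25.08-py3-vllm` or `6.0-data-services`. The sketch below shows one way to separate the two parts when scripting against these values. It assumes the release is everything before the first hyphen, which holds for the entries on this page but is not a documented naming rule; `ImageTag` and `split_tag` are hypothetical names.

```python
from typing import NamedTuple, Optional


class ImageTag(NamedTuple):
    release: str            # e.g. "25.08"
    variant: Optional[str]  # e.g. "py3-vllm", or None when there is no suffix


def split_tag(tag: str) -> ImageTag:
    """Split a version string at its first hyphen into release and variant.

    Assumption: the release number never contains a hyphen.
    """
    release, _, variant = tag.strip().partition("-")
    return ImageTag(release, variant or None)
```

With this helper, the four Triton Inference Server entries all share the release `25.08` and differ only in their variant (`py3`, `py3-sdk`, `py3-vllm`, `py3-trtllm`).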

New Features - Government Ready

This release introduces Government Ready, a new security baseline available for most x86 container images. Government Ready containers meet the software security requirements for use within FedRAMP High or equivalent Sovereign use cases while matching the functionality of standard NVIDIA software.

Government Ready containers are built on Canonical’s Ubuntu 24.04 STIG-hardened base image and include FIPS-enabled versions of common cryptography libraries. They are deployed the same way as standard containers. To use FIPS mode, the host machine must run a FIPS-enabled Linux kernel.

For detailed information about Government Ready containers, including technical implementation details, STIG and FIPS 140-3 standards, deployment instructions, and verification procedures, refer to Government Ready Containers.
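Because FIPS mode depends on the host kernel, it can be useful to check the host before deploying. The following is a minimal sketch, not the verification procedure from the Government Ready Containers documentation. It reads `/proc/sys/crypto/fips_enabled`, the standard Linux kernel flag that contains `1` when FIPS mode is on; the function name is hypothetical.

```python
from pathlib import Path


def host_fips_enabled(flag_path: str = "/proc/sys/crypto/fips_enabled") -> bool:
    """Return True when the Linux kernel reports FIPS mode as enabled.

    The kernel writes "1" to this file when FIPS mode is on. A missing or
    unreadable file is treated as "not enabled".
    """
    try:
        return Path(flag_path).read_text().strip() == "1"
    except OSError:
        return False


if __name__ == "__main__":
    print("FIPS mode enabled on host:", host_fips_enabled())
```

The path is a parameter only so the check can be exercised against a test file; on a real host the default applies.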

Production Branch - May 2025 (PB 25h1)

The following table provides release information and support timelines for Production Branch - May 2025 (PB 25h1):

| Item | Details |
| --- | --- |
| PB Collection on NGC | Production Branch - May 2025 (PB 25h1) |
| First planned release | May 2025 |
| Last planned release | December 2025 |
| Planned End of Life (EOL) | January 2026 |
| Compatible Infrastructure Release | Use the latest NVIDIA AI Enterprise Infra Release 6 on the NGC Catalog. |
| Support Matrix | NVIDIA AI Enterprise Infrastructure Support Matrix 6 |

The following table lists all products included in Production Branch - May 2025 (PB 25h1) with their versions and product documentation links:

Products Included in Production Branch - May 2025 (PB 25h1)

| Name | Version | Product Documentation |
| --- | --- | --- |
| NVIDIA NIM Llama-3.1-8b-instruct [2] | 1.10 | Release 1.10 |
| NVIDIA NIM Llama-3.1-70b-instruct [2] | 1.10 | Release 1.10 |
| NVIDIA DeepStream SDK [2] | N/A | DEEPSTREAM SDK |
| NVIDIA Holoscan SDK [2] | 3.3.0 | Release 3.3.0 |
| NVIDIA Morpheus | 25.02-runtime | Release 25.02 |
| NVIDIA TensorRT | 25.03-py | Release 25.03 |
| NVIDIA Triton Inference Server | 25.03-py3, 25.03-py3-sdk, 25.03-py3-min, 25.03-py3-vllm, 25.03-py3-trtllm | Release 25.03 |
| NVIDIA NIM Retrieval QA E5 Embedding v5 [2] | 1.8.0 | Release 1.8.0 |
| PyTorch | 25.03-py | Release 25.03 |
| RAPIDS | 25.02-runtime | RAPIDS Documentation |
| RAPIDS Accelerator for Apache Spark | 25.02.1 | Release v25.02.1 |

Production Branch - October 2024 (PB 24h2) - EOL

Important

This branch reached end-of-life in July 2025. No further updates will be provided.

The following table provides archived release information for Production Branch - October 2024 (PB 24h2):

| Item | Details |
| --- | --- |
| PB Collection on NGC | Production Branch - October 2024 (PB 24h2) |
| First planned release | October 2024 |
| Last planned release | June 2025 |
| Planned End of Life (EOL) | July 2025 |
| Compatible Infrastructure Release | Use the latest NVIDIA AI Enterprise Infra Release 6 or NVIDIA AI Enterprise Infra Release 5 on the NGC Catalog. |
| Support Matrix | |

The following table lists products that were included in Production Branch - October 2024 (PB 24h2):

Products Included in Production Branch - October 2024 (PB 24h2)

| Name | Version | Product Documentation |
| --- | --- | --- |
| Deep Graph Library (DGL) | 24.08-py3 | Release 24.08 |
| NVIDIA NIM Llama-3.1-8b-instruct [1] | 1.3 | Release 1.3.0 |
| NVIDIA NIM Llama-3.1-70b-instruct [1] | 1.3 | Release 1.3.0 |
| NVIDIA DeepStream SDK | 7.1-triton-x86 | DEEPSTREAM SDK 7.1 FOR NVIDIA DGPU/X86 (NVAIE) |
| NVIDIA Holoscan SDK | 24.08 | Release 2.6.0 |
| NVIDIA Morpheus | 24.06-runtime | Release 24.06 |
| NVIDIA TensorRT | 24.08-py3 | Release 24.08 |
| NVIDIA Triton Inference Server | 24.08-py3, 24.08-py3-sdk, 24.08-py3-min | Release 24.08 |
| NVIDIA NIM Retrieval QA E5 Embedding v5 [1] | 1.2 | Release 1.2.0 |
| PyTorch | 24.08-py3 | Release 24.08 |
| PyTorch Geometric (PyG) | 24.08-py3 | Release 24.08 |
| RAPIDS | 24.06-runtime | RAPIDS Documentation |
| RAPIDS Accelerator for Apache Spark | 24.06.02 | Release v24.06.1 |
| TensorFlow 2 | 24.08-tf2-py3 | Release 24.08 |

Production Branch - May 2024 (PB 24h1) - EOL

Important

This branch reached end-of-life in January 2025. No further updates will be provided.

The following table provides archived release information for Production Branch - May 2024 (PB 24h1):

| Item | Details |
| --- | --- |
| PB Collection on NGC | Production Branch - May 2024 (PB 24h1) |
| First planned release | May 2024 |
| Last planned release | December 2024 |
| Planned End of Life (EOL) | January 2025 |
| Compatible Infrastructure Release | Use the latest NVIDIA AI Enterprise Infra Release 5 as a supported configuration. |
| Support Matrix | NVIDIA AI Enterprise Infrastructure Support Matrix 5 |

The following table lists products that were included in Production Branch - May 2024 (PB 24h1):

Products Included in Production Branch - May 2024 (PB 24h1)

| Name | Version | Product Documentation |
| --- | --- | --- |
| NVIDIA MONAI Toolkit | 24.03-py3 | Project MONAI |
| NVIDIA Morpheus | 24.02-runtime | Morpheus Documentation |
| NVIDIA TensorRT | 24.03-py3 | Release 24.03 |
| NVIDIA Triton Inference Server | 24.03-py3, 24.03-py3-sdk, 24.03-py3-min | Release 24.03 |
| PyTorch | 24.03-py3 | Release 24.03 |
| RAPIDS | 24.02-runtime | RAPIDS Documentation |
| RAPIDS Accelerator for Apache Spark | 24.02 | Release v24.02.0 |
| TensorFlow 2 | 24.03-tf2-py3 | Release 24.03 |

Production Branch - October 2023 (PB 23h2) - EOL

Important

This branch reached end-of-life in July 2024. No further updates will be provided.

The following table provides archived release information for Production Branch - October 2023 (PB 23h2):

| Item | Details |
| --- | --- |
| PB Collection on NGC | Production Branch - October 2023 (PB 23h2) |
| First planned release | October 2023 |
| Last planned release | June 2024 |
| Planned End of Life (EOL) | July 2024 |
| Compatible Infrastructure Release | Use the latest NVIDIA AI Enterprise Infra Release 5 as a supported configuration. |

The following table lists products that were included in Production Branch - October 2023 (PB 23h2):

Products Included in Production Branch - October 2023 (PB 23h2)

| Name | Version | Product Documentation |
| --- | --- | --- |
| NVIDIA Holoscan SDK | 23.10 | Release 1.0.3 |
| NVIDIA TensorRT | 23.08-py3 | Release 23.08 |
| NVIDIA Triton Inference Server | 23.08-py3, 23.08-py3-sdk, 23.08-py3-min, 23.08-pyt-python-py3, 23.08-tf2-python-py3 | Release 23.08 |
| PyTorch | 23.08-py3 | Release 23.08 |
| RAPIDS | 23.06-runtime | RAPIDS Documentation |
| TensorFlow 2 | 23.08-tf2-py3 | Release 23.08 |

Footnotes