Support Matrix for PB NIMs#

This page lists the supported models for NIM LLM NIMs on the Production Branch (PB).

Supported Models#

Use the following sections to identify the supported models.

llama-3.1-8b-instruct-pb6#

Latest supported NIM LLM PB version: 2.0.4-pb6.0

This PB NIM is published on NGC as meta/llama-3.1-8b-instruct-pb6.

llama-3.3-70b-instruct-pb6#

Latest supported NIM LLM PB version: 2.0.4-pb6.0

This PB NIM is published on NGC as meta/llama-3.3-70b-instruct-pb6.

llama-3.3-nemotron-super-49b-v1.5-pb6#

Latest supported NIM LLM PB version: 2.0.4-pb6.0

This PB NIM is published on NGC as nvidia/llama-3.3-nemotron-super-49b-v1.5-pb6.

nemotron-3-nano-pb6#

Latest supported NIM LLM PB version: 2.0.4-pb6.0

This PB NIM is published on NGC as nvidia/nemotron-3-nano-pb6.

gpt-oss-120b-pb6#

Latest supported NIM LLM PB version: 2.0.4-pb6.0

This PB NIM is published on NGC as openai/gpt-oss-120b-pb6.

model-free-nim-pb6#

Latest supported NIM LLM PB version: 2.0.4-pb6.0

This PB NIM is published on NGC as nvidia/model-free-nim-pb6.

While not explicitly validated, the model-free NIM can be used with any model supported by the underlying backend (vLLM) version. Refer to Model-Free NIM for deployment details.