Support Matrix for PB NIMs#
This page lists the supported models for NIM LLM NIMs on the Production Branch (PB).
Supported Models#
Use the following sections to identify the supported models.
llama-3.1-8b-instruct-pb6#
Latest supported NIM LLM PB version: 2.0.4-pb6.0
This PB NIM is published on NGC as meta/llama-3.1-8b-instruct-pb6.
llama-3.3-70b-instruct-pb6#
Latest supported NIM LLM PB version: 2.0.4-pb6.0
This PB NIM is published on NGC as meta/llama-3.3-70b-instruct-pb6.
llama-3.3-nemotron-super-49b-v1.5-pb6#
Latest supported NIM LLM PB version: 2.0.4-pb6.0
This PB NIM is published on NGC as nvidia/llama-3.3-nemotron-super-49b-v1.5-pb6.
nemotron-3-nano-pb6#
Latest supported NIM LLM PB version: 2.0.4-pb6.0
This PB NIM is published on NGC as nvidia/nemotron-3-nano-pb6.
gpt-oss-120b-pb6#
Latest supported NIM LLM PB version: 2.0.4-pb6.0
This PB NIM is published on NGC as openai/gpt-oss-120b-pb6.
model-free-nim-pb6#
Latest supported NIM LLM PB version: 2.0.4-pb6.0
This PB NIM is published on NGC as
nvidia/model-free-nim-pb6.
While not explicitly validated, the model-free NIM can be used with any model supported by the underlying backend (vLLM) version. Refer to Model-Free NIM for deployment details.