Support Matrix for Certified NIMs#
This page lists the supported models, their deployment profiles, and the verified hardware SKUs for NIM LLM Certified NIMs. For NIM Day 0, refer to Support Matrix for NIM Day 0. For NIM Turbo, refer to Support Matrix for NIM Turbo.
Supported Models and Profiles#
Use the following sections to identify the supported deployment profiles for each model. Profile strings follow a naming convention described in Model Profiles and Selection.
Use the table below to filter certified NIM profiles by GPU, tensor parallelism (TP), precision, and model name. Each row is one supported profile; details also appear in the per-model sections that follow.
gpt-oss-120b#
Latest supported NIM LLM version: 2.0.4
The following table lists the supported profile configurations for openai/gpt-oss-120b:
| Precision | TP1 | TP2 | TP4 | TP8 |
|---|---|---|---|---|
| MXFP4 | | | | |
| MXFP4 + LoRA | | | | |
Verified GPUs
This model has been verified on the following GPUs:
NVIDIA-A100-SXM4-40GB, NVIDIA-A100-SXM4-80GB, NVIDIA-A10G, NVIDIA-B200, NVIDIA-B300-SXM6-AC, NVIDIA-GB200, NVIDIA-GB300, NVIDIA-GH200-144G-HBM3e, NVIDIA-GH200-480GB, NVIDIA-H100-80GB-HBM3, NVIDIA-H100-NVL, NVIDIA-H200, NVIDIA-H200-NVL, NVIDIA-L40S, NVIDIA-RTX-PRO-4500-Blackwell-Server-Edition, NVIDIA-RTX-PRO-6000-Blackwell-Server-Edition
gpt-oss-20b#
Latest supported NIM LLM version: 2.0.4
The following table lists the supported profile configurations for openai/gpt-oss-20b:
| Precision | TP1 | TP2 | TP4 | TP8 |
|---|---|---|---|---|
| MXFP4 | | | | |
| MXFP4 + LoRA | | | | |
Verified GPUs
This model has been verified on the following GPUs:
NVIDIA-A100-SXM4-40GB, NVIDIA-A100-SXM4-80GB, NVIDIA-A10G, NVIDIA-B200, NVIDIA-B300-SXM6-AC, NVIDIA-GB10, NVIDIA-GB200, NVIDIA-GB300, NVIDIA-GH200-144G-HBM3e, NVIDIA-GH200-480GB, NVIDIA-H100-80GB-HBM3, NVIDIA-H100-NVL, NVIDIA-H200, NVIDIA-H200-NVL, NVIDIA-L40S, NVIDIA-RTX-PRO-4500-Blackwell-Server-Edition, NVIDIA-RTX-PRO-6000-Blackwell-Server-Edition
llama-3.1-70b-instruct#
Latest supported NIM LLM version: 2.0.4
The following table lists the supported profile configurations for meta/llama-3.1-70b-instruct:
| Precision | TP1 | TP2 | TP4 | TP8 |
|---|---|---|---|---|
| BF16 | | | | |
| BF16 + LoRA | | | | |
| FP8 | | | | |
| FP8 + LoRA | | | | |
| NVFP4 | | | | |
| NVFP4 + LoRA | | | | |
Verified GPUs
This model has been verified on the following GPUs:
NVIDIA-A10G, NVIDIA-A100-SXM4-40GB, NVIDIA-A100-SXM4-80GB, NVIDIA-B200, NVIDIA-B300-SXM6-AC, NVIDIA-GB200, NVIDIA-GB300, NVIDIA-GH200-144G-HBM3e, NVIDIA-GH200-480GB, NVIDIA-H100-80GB-HBM3, NVIDIA-H100-NVL, NVIDIA-H200, NVIDIA-H200-NVL, NVIDIA-L40S, NVIDIA-RTX-PRO-4500-Blackwell-Server-Edition, NVIDIA-RTX-PRO-6000-Blackwell-Server-Edition
llama-3.1-8b-instruct#
Latest supported NIM LLM version: 2.0.4
The following table lists the supported profile configurations for meta/llama-3.1-8b-instruct:
| Precision | TP1 |
|---|---|
| BF16 | |
| BF16 + LoRA | |
| FP8 | |
| FP8 + LoRA | |
| NVFP4 | |
| NVFP4 + LoRA | |
Verified GPUs
This model has been verified on the following GPUs:
NVIDIA-A100-SXM4-40GB, NVIDIA-A100-SXM4-80GB, NVIDIA-A10G, NVIDIA-B200, NVIDIA-B300-SXM6-AC, NVIDIA-GB10, NVIDIA-GB200, NVIDIA-GB300, NVIDIA-GH200-144G-HBM3e, NVIDIA-GH200-480GB, NVIDIA-H100-80GB-HBM3, NVIDIA-H100-NVL, NVIDIA-H200, NVIDIA-H200-NVL, NVIDIA-L40S, NVIDIA-RTX-PRO-4500-Blackwell-Server-Edition, NVIDIA-RTX-PRO-6000-Blackwell-Server-Edition
llama-3.3-70b-instruct#
Latest supported NIM LLM version: 2.0.4
The following table lists the supported profile configurations for meta/llama-3.3-70b-instruct:
| Precision | TP1 | TP2 | TP4 | TP8 |
|---|---|---|---|---|
| BF16 | | | | |
| BF16 + LoRA | | | | |
| FP8 | | | | |
| FP8 + LoRA | | | | |
| NVFP4 | | | | |
| NVFP4 + LoRA | | | | |
Verified GPUs
This model has been verified on the following GPUs:
NVIDIA-A10G, NVIDIA-A100-SXM4-40GB, NVIDIA-A100-SXM4-80GB, NVIDIA-B200, NVIDIA-B300-SXM6-AC, NVIDIA-GB200, NVIDIA-GB300, NVIDIA-GH200-144G-HBM3e, NVIDIA-GH200-480GB, NVIDIA-H100-80GB-HBM3, NVIDIA-H100-NVL, NVIDIA-H200, NVIDIA-H200-NVL, NVIDIA-L40S, NVIDIA-RTX-PRO-4500-Blackwell-Server-Edition, NVIDIA-RTX-PRO-6000-Blackwell-Server-Edition
llama-3.3-nemotron-super-49b-v1.5#
Latest supported NIM LLM version: 2.0.4
The following table lists the supported profile configurations for nvidia/llama-3.3-nemotron-super-49b-v1.5:
| Precision | TP1 | TP2 | TP4 | TP8 |
|---|---|---|---|---|
| BF16 | | | | |
| BF16 + LoRA | | | | |
| FP8 | | | | |
| FP8 + LoRA | | | | |
| NVFP4 | | | | |
| NVFP4 + LoRA | | | | |
Verified GPUs
This model has been verified on the following GPUs:
NVIDIA-A10G, NVIDIA-A100-SXM4-40GB, NVIDIA-A100-SXM4-80GB, NVIDIA-B200, NVIDIA-B300-SXM6-AC, NVIDIA-GB10, NVIDIA-GB200, NVIDIA-GB300, NVIDIA-GH200-144G-HBM3e, NVIDIA-GH200-480GB, NVIDIA-H100-80GB-HBM3, NVIDIA-H100-NVL, NVIDIA-H200, NVIDIA-H200-NVL, NVIDIA-L40S, NVIDIA-RTX-PRO-4500-Blackwell-Server-Edition, NVIDIA-RTX-PRO-6000-Blackwell-Server-Edition
nemotron-3-nano#
Latest supported NIM LLM version: 2.0.4
The following table lists the supported profile configurations for nvidia/nemotron-3-nano:
| Precision | TP1 | TP2 | TP4 | TP8 |
|---|---|---|---|---|
| BF16 | | | | |
| BF16 + LoRA | | | | |
| FP8 | | | | |
| FP8 + LoRA | | | | |
| NVFP4 | | | | |
Verified GPUs
This model has been verified on the following GPUs:
NVIDIA-A100-SXM4-40GB, NVIDIA-A100-SXM4-80GB, NVIDIA-A10G, NVIDIA-B200, NVIDIA-B300-SXM6-AC, NVIDIA-GB10, NVIDIA-GB200, NVIDIA-GB300, NVIDIA-GH200-144G-HBM3e, NVIDIA-GH200-480GB, NVIDIA-H100-80GB-HBM3, NVIDIA-H100-NVL, NVIDIA-H200, NVIDIA-H200-NVL, NVIDIA-L40S, NVIDIA-RTX-PRO-4500-Blackwell-Server-Edition, NVIDIA-RTX-PRO-6000-Blackwell-Server-Edition
nemotron-3-super-120b-a12b#
Latest supported NIM LLM version: 2.0.4
The following table lists the supported profile configurations for nvidia/nemotron-3-super-120b-a12b:
| Precision | TP1 | TP2 | TP4 | TP8 |
|---|---|---|---|---|
| BF16 | | | | |
| BF16 + LoRA | | | | |
| FP8 | | | | |
| FP8 + LoRA | | | | |
| NVFP4 | | | | |
| NVFP4 + LoRA | | | | |
Note
This is a large model. Lower-TP profiles require substantially more GPU memory per device, so some verified GPUs support only TP4 or TP8 profiles.
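The relationship described in this note can be made concrete with a rough estimate of weight memory per GPU. The sketch below is illustrative only. It assumes roughly 120 billion parameters (read from the model name), approximate storage of 2, 1, and 0.5 bytes per parameter for BF16, FP8, and NVFP4, and an even split of the weights across tensor-parallel ranks. It ignores KV cache, activations, and quantization scale factors, so real requirements are higher. The function name `weight_gb_per_gpu` is hypothetical and not part of NIM.

```python
# Approximate bytes needed to store one weight at each precision.
# Assumption: scale factors and any layers kept at higher precision are ignored.
BYTES_PER_PARAM = {"BF16": 2.0, "FP8": 1.0, "NVFP4": 0.5}


def weight_gb_per_gpu(num_params: float, precision: str, tp: int) -> float:
    """Estimate weight memory per GPU in GB (10**9 bytes).

    Assumes the weights are sharded evenly across `tp` tensor-parallel ranks.
    """
    total_bytes = num_params * BYTES_PER_PARAM[precision]
    return total_bytes / tp / 1e9


if __name__ == "__main__":
    num_params = 120e9  # assumption: taken from the "120b" in the model name
    for precision in BYTES_PER_PARAM:
        for tp in (1, 2, 4, 8):
            gb = weight_gb_per_gpu(num_params, precision, tp)
            print(f"{precision:>6} TP{tp}: {gb:6.1f} GB per GPU")
```

Under these assumptions, BF16 weights alone come to about 240 GB at TP1, 60 GB per GPU at TP4, and 30 GB per GPU at TP8, which illustrates why smaller-memory GPUs may only fit the higher-TP profiles.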
Verified GPUs
The following GPUs have been verified with one or more supported profiles for this model:
NVIDIA-A100-SXM4-40GB, NVIDIA-A100-SXM4-80GB, NVIDIA-B200, NVIDIA-B300-SXM6-AC, NVIDIA-GB200, NVIDIA-GB300, NVIDIA-GH200-144G-HBM3e, NVIDIA-H100-80GB-HBM3, NVIDIA-H100-NVL, NVIDIA-H200, NVIDIA-H200-NVL, NVIDIA-L40S, NVIDIA-RTX-PRO-4500-Blackwell-Server-Edition, NVIDIA-RTX-PRO-6000-Blackwell-Server-Edition
starcoder2-7b#
Latest supported NIM LLM version: 2.0.4
The following table lists the supported profile configurations for bigcode/starcoder2-7b:
| Precision | TP1 | TP2 |
|---|---|---|
| BF16 | | |
Verified GPUs
This model has been verified on the following GPUs:
NVIDIA-H100-80GB-HBM3, NVIDIA-H200
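A verified-GPU list such as the one for bigcode/starcoder2-7b can be checked programmatically before deployment. The following is a minimal sketch, not an official NIM utility. It assumes that the identifiers on this page correspond to the GPU product name reported by the driver (for example, by `nvidia-smi`) with spaces replaced by hyphens; that mapping is an assumption and should be confirmed against your environment. The helper names `normalize_gpu_name` and `is_verified` are hypothetical.

```python
# Verified GPUs for bigcode/starcoder2-7b, copied from the list on this page.
VERIFIED_STARCODER2_7B = {"NVIDIA-H100-80GB-HBM3", "NVIDIA-H200"}


def normalize_gpu_name(name: str) -> str:
    """Collapse runs of whitespace into single hyphens.

    Assumption: the matrix identifiers are the driver-reported product
    names with spaces replaced by hyphens.
    """
    return "-".join(name.split())


def is_verified(reported_name: str, verified: set[str]) -> bool:
    """Return True if the reported GPU name matches a verified identifier.

    The comparison is case-insensitive to tolerate spelling differences
    such as "PCIe" versus "PCIE".
    """
    wanted = normalize_gpu_name(reported_name).casefold()
    return wanted in {gpu.casefold() for gpu in verified}


if __name__ == "__main__":
    for name in ("NVIDIA H100 80GB HBM3", "NVIDIA H200", "NVIDIA L40S"):
        print(name, "->", is_verified(name, VERIFIED_STARCODER2_7B))
```

A result of `False` means only that the GPU is not on the verified list for this model; it does not by itself indicate that the model cannot run on that hardware.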
Model-Free NIM#
Latest supported NIM LLM version: 2.0.4
The following models are tested and validated for nvidia/model-free-nim:
gpt-oss-20b, apriel-nemotron, codestral
The model-free NIM can also be used with any model that the underlying backend (vLLM) version supports, although such models are not explicitly validated. Refer to Model-Free NIM for deployment details.
Verified GPUs
The model-free NIM has been verified on the following GPUs:
NVIDIA-A100-80GB-PCIe, NVIDIA-A100-PCIE-40GB, NVIDIA-A100-SXM4-40GB, NVIDIA-A100-SXM4-80GB, NVIDIA-A10G, NVIDIA-B200, NVIDIA-B300-SXM6-AC, NVIDIA-GB10, NVIDIA-GB200, NVIDIA-GB300, NVIDIA-GH200-144G-HBM3e, NVIDIA-GH200-480GB, NVIDIA-H100-80GB-HBM3, NVIDIA-H100-NVL, NVIDIA-H100-PCIe, NVIDIA-H200, NVIDIA-H200-NVL, NVIDIA-L40S, NVIDIA-RTX-PRO-4500-Blackwell-Server-Edition, NVIDIA-RTX-PRO-6000-Blackwell-Server-Edition
1.x NIM LLM Models#
For more information on version 1.x NIMs, refer to the 1.15 version of the NIM LLM Supported Models page.
| Model (Hardware Requirements) | Organization/Model ID (Catalog Page) |
|---|---|