Model Catalog

Explore the model families and sizes supported by the NVIDIA NeMo Customizer microservice.

Tip

For specific values required to create customization targets, refer to the customization target value reference guide.

Model Families

Llama Models

View the available Llama models from Meta, ranging from 1 billion to 70 billion parameters.

Llama Nemotron Models

View the available Llama Nemotron models from NVIDIA, including Nano and Super variants for efficient and advanced instruction tuning.

Phi Models

View the available Phi models from Microsoft, designed for strong reasoning capabilities with efficient deployment.

Support Matrix

The support matrices list the supported models and the features available for each one. For detailed technical specifications of a model (architecture, parameters, token limits, and so on), refer to the model family pages listed above.

Large Language Models

All models in the following table support both chat and completion model training.

Model	Train a Chat Model with Tool Calling	Fine-tuning Options	Sequence Packing[1]	Inference with NIM	Reasoning
meta/llama-3.3-70b-instruct	Yes	LoRA	No	Supported (unverified)	No
meta/llama-3.2-3b-instruct	Yes	SFT, LoRA	Yes	Supported (unverified)	No
meta/llama-3.2-1b-instruct	Yes	SFT, LoRA	Yes	Supported	No
meta/llama-3.1-70b-instruct	Yes	LoRA	Yes	Supported (unverified)	No
meta/llama-3.1-8b-instruct	Yes	SFT, LoRA	Yes	Supported	No
meta/llama3-70b-instruct	Yes	LoRA	Yes	Supported (unverified)	No
nvidia/nemotron-nano-llama-3.1-8b@1.0	No	LoRA, All Weights	No	Supported	Yes
nvidia/nemotron-super-llama-3.3-49b@1.0	No	LoRA	No	Supported	Yes
microsoft/phi-4	No	SFT, LoRA	No	Not supported	No
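
When choosing a model, it can help to query the matrix programmatically. The following is a minimal sketch, not part of the NeMo Customizer API: it copies the rows of the table above into a plain Python list and filters them. The names `ModelSupport`, `row`, `LLM_SUPPORT`, and `find_models` are hypothetical helpers introduced only for this illustration, and the values are taken verbatim from the table, so they go stale if the table changes.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelSupport:
    """One row of the LLM support matrix (hypothetical helper type)."""
    name: str
    tool_calling: bool
    fine_tuning: frozenset
    sequence_packing: bool
    nim_inference: str  # "Supported", "Supported (unverified)", or "Not supported"
    reasoning: bool


def row(name, tool_calling, fine_tuning, packing, nim, reasoning):
    """Build a ModelSupport from the table's literal cell values."""
    return ModelSupport(
        name=name,
        tool_calling=(tool_calling == "Yes"),
        fine_tuning=frozenset(opt.strip() for opt in fine_tuning.split(",")),
        sequence_packing=(packing == "Yes"),
        nim_inference=nim,
        reasoning=(reasoning == "Yes"),
    )


# Rows copied from the table above, in the same order.
LLM_SUPPORT = [
    row("meta/llama-3.3-70b-instruct", "Yes", "LoRA", "No", "Supported (unverified)", "No"),
    row("meta/llama-3.2-3b-instruct", "Yes", "SFT, LoRA", "Yes", "Supported (unverified)", "No"),
    row("meta/llama-3.2-1b-instruct", "Yes", "SFT, LoRA", "Yes", "Supported", "No"),
    row("meta/llama-3.1-70b-instruct", "Yes", "LoRA", "Yes", "Supported (unverified)", "No"),
    row("meta/llama-3.1-8b-instruct", "Yes", "SFT, LoRA", "Yes", "Supported", "No"),
    row("meta/llama3-70b-instruct", "Yes", "LoRA", "Yes", "Supported (unverified)", "No"),
    row("nvidia/nemotron-nano-llama-3.1-8b@1.0", "No", "LoRA, All Weights", "No", "Supported", "Yes"),
    row("nvidia/nemotron-super-llama-3.3-49b@1.0", "No", "LoRA", "No", "Supported", "Yes"),
    row("microsoft/phi-4", "No", "SFT, LoRA", "No", "Not supported", "No"),
]


def find_models(matrix, *, fine_tuning=None, sequence_packing=None,
                reasoning=None, verified_nim_only=False):
    """Return model names, in table order, that satisfy every given filter."""
    names = []
    for m in matrix:
        if fine_tuning is not None and fine_tuning not in m.fine_tuning:
            continue
        if sequence_packing is not None and m.sequence_packing != sequence_packing:
            continue
        if reasoning is not None and m.reasoning != reasoning:
            continue
        # "Supported (unverified)" is deliberately excluded by this filter.
        if verified_nim_only and m.nim_inference != "Supported":
            continue
        names.append(m.name)
    return names


if __name__ == "__main__":
    # Models that list SFT and have verified NIM inference support.
    print(find_models(LLM_SUPPORT, fine_tuning="SFT", verified_nim_only=True))
    # Models flagged as reasoning models.
    print(find_models(LLM_SUPPORT, reasoning=True))
```

The filter matches fine-tuning options by their exact label in the table, so a query for `"SFT"` does not match a row that lists `"All Weights"`.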

Embedding Models

Model	Fine-tuning Options	Inference with NIM
meta/llama-3.2-1b-embedding	SFT	Not supported