Model Catalog#

Explore the model families and sizes supported by the NVIDIA NeMo Customizer microservice.

Model Families#

Llama Models

View the available Llama models from Meta, ranging from 8 billion to 70 billion parameters.

Llama Models
Phi Models

View the available Phi models from Microsoft, designed for strong reasoning capabilities with efficient deployment.

Phi Models

Support Matrix#

The support matrices show the list of supported models and the features available for each model. For detailed technical specifications of each model (architecture, parameters, token limits, etc.), please refer to the previously listed model family pages.

Large Language Models#

Model

Train a Chat Model

Train a Chat Model with Tool Calling

Train a Completion Model

Fine-tuning Options

Sequence Packing[1]

Inference with NIM

meta/llama-3.3-70b-instruct

Yes

Yes

Yes

LoRA

No

Supported (unverified)

meta/llama-3.2-3b-instruct

Yes

Yes

Yes

SFT, LoRA

Yes

Supported (unverified)

meta/llama-3.2-1b-instruct

Yes

Yes

Yes

SFT, LoRA

Yes

Supported

meta/llama-3.1-70b-instruct

Yes

Yes

Yes

LoRA

Yes

Supported (unverified)

meta/llama-3.1-8b-instruct

Yes

Yes

Yes

SFT, LoRA

Yes

Supported

meta/llama3-70b-instruct

Yes

Yes

Yes

LoRA

Yes

Supported (unverified)

microsoft/phi-4

Yes

No

Yes

SFT, LoRA

No

Not supported

Embedding Models#

Model

Fine-tuning Options

Inference with NIM

meta/llama-3.2-1b-embedding

SFT

Not supported