Llama Nemotron Models

This page provides detailed technical specifications for the Llama Nemotron model family supported by the NVIDIA NeMo Customizer microservice. For supported features and capabilities, refer to the Support Matrix in the Model Catalog.

Llama 3.1 Nemotron Nano 8B v1

| Property | Value |
|---|---|
| Creator | NVIDIA |
| Architecture | transformer |
| Description | Llama 3.1 Nemotron Nano 8B v1 is a compact, instruction-tuned model for efficient customization and deployment. |
| Max I/O Tokens | 4096 |
| Parameters | 8 billion |
| Training Data | Not specified |
| Recommended GPUs for Customization | 1 (LoRA), 8 (All Weights) |
| Default Name | nvidia/nemotron-nano-llama-3.1-8b@1.0 |
| Version | ngc://nvidian/nemo-llm/nemotron-nano-3_1-8b:0.0.1 |

Training Options

  • LoRA: 1 GPU, tensor parallel size 1

  • All Weights: 8 GPUs, tensor parallel size 4
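The relationship between GPU count and tensor parallel size in these options can be made concrete with a small calculation. The sketch below assumes, and this page does not state it, that GPUs beyond one tensor-parallel group act as data-parallel replicas and that no pipeline parallelism is used. The `TrainingOption` class and `data_parallel_replicas` helper are hypothetical names for illustration and are not part of the NeMo Customizer API.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TrainingOption:
    """Hypothetical container for one entry of the Training Options list."""
    name: str
    num_gpus: int
    tensor_parallel_size: int


def data_parallel_replicas(option: TrainingOption) -> int:
    """Return the GPU count divided by the tensor parallel size.

    Assumption: no pipeline parallelism, so every GPU outside one
    tensor-parallel group belongs to another data-parallel replica.
    """
    if option.num_gpus % option.tensor_parallel_size != 0:
        raise ValueError("num_gpus must be a multiple of tensor_parallel_size")
    return option.num_gpus // option.tensor_parallel_size


# Values copied from the Llama 3.1 Nemotron Nano 8B v1 training options.
NANO_8B_OPTIONS = [
    TrainingOption("LoRA", num_gpus=1, tensor_parallel_size=1),
    TrainingOption("All Weights", num_gpus=8, tensor_parallel_size=4),
]

for opt in NANO_8B_OPTIONS:
    print(f"{opt.name}: {data_parallel_replicas(opt)} data-parallel replica(s)")
```

Under that assumption, the All Weights option would split the 8 GPUs into two tensor-parallel groups of 4, while LoRA runs as a single replica on one GPU.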

Llama 3.3 Nemotron Super 49B v1

| Property | Value |
|---|---|
| Creator | NVIDIA |
| Architecture | transformer |
| Description | Llama 3.3 Nemotron Super 49B v1 is a large, instruction-tuned model for advanced dialogue and reasoning tasks. |
| Max I/O Tokens | 4096 |
| Parameters | 49 billion |
| Training Data | Not specified |
| Recommended GPUs for Customization | 4 (LoRA) |
| Default Name | nvidia/nemotron-super-llama-3.3-49b@1.0 |
| Version | ngc://nvidian/nemo-llm/nemotron-super-3_3-49b:v1 |

Training Options

  • LoRA: 4 GPUs, tensor parallel size 4
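The training options on this page can also be read as a small lookup table. The sketch below uses hypothetical names (`TRAINING_OPTIONS`, `options_for_budget`) to filter the documented model and training-type pairs by the number of GPUs available. It only restates the figures listed on this page and does not query the Customizer service.

```python
# (model default name, training type) -> (GPUs, tensor parallel size),
# copied from the Training Options lists on this page.
TRAINING_OPTIONS = {
    ("nvidia/nemotron-nano-llama-3.1-8b@1.0", "LoRA"): (1, 1),
    ("nvidia/nemotron-nano-llama-3.1-8b@1.0", "All Weights"): (8, 4),
    ("nvidia/nemotron-super-llama-3.3-49b@1.0", "LoRA"): (4, 4),
}


def options_for_budget(available_gpus: int) -> list[tuple[str, str]]:
    """Return (model, training type) pairs whose GPU count fits the budget."""
    return [
        key
        for key, (gpus, _tensor_parallel) in TRAINING_OPTIONS.items()
        if gpus <= available_gpus
    ]


# Example: list what the documented options allow on a 4-GPU node.
for model, training_type in options_for_budget(4):
    print(model, training_type)
```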