Llama Nemotron Models#
This page provides detailed technical specifications for the Nemotron model family supported by the NVIDIA NeMo Customizer microservice. For information about supported features and capabilities, refer to the Support Matrix in the Model Catalog.
Llama 3.1 Nemotron Nano 8B v1#
Property |
Value |
---|---|
Creator |
NVIDIA |
Architecture |
transformer |
Description |
Llama 3.1 Nemotron Nano 8B v1 is a compact, instruction-tuned model for efficient customization and deployment. |
Max I/O Tokens |
4096 |
Parameters |
8 billion |
Training Data |
Not specified |
Recommended GPUs for Customization |
1 (LoRA), 8 (All Weights) |
Default Name |
nvidia/nemotron-nano-llama-3.1-8b@1.0 |
Version |
|
Training Options#
LoRA: 1 GPU, tensor parallel size 1
All Weights: 8 GPUs, tensor parallel size 4
Llama 3.3 Nemotron Super 49B v1#
Property |
Value |
---|---|
Creator |
NVIDIA |
Architecture |
transformer |
Description |
Llama 3.3 Nemotron Super 49B v1 is a large, instruction-tuned model for advanced dialogue and reasoning tasks. |
Max I/O Tokens |
4096 |
Parameters |
49 billion |
Training Data |
Not specified |
Recommended GPUs for Customization |
4 (LoRA) |
Default Name |
nvidia/nemotron-super-llama-3.3-49b@1.0 |
Version |
|
Training Options#
LoRA: 4 GPUs, tensor parallel size 4