Llama Models#
This page provides detailed technical specifications for the Llama model family supported by NeMo Customizer. For information about supported features and capabilities, refer to Tested Models.
Llama-3.3-70b Instruct#
| Property | Value | 
|---|---|
| Creator | Meta | 
| Architecture | transformer | 
| Description | Llama-3.3-70b is a large language AI model optimized for advanced dialogue and reasoning capabilities. | 
| Max I/O Tokens | 8192 | 
| Parameters | 70 billion | 
| Training Data | 15+ trillion tokens (up to 2024) | 
| Recommended GPUs for Customization | 16 | 
| Default Name | meta/llama-3.3-70b-instruct | 
| Version | 
 | 
Llama-3.2-3b Instruct#
| Property | Value | 
|---|---|
| Creator | Meta | 
| Architecture | transformer | 
| Description | Llama-3.2-3b is a compact yet powerful language model suitable for various dialogue applications. | 
| Max I/O Tokens | 8192 | 
| Parameters | 3 billion | 
| Training Data | 15+ trillion tokens (up to 2024) | 
| Recommended GPUs for Customization | 1 | 
| Default Name | meta/llama-3.2-3b-instruct | 
| Version | 
 | 
Llama-3.2-1b Instruct#
| Property | Value | 
|---|---|
| Creator | Meta | 
| Architecture | transformer | 
| Description | Llama-3.2-1b is a lightweight language model designed for efficient deployment while maintaining strong capabilities. | 
| Max I/O Tokens | 8192 | 
| Parameters | 1 billion | 
| Training Data | 15+ trillion tokens (up to 2024) | 
| Recommended GPUs for Customization | 1 | 
| Default Name | meta/llama-3.2-1b-instruct | 
| Version | 
 | 
Llama-3.1-70b Instruct#
| Property | Value | 
|---|---|
| Creator | Meta | 
| Architecture | transformer | 
| Description | Llama-3.1-70b is a large language AI model optimized for multilingual dialogue uses. | 
| Max I/O Tokens | 8192 | 
| Parameters | 70 billion | 
| Training Data | 15 trillion tokens (up to December 2023) | 
| Recommended GPUs for Customization | 16 | 
| Default Name | meta/llama-3.1-70b-instruct | 
| Version | 
 | 
Llama-3.1-8b Instruct#
| Property | Value | 
|---|---|
| Creator | Meta | 
| Architecture | transformer | 
| Description | Llama-3.1-8b is a large language AI model optimized for multilingual dialogue uses. | 
| Max I/O Tokens | 8192 | 
| Parameters | 8 billion | 
| Training Data | 15 trillion tokens (up to December 2023) | 
| Recommended GPUs for Customization | 4 | 
| Default Name | meta/llama-3.1-8b-instruct | 
| Version | 
 | 
Llama-3-70b Instruct#
| Property | Value | 
|---|---|
| Creator | Meta | 
| Architecture | transformer | 
| Description | Llama-3-70b is a large language AI model comprising a collection of models capable of generating text and code in response to prompts. | 
| Max I/O Tokens | 8192 | 
| Parameters | 70 billion | 
| Training Data | 15 trillion tokens (up to December 2023) | 
| Recommended GPUs for Customization | 16 | 
| Default Name | meta/llama-3-70b-instruct | 
| Version | 
 |