Code Llama 13B Instruct |
Meta |
CodeLlama-13B-Instruct |
1.0 , 1.2 , 1.2.3
|
- |
- |
- |
Code Llama 34B Instruct |
Meta |
CodeLlama-34B-Instruct |
1.0 , 1.2 , 1.2.3
|
- |
- |
- |
Code Llama 70B Instruct |
Meta |
CodeLlama-70B-Instruct |
1.0 , 1.2 , 1.2.3
|
- |
- |
- |
DeepSeek R1 |
DeepSeek |
Deepseek-R1 |
1.7
|
No |
No |
No |
DeepSeek R1 Distill Llama 8B |
DeepSeek |
Deepseek-R1-Distill-Llama-8B |
1.5
|
- |
- |
- |
DeepSeek R1 Distill Llama 70B |
DeepSeek |
Deepseek-R1-Distill-Llama-70B |
1.5
|
- |
- |
- |
DeepSeek R1 Distill Llama 8B RTX |
DeepSeek |
Deepseek-R1-Distill-Llama-8B |
1.8
|
- |
- |
- |
Gemma 2 2B |
Google |
Gemma-2-2B-IT |
1.4
|
- |
- |
- |
Gemma 2 9B |
Google |
Gemma-2-9B-IT |
1.4.0
|
- |
- |
- |
Llama 2 7B Chat |
Meta |
meta-llama-2-7b-chat |
1.0 , 1.0.3
|
- |
- |
- |
Llama 2 13B Chat |
Meta |
meta-llama-2-13b-chat |
1.0 , 1.0.3
|
H100, A100, L40S |
- |
- |
Llama 2 70B Chat |
Meta |
meta-llama-2-70b-chat |
1.0 , 1.0.3
|
- |
- |
- |
Llama 3 SQLCoder 8B |
Meta |
Llama-3-SQLCoder-8B |
1.2.3
|
- |
- |
- |
Llama 3 Swallow 70B Instruct V0.1 |
Meta |
Llama-3-Swallow-70B-Instruct-v0.1 |
1.0 , 1.2 , 1.1.2
|
- |
- |
- |
Llama 3 Taiwan 70B Instruct |
Meta |
Llama-3-Taiwan-70B-Instruct |
1.0 , 1.1 , 1.1.2
|
- |
- |
- |
Llama 3.1 8B Base |
Meta |
Llama-3.1-8b-base |
1.0 , 1.1 , 1.1.1 , 1.1.2
|
- |
Yes |
Yes |
Llama 3.1 8B Instruct |
Meta |
Llama-3.1-8b-instruct |
1.0 , 1.1 , 1.1.1 , 1.1.2 , 1.2 , 1.2.3 , 1.3 , 1.3.3 , 1.5 , 1.8 , 1.8.3
|
Yes |
Yes |
Yes |
Llama 3.1 8B Instruct RTX |
Meta |
Llama-3.1-8b-instruct |
1.8
|
- |
Yes |
Yes |
Llama 3.1 70B Instruct |
Meta |
Llama-3.1-70b-instruct |
1.0 , 1.1 , 1.1.1 , 1.1.2 , 1.2 , 1.2.1 , 1.3 , 1.8.3
|
Yes (1.8.3 only) |
Yes |
Yes |
Llama 3.1 405B Instruct |
Meta |
Llama-3.1-405b-instruct |
1.0 , 1.1 , 1.1.2 , 1.2 , 1.3
|
- |
Yes |
Yes |
Llama 3.1 Nemotron Nano 8B V1 |
NVIDIA |
Llama-3.1 Nemotron Nano 8B V1 |
1.6.0 , 1.8.3
|
No |
- |
- |
Llama 3.1 Nemotron 70B Instruct |
Meta |
Llama-3.1 Nemotron 70B-Instruct |
1.0 , 1.1 , 1.1.1 , 1.2 , 1.2.1 , 1.2.3
|
- |
- |
- |
Llama 3.1 Swallow 8B Instruct v0.1 |
Meta |
Llama-3.1-Swallow-8B-Instruct-v0.1 |
1.3
|
- |
- |
- |
Llama 3.1 Swallow 70B Instruct v0.1 |
Meta |
Llama-3.1-Swallow-70B-Instruct-v0.1 |
1.3
|
- |
- |
- |
Llama 3.2 1B Instruct |
Meta |
Llama-3.2-1b-Instruct |
1.6.0 , 1.8.1 , 1.8.3
|
- |
- |
- |
Llama 3.2 3B Instruct |
Meta |
Llama-3.2-3b-Instruct |
1.6.0 , 1.8.3
|
Yes (1.8.3 only) |
Yes |
- |
Llama 3.3 70B Instruct |
Meta |
Llama-3.3-70b-Instruct |
1.5.2 , 1.8.2 , 1.8.3
|
Yes (1.8.3 ) |
Yes |
No |
Llama 3.3 Nemotron Super 49B V1 |
NVIDIA |
Llama-3.3-Nemotron-Super-49B-v1 |
1.8
|
Yes |
- |
- |
Meta Llama 3 8B Instruct |
Meta |
Meta/Llama3-8b-instruct |
1.0 , 1.0.3
|
- |
- |
- |
Meta Llama 3 70B Instruct |
Meta |
Meta/Llama3-70b-instruct |
1.0 , 1.0.1 , 1.0.3
|
- |
- |
- |
Mistral 7B Instruct v0.3 |
Mistral |
Mistral-07B-Instruct-v0.3 |
1.0 , 1.1 , 1.1.2 , 1.3
|
Hugging Face, NeMo Formats |
Yes |
Yes |
Mistral NeMo 12B Instruct RTX |
Mistral |
Mistral-Nemo-12B-Instruct |
1.8.1
|
- |
- |
- |
Mistral NeMo 12B Instruct |
Mistral |
Mistral-Nemo-12B-Instruct |
1.0 , 1.2 , 1.2.3
|
- |
- |
- |
Mixtral 8x7B Instruct v0.1 |
Mistral |
Mixtral-8x7B-Instruct-v0.1 |
1.0 , 1.2 , 1.2.1 , 1.3
|
Hugging Face, NeMo Formats |
No |
No |
Mixtral 8x22B Instruct v0.1 |
Mistral |
Mixtral-8x22B-Instruct-v0.1 |
1.0 , 1.2 , 1.2.3
|
- |
Yes |
Yes |
Nemotron 4 340B Instruct |
NVIDIA |
nemotron-4-340b-instruct |
1.0 , 1.1 , 1.1.2
|
- |
- |
- |
Nemotron 4 340B Instruct 128K |
NVIDIA |
nemotron-4-340b-instruct-128k |
1.2.3 , 1.3
|
- |
- |
- |
Nemotron 4 340B Reward |
NVIDIA |
Nemotron-4-340B-Reward |
1.0 , 1.2
|
- |
Yes |
Yes |
Phi 3 Mini 4K Instruct |
Microsoft |
Phi-3-4K-Instruct |
1.2.3
|
- |
- |
- |
Phind Codellama 34B V2 Instruct |
Microsoft |
Phind-Codellama-34B-v2-Instruct |
1.2.3
|
- |
- |
- |
Qwen2.5 7B Instruct |
Alibaba Cloud |
Qwen2.5 7B Instruct |
1.4.0
|
- |
- |
- |
StarCoder2 7B |
BigCode |
StarCoder 2 7B |
1.8
|
- |
- |
- |
StarCoderBase 15.5B |
BigCode |
StarCoderBase 15.5B |
1.5
|
- |
- |
- |