Qwen Models
This page provides detailed technical specifications for the Qwen model family supported by NeMo Customizer. For information about supported features and capabilities, refer to Tested Models.
Qwen2.5-1.5B-Instruct
Training Options
- LoRA: 1x 80GB GPU, tensor parallel size 1
- Full SFT: 1x 80GB GPU, tensor parallel size 1
Deployment Configuration
- LoRA:
  - NIM Image: nvcr.io/nim/nvidia/llm-nim:1.15.5
  - GPU Count: 1x 80GB
- Full SFT:
  - NIM Image: nvcr.io/nim/nvidia/llm-nim:1.15.5
  - GPU Count: 1x 80GB
  - Additional Environment Variables: NIM_MODEL_PROFILE:vllm
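
As a rough illustration of how the Full SFT deployment values above fit together, the following sketch assembles a `docker run` argument list from the listed image, GPU count, and environment variable. Launching the NIM directly with Docker is an assumption made for illustration only; this page does not describe the launch mechanism, and a real deployment would likely need further settings (credentials, model location) that are not listed here. The helper `build_docker_command` is hypothetical, as is reading `NIM_MODEL_PROFILE:vllm` as the name/value pair `NIM_MODEL_PROFILE=vllm` that applies to Full SFT only.

```python
import shlex

# Values copied from the Full SFT deployment configuration on this page.
NIM_IMAGE = "nvcr.io/nim/nvidia/llm-nim:1.15.5"
GPU_COUNT = 1
# Assumption: "NIM_MODEL_PROFILE:vllm" means name NIM_MODEL_PROFILE, value vllm.
EXTRA_ENV = {"NIM_MODEL_PROFILE": "vllm"}


def build_docker_command(image, gpu_count, env):
    """Hypothetical helper: build a `docker run` argument list.

    Uses only standard Docker flags (--rm, --gpus, -e). It does not add
    credentials, ports, or volume mounts, which a real launch may require.
    """
    cmd = ["docker", "run", "--rm", "--gpus", str(gpu_count)]
    for name, value in sorted(env.items()):
        cmd += ["-e", f"{name}={value}"]
    cmd.append(image)
    return cmd


# Full SFT carries the extra environment variable; LoRA does not.
full_sft_cmd = build_docker_command(NIM_IMAGE, GPU_COUNT, EXTRA_ENV)
lora_cmd = build_docker_command(NIM_IMAGE, GPU_COUNT, {})

# Print a shell-safe rendering of the Full SFT command for inspection.
print(shlex.join(full_sft_cmd))
```

Building the command as a list rather than a single string keeps each argument separate, so it can be passed to `subprocess.run` without shell quoting concerns.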
Qwen3-0.6B
Training Options
- LoRA: 1x 80GB GPU, tensor parallel size 1
- Full SFT: 1x 80GB GPU, tensor parallel size 1
Deployment Configuration
- LoRA:
  - NIM Image: nvcr.io/nim/nvidia/llm-nim:1.15.5
  - GPU Count: 1x 80GB
- Full SFT:
  - NIM Image: nvcr.io/nim/nvidia/llm-nim:1.15.5
  - GPU Count: 1x 80GB
  - Additional Environment Variables: NIM_MODEL_PROFILE:vllm
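
The specifications on this page can also be captured as a small lookup table, which may be convenient when scripting against several models. The sketch below is not part of NeMo Customizer: the `ModelSpec` class, the `SPECS` table, and the `spec_for` and `fits_on` helpers are hypothetical names introduced for illustration, and attaching `NIM_MODEL_PROFILE` to Full SFT only is an interpretation of the lists above.

```python
from dataclasses import dataclass, field

NIM_IMAGE = "nvcr.io/nim/nvidia/llm-nim:1.15.5"


@dataclass(frozen=True)
class ModelSpec:
    """Hypothetical record of one model/method row from this page."""
    train_gpus: int            # GPUs listed under Training Options
    tensor_parallel_size: int  # tensor parallel size listed for training
    gpu_memory_gb: int         # per-GPU memory listed (80GB)
    nim_image: str             # NIM image listed under Deployment Configuration
    deploy_gpus: int           # GPU count listed for deployment
    extra_env: dict = field(default_factory=dict)


def _spec(extra_env=None):
    # Every row on this page shares the same GPU and image values.
    return ModelSpec(1, 1, 80, NIM_IMAGE, 1, dict(extra_env or {}))


# Keys are (model name, customization method).
SPECS = {
    ("Qwen2.5-1.5B-Instruct", "lora"): _spec(),
    ("Qwen2.5-1.5B-Instruct", "full_sft"): _spec({"NIM_MODEL_PROFILE": "vllm"}),
    ("Qwen3-0.6B", "lora"): _spec(),
    ("Qwen3-0.6B", "full_sft"): _spec({"NIM_MODEL_PROFILE": "vllm"}),
}


def spec_for(model, method):
    """Look up a spec, raising a readable error for unknown combinations."""
    try:
        return SPECS[(model, method)]
    except KeyError:
        raise KeyError(f"no spec recorded for {model!r} with method {method!r}") from None


def fits_on(spec, available_gpus, gpu_memory_gb):
    """Return True if the hardware meets the listed training requirement."""
    return available_gpus >= spec.train_gpus and gpu_memory_gb >= spec.gpu_memory_gb


spec = spec_for("Qwen3-0.6B", "full_sft")
print(spec.nim_image, spec.extra_env, fits_on(spec, available_gpus=1, gpu_memory_gb=80))
```

Note that `fits_on` only compares against the GPU sizes listed on this page; it does not establish that smaller GPUs are unsupported, only that they are not documented here.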