Phi Models#
This page provides detailed technical specifications for the Phi model family supported by NeMo Customizer. For information about supported features and capabilities, refer to Tested Models.
Microsoft Phi-4#
Property |
Value |
|---|---|
Creator |
Microsoft |
Architecture |
Decoder-only Transformer |
Description |
Phi-4 is Microsoft’s most advanced small language model, designed to deliver strong reasoning capabilities while being efficient to deploy. |
Max I/O Tokens |
16K |
Parameters |
14 billion |
Training Data |
High-quality data with emphasis on reasoning and code |
Default Name |
microsoft/phi-4 |
HuggingFace |
Training Options#
LoRA: 2x 80GB GPU, tensor parallel size 2, micro batch size 2
Full SFT: 4x 80GB GPU, tensor parallel size 2
Deployment Configuration#
LoRA:
NIM Image:
nvcr.io/nim/nvidia/llm-nim:1.15.5
Full SFT:
NIM Image:
nvcr.io/nim/nvidia/llm-nim:1.15.5Additional Environment Variables:
NIM_MODEL_PROFILE:vllm