Phi Models

This page provides detailed technical specifications for the Phi model family supported by NeMo Customizer. For information about supported features and capabilities, refer to Tested Models.

Microsoft Phi-4

Property	Value
Creator	Microsoft
Architecture	Decoder-only Transformer
Description	Phi-4 is Microsoft’s most advanced small language model, designed to deliver strong reasoning capabilities while remaining efficient to deploy.
Max I/O Tokens	16K
Parameters	14 billion
Training Data	High-quality data with emphasis on reasoning and code
Default Name	microsoft/phi-4
Hugging Face	microsoft/phi-4

Training Options

LoRA: 2x 80GB GPU, tensor parallel size 2, micro batch size 2
Full SFT: 4x 80GB GPU, tensor parallel size 2
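
The GPU counts and tensor parallel sizes above imply how many data-parallel replicas each training option runs. The following is a minimal sketch of that arithmetic, not NeMo Customizer code: the helper `data_parallel_size` and the `TRAINING_OPTIONS` dictionary are hypothetical names introduced for illustration, and a pipeline parallel size of 1 is assumed because this page does not specify one.

```python
def data_parallel_size(num_gpus: int, tensor_parallel_size: int,
                       pipeline_parallel_size: int = 1) -> int:
    """Return the number of data-parallel replicas for a GPU layout.

    Assumes every replica uses tensor_parallel_size * pipeline_parallel_size GPUs.
    pipeline_parallel_size defaults to 1 (an assumption, not stated on this page).
    """
    gpus_per_replica = tensor_parallel_size * pipeline_parallel_size
    if num_gpus % gpus_per_replica != 0:
        raise ValueError(
            f"{num_gpus} GPUs cannot be split evenly into replicas of "
            f"{gpus_per_replica} GPUs"
        )
    return num_gpus // gpus_per_replica


# Values copied from the Training Options list; the dictionary layout is illustrative.
TRAINING_OPTIONS = {
    "LoRA": {"num_gpus": 2, "tensor_parallel_size": 2, "micro_batch_size": 2},
    "Full SFT": {"num_gpus": 4, "tensor_parallel_size": 2},
}

if __name__ == "__main__":
    for name, opts in TRAINING_OPTIONS.items():
        replicas = data_parallel_size(opts["num_gpus"], opts["tensor_parallel_size"])
        print(f"{name}: {replicas} data-parallel replica(s)")
```

Under these assumptions, the LoRA layout holds a single copy of the model sharded across both GPUs, while the Full SFT layout runs two replicas of two GPUs each.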

Deployment Configuration

LoRA:
  NIM Image: nvcr.io/nim/nvidia/llm-nim:1.15.5
Full SFT:
  NIM Image: nvcr.io/nim/nvidia/llm-nim:1.15.5
  Additional Environment Variables:
    NIM_MODEL_PROFILE: vllm
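
As an illustration of how the deployment settings above fit together, the sketch below collects them in a dictionary and assembles a container command line from them. Several things here are assumptions rather than documented behavior: launching the image with `docker run` is only one possible route and is not described on this page, the names `DEPLOYMENT` and `docker_run_command` are hypothetical, and the `NIM_MODEL_PROFILE` variable is attached to Full SFT only because of where it appears in the listing. A real deployment also needs credentials, model locations, and port settings that are omitted here.

```python
# Values copied from the Deployment Configuration list; the layout is illustrative.
DEPLOYMENT = {
    "LoRA": {
        "image": "nvcr.io/nim/nvidia/llm-nim:1.15.5",
        "env": {},
    },
    "Full SFT": {
        "image": "nvcr.io/nim/nvidia/llm-nim:1.15.5",
        # Assumed to apply to Full SFT only, based on the listing order.
        "env": {"NIM_MODEL_PROFILE": "vllm"},
    },
}


def docker_run_command(training_type: str) -> list[str]:
    """Build a `docker run` argument list for the given training type.

    Hypothetical helper: it only shows where the image and the extra
    environment variables would go, not a complete NIM launch command.
    """
    config = DEPLOYMENT[training_type]
    command = ["docker", "run", "--rm", "--gpus", "all"]
    # Pass each additional environment variable with docker's -e flag.
    for key, value in sorted(config["env"].items()):
        command += ["-e", f"{key}={value}"]
    command.append(config["image"])
    return command


if __name__ == "__main__":
    for training_type in DEPLOYMENT:
        print(training_type, "->", " ".join(docker_run_command(training_type)))
```

Keeping the image and environment variables in one structure makes the difference between the two deployment types explicit: both use the same image, and only the Full SFT entry carries the extra variable.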