Phi Models

This page provides detailed technical specifications for the Phi model family supported by NeMo Customizer. For information about supported features and capabilities, refer to Tested Models.

Microsoft Phi-4

| Property | Value |
|---|---|
| Creator | Microsoft |
| Architecture | Decoder-only Transformer |
| Description | Phi-4 is Microsoft’s most advanced small language model, designed to deliver strong reasoning capabilities while being efficient to deploy. |
| Max I/O Tokens | 16K |
| Parameters | 14 billion |
| Training Data | High-quality data with emphasis on reasoning and code |
| Recommended GPUs for Customization | 2 |
| Default Name | microsoft/phi-4 |
| Version | nvidia/nemo/phi-4:1.0 |
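
The specifications above can be captured in code for a quick sanity check before a customization job is prepared. The following is a minimal sketch, not part of the NeMo Customizer API: the `ModelSpec` class, its field names, and the `fits_token_budget` helper are hypothetical names introduced for illustration. It assumes that "16K" means 16 × 1024 = 16,384 tokens and that the Max I/O Tokens limit applies to input and output tokens combined; neither assumption is stated on this page.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelSpec:
    """Hypothetical container for the values listed in the table above."""

    default_name: str
    version: str
    parameters: int
    max_io_tokens: int
    recommended_gpus: int


# Assumption: "16K" is read as 16 * 1024 = 16,384 tokens.
PHI_4 = ModelSpec(
    default_name="microsoft/phi-4",
    version="nvidia/nemo/phi-4:1.0",
    parameters=14_000_000_000,
    max_io_tokens=16 * 1024,
    recommended_gpus=2,
)


def fits_token_budget(spec: ModelSpec, input_tokens: int, output_tokens: int) -> bool:
    """Return True if input plus output tokens stay within the model's limit.

    Assumption: the Max I/O Tokens value bounds the combined token count.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    return input_tokens + output_tokens <= spec.max_io_tokens


if __name__ == "__main__":
    # Example: a 12,000-token prompt with a 4,000-token completion budget.
    print(fits_token_budget(PHI_4, input_tokens=12_000, output_tokens=4_000))
```

A check like this is only as accurate as the token counts passed in, which should come from the model's own tokenizer.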