Supported Models#
Megatron Core supports a wide range of language and multimodal models with optimized implementations for large-scale training.
Model Conversion#
For converting HuggingFace models to Megatron format, use Megatron Bridge, the official standalone converter. Megatron Bridge supports an extensive list of models including LLaMA, Mistral, Mixtral, Qwen, DeepSeek, Gemma, Phi, Nemotron, and many more.
See the Megatron Bridge supported models list for the complete and up-to-date list of supported models.