Core APIs

Low-level API reference for core Megatron components.

  • transformer package
  • tensor_parallel package
  • pipeline_parallel package
  • fusions package
  • distributed package
  • datasets package
  • Data Pipeline
    • Data pre-processing
    • Data loading: construction
    • Data loading: implementation
  • dist_checkpointing package
    • Safe Checkpoint Loading
    • Checkpointing Distributed Optimizer
    • Subpackages
  • dist_checkpointing.strategies package
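
As a rough orientation (not part of this reference page), the packages listed above are consumed as submodules of megatron.core. The sketch below shows one plausible way they fit together; it assumes a recent Megatron-LM checkout, that torch.distributed has already been initialized (for example via torchrun), and that module paths may shift between releases. The specific hyperparameter values are illustrative only.

    # Hedged sketch: wiring together a few of the core packages listed above.
    from megatron.core import parallel_state, tensor_parallel
    from megatron.core.transformer.transformer_config import TransformerConfig

    # parallel_state sets up the tensor/pipeline/data parallel process groups
    # that the tensor_parallel and pipeline_parallel layers rely on.
    parallel_state.initialize_model_parallel(
        tensor_model_parallel_size=2,
        pipeline_model_parallel_size=1,
    )

    # Seed the per-rank CUDA RNG state used by tensor-parallel layers.
    tensor_parallel.model_parallel_cuda_manual_seed(123)

    # TransformerConfig (transformer package) carries the architecture
    # hyperparameters consumed by the model classes under megatron.core.models.
    config = TransformerConfig(
        num_layers=12,
        hidden_size=768,
        num_attention_heads=12,
    )

The remaining packages follow the same pattern: dist_checkpointing handles saving and loading sharded state dicts, and datasets provides the blended/indexed dataset builders referenced in the Data Pipeline section above.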
