core.post_training.modelopt.mamba.model_specs#

Module Contents#

Functions#

get_mamba_stack_modelopt_spec

Get the Mamba stack spec for ModelOpt PTQ and TensorRT-LLM export.

_get_mamba_stack_local_spec

Get the Mamba stack spec with local (non-TE) modules.

API#

core.post_training.modelopt.mamba.model_specs.get_mamba_stack_modelopt_spec(
local_core_attention: bool = False,
remap_te_layernorm: bool = False,
use_default_te_spec: bool = False,
) → megatron.core.transformer.spec_utils.ModuleSpec#

Get the Mamba stack spec for ModelOpt PTQ and TensorRT-LLM export.

When use_default_te_spec=False (the default), this returns the native local spec, except that the layernorm implementation uses TENorm from Transformer-Engine (FusedLayerNorm from Apex no longer supports the RMSNorm required by Llama). The remap_te_layernorm flag adds sharded state_dict key remapping so checkpoints can be saved and loaded in a TE-compatible layout.

When use_default_te_spec=True, this returns the standard mamba_stack_spec from mamba_layer_specs.py, which uses full TE modules (TELayerNormColumnParallelLinear, TERowParallelLinear, TEDotProductAttention, TENorm, moe_grouped_gemm=True).

Parameters:
  • local_core_attention – whether to use local DotProductAttention (only for use_default_te_spec=False)

  • remap_te_layernorm – whether to perform sharded state_dict prefix mapping on layernorm (only for use_default_te_spec=False)

  • use_default_te_spec – whether to use the default Transformer-Engine spec
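
A minimal usage sketch of the flag combinations described above. The import path assumes this module lives under the megatron package, and that Megatron-Core and its dependencies are installed in your environment; no expected output is shown since the returned ModuleSpec contents depend on your installation.

```python
# Hedged usage sketch; assumes megatron-core is installed and that this
# module is importable under the megatron package prefix.
from megatron.core.post_training.modelopt.mamba.model_specs import (
    get_mamba_stack_modelopt_spec,
)

# Default: native local spec with TENorm, plus sharded state_dict key
# remapping so TE-style checkpoints can be saved/loaded.
local_spec = get_mamba_stack_modelopt_spec(remap_te_layernorm=True)

# Full Transformer-Engine spec (the standard mamba_stack_spec from
# mamba_layer_specs.py); the local-only flags are ignored in this mode.
te_spec = get_mamba_stack_modelopt_spec(use_default_te_spec=True)
```

Use the default (local) spec when preparing a model for ModelOpt PTQ and TensorRT-LLM export; the full TE spec matches standard Megatron training configurations.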

core.post_training.modelopt.mamba.model_specs._get_mamba_stack_local_spec(
local_core_attention: bool = False,
remap_te_layernorm: bool = False,
) → megatron.core.transformer.spec_utils.ModuleSpec#

Get the Mamba stack spec with local (non-TE) modules.

This is essentially the native local spec, except that the layernorm implementation uses TENorm from Transformer-Engine.