NVIDIA NeMo Framework Developer Docs

NVIDIA Docs Hub NVIDIA NeMo Framework User Guide NVIDIA NeMo Framework Developer Docs

NVIDIA NeMo Framework is an end-to-end, cloud-native framework designed to build, customize, and deploy generative AI models anywhere.

NVIDIA NeMo Framework supports large-scale training features, including:

Mixed Precision Training
Parallelism
Distributed Optimizer
Fully Sharded Data Parallel (FSDP)
Flash Attention
Activation Recomputation
Positional Embeddings and Positional Interpolation
Post-Training Quantization (PTQ) with Ammo
Sequence Packing

NVIDIA NeMo Framework has separate collections for:

Each collection consists of prebuilt modules that include everything needed to train on your data. Every module can easily be customized, extended, and composed to create new generative AI model architectures.

For quick guides and tutorials, see the “Getting started” section below.

Getting Started

For more information, browse the developer docs for your area of interest in the contents section below or on the left sidebar.

Key Optimizations

Community Model Converters

Community Checkpoint Converter

APIs

NeMo APIs

Collections

NeMo Collections

Speech AI Tools

Speech AI Tools

Previous Libraries

Next Introduction