Skip to main content
Ctrl+K
NeMo-AutoModel - Home NeMo-AutoModel - Home

NeMo-AutoModel

  • GitHub
NeMo-AutoModel - Home NeMo-AutoModel - Home

NeMo-AutoModel

  • GitHub

Table of Contents

Get Started

  • About NeMo AutoModel
  • Key Features and Concepts
  • Install NeMo AutoModel
  • YAML Configuration
  • 🤗 Transformers API Compatibility
  • Repository Structure

Announcements

  • Announcements

NeMo AutoModel Performance

  • Performance Summary

Model Coverage

  • Model Coverage Overview
  • Model Release Log
  • Large Language Models (LLMs)
    • Llama
    • Gemma
    • Qwen2
    • Qwen2 MoE
    • Qwen3
    • Qwen3 MoE
    • Qwen3-Next
    • DeepSeek
    • DeepSeek-V3
    • Mistral
    • Mixtral
    • Ministral3 / Devstral
    • Phi
    • Phi-3 / Phi-4
    • Phi-3-Small
    • Nemotron / Minitron
    • Nemotron-H
    • Nemotron-Flash
    • Nemotron-Super (Llama-3.3-Nemotron-Super-49B)
    • ChatGLM
    • GLM-4
    • GLM-4 MoE (GLM-4.5 / GLM-4.7)
    • GLM-5 / GLM-5.1 (MoE + DSA)
    • Granite
    • Granite MoE
    • Bamba
    • OLMo
    • OLMo2
    • OLMoE
    • GPT-OSS
    • GPT-J
    • GPT-NeoX / Pythia
    • StarCoder
    • StarCoder2
    • Aquila / Aquila2
    • Baichuan / Baichuan2
    • Command-R
    • Falcon
    • EXAONE
    • InternLM
    • Jais
    • MiniMax-M2
    • MiniCPM
    • Moonlight
    • Seed (ByteDance)
    • Solar Pro
    • Orion
    • StableLM
    • Step-3.5
    • GritLM
  • Vision Language Models (VLMs)
    • Kimi-VL
    • Gemma 3 VL / Gemma 3n
    • Qwen2.5-VL
    • Qwen3-VL / Qwen3-VL-MoE
    • Qwen3.5-VL
    • Nemotron-Parse
    • Ministral3 VL
    • Mistral-Small-4
    • InternVL
    • Llama 4
    • SmolVLM
    • LLaVA
  • Omni Models
    • Qwen3-Omni
    • Phi-4-multimodal
  • Diffusion Models
    • Wan 2.1 T2V
    • FLUX.1-dev
    • HunyuanVideo 1.5

Recipes & E2E Examples

  • Recipes and End-to-End Examples
  • Supervised Fine-Tuning (SFT) and Parameter-Efficient Fine-Tuning (PEFT) with NeMo AutoModel
  • Function Calling with NeMo AutoModel using FunctionGemma
  • Knowledge Distillation with NeMo AutoModel
  • Fine-Tune Large MoE LLMs
  • Pretraining Megatron Core Datasets with NeMo AutoModel
  • LLM Pre-Training with NeMo AutoModel
  • Sequence Classification (SFT/PEFT) with NeMo AutoModel
  • Fine-Tune Gemma 3 and Gemma 3n
  • Fine-Tuning Gemma 4 31B on CORD-v2 Receipts — End-to-End Guide
  • Fine-Tune Qwen3.5-VL
  • Diffusion Model Fine-Tuning with NeMo AutoModel
  • Quantization-Aware Training (QAT) in NeMo Automodel
  • Model Training on Databricks

Datasets

  • Dataset Overview: LLM, VLM, and Retrieval Datasets in NeMo AutoModel
  • Integrate Your Own Text Dataset
  • Retrieval Dataset (Embedding Fine-tuning)
  • Use the ColumnMappedTextInstructionDataset
  • Use the ColumnMappedTextInstructionIterableDataset (Streaming)
  • Integrate Your Own Multi-Modal Dataset
  • Diffusion Dataset Preparation

Job Launchers

  • Job Launchers
  • Run on Your Local Workstation
  • Run on a Cluster
  • Run with NeMo-Run
  • Run on Any Cloud with SkyPilot

Development

  • Checkpointing in NeMo Automodel
  • Gradient (Activation) Checkpointing in NeMo AutoModel
  • Pipeline Parallelism with AutoPipeline
  • FP8 Training in NeMo AutoModel
  • MLflow Logging in NeMo AutoModel
  • API Reference
    • nemo_automodel
      • nemo_automodel.autonvtx
      • nemo_automodel.components
        • nemo_automodel.components.attention
        • nemo_automodel.components.loss
        • nemo_automodel.components.utils
        • nemo_automodel.components.checkpoint
        • nemo_automodel.components.flow_matching
        • nemo_automodel.components.optim
        • nemo_automodel.components.config
        • nemo_automodel.components.launcher
        • nemo_automodel.components.distributed
        • nemo_automodel.components.moe
        • nemo_automodel.components.loggers
        • nemo_automodel.components.training
        • nemo_automodel.components._peft
        • nemo_automodel.components.models
        • nemo_automodel.components.datasets
        • nemo_automodel.components.quantization
      • nemo_automodel._transformers
        • nemo_automodel._transformers.capabilities
        • nemo_automodel._transformers.mfu
        • nemo_automodel._transformers.auto_model
        • nemo_automodel._transformers.utils
        • nemo_automodel._transformers.retrieval
        • nemo_automodel._transformers.registry
        • nemo_automodel._transformers.auto_tokenizer
        • nemo_automodel._transformers.kernel_patches
        • nemo_automodel._transformers.model_init
        • nemo_automodel._transformers.infrastructure
      • nemo_automodel.shared
        • nemo_automodel.shared.utils
        • nemo_automodel.shared.te_patches
        • nemo_automodel.shared.transformers_patches
        • nemo_automodel.shared.torch_patches
        • nemo_automodel.shared.import_utils
      • nemo_automodel.cli
        • nemo_automodel.cli.utils
        • nemo_automodel.cli.app
      • nemo_automodel._diffusers
        • nemo_automodel._diffusers.auto_diffusion_pipeline
  • API Reference
  • nemo_automodel
  • nemo_automodel.cli

nemo_automodel.cli#

Submodules#

  • nemo_automodel.cli.utils
  • nemo_automodel.cli.app

previous

nemo_automodel.shared.import_utils

next

nemo_automodel.cli.utils

On this page
  • Submodules
NVIDIA NVIDIA
Privacy Policy | Manage My Privacy | Do Not Sell or Share My Data | Terms of Service | Accessibility | Corporate Policies | Product Security | Contact

Copyright © 2026, NVIDIA Corporation.

Last updated on 2026-04-08.