Skip to main content
Ctrl+K
NeMo-AutoModel - Home NeMo-AutoModel - Home

NeMo-AutoModel

  • GitHub
NeMo-AutoModel - Home NeMo-AutoModel - Home

NeMo-AutoModel

  • GitHub

Table of Contents

Get Started

  • About NeMo AutoModel
  • Key Features and Concepts
  • Install NeMo AutoModel
  • YAML Configuration
  • 🤗 Transformers API Compatibility
  • Run on Your Local Workstation
  • Run on a Cluster
  • Repository Structure

Announcements

  • Announcements

NeMo AutoModel Performance

  • Performance Summary

Model Coverage

  • Model Coverage Overview
  • Large Language Models (LLMs)
  • Vision Language Models (VLMs)
  • Troubleshooting Unsupported Models

Recipes & E2E Examples

  • Recipes and End-to-End Examples
  • Supervised Fine-Tuning (SFT) and Parameter-Efficient Fine-Tuning (PEFT) with NeMo Automodel
  • Function Calling with NeMo Automodel using FunctionGemma
  • Knowledge Distillation with NeMo-AutoModel
  • Pretraining using Megatron Core Datasets with NeMo Automodel
  • LLM Pre-Training with NeMo Automodel
  • Sequence Classification (SFT/PEFT) with NeMo Automodel
  • Fine-Tune Gemma 3 and Gemma 3n
  • Fine-Tune Qwen3.5-VL
  • Quantization-Aware Training (QAT) in NeMo Automodel
  • Model Training on Databricks

Datasets

  • Dataset Overview: LLM, VLM, and Retrieval Datasets in NeMo Automodel
  • Integrate Your Own Text Dataset
  • Biencoder Retrieval Dataset (Embedding Fine-tuning)
  • Use the ColumnMappedTextInstructionDataset
  • Use the ColumnMappedTextInstructionIterableDataset (Streaming)
  • Integrate Your Own Multi-Modal Dataset

Development

  • Checkpointing in NeMo Automodel
  • 🚀 Gradient (Activation) Checkpointing in NeMo-AutoModel
  • Pipeline Parallelism with AutoPipeline
  • FP8 Training in NeMo Automodel
  • MLflow Logging in NeMo Automodel
  • API Reference
    • nemo_automodel
      • nemo_automodel.components
        • nemo_automodel.components.loss
        • nemo_automodel.components.training
        • nemo_automodel.components.datasets
        • nemo_automodel.components._peft
        • nemo_automodel.components.config
        • nemo_automodel.components.distributed
        • nemo_automodel.components.utils
        • nemo_automodel.components.moe
        • nemo_automodel.components.loggers
        • nemo_automodel.components.optim
        • nemo_automodel.components.checkpoint
        • nemo_automodel.components.launcher
        • nemo_automodel.components.attention
        • nemo_automodel.components.quantization
        • nemo_automodel.components.models
      • nemo_automodel._diffusers
        • nemo_automodel._diffusers.auto_diffusion_pipeline
      • nemo_automodel.autonvtx
      • nemo_automodel._transformers
        • nemo_automodel._transformers.auto_model
        • nemo_automodel._transformers.biencoder
        • nemo_automodel._transformers.utils
        • nemo_automodel._transformers.model_init
        • nemo_automodel._transformers.auto_tokenizer
        • nemo_automodel._transformers.infrastructure
        • nemo_automodel._transformers.registry
        • nemo_automodel._transformers.kernel_patches
      • nemo_automodel.shared
        • nemo_automodel.shared.utils
        • nemo_automodel.shared.import_utils
        • nemo_automodel.shared.torch_patches
  • API Reference
  • nemo_automodel
  • nemo_automodel.components
  • nemo_automodel.components.models
  • nemo_automodel.components.models.deepseek_v3

nemo_automodel.components.models.deepseek_v3#

Submodules#

  • nemo_automodel.components.models.deepseek_v3.model
  • nemo_automodel.components.models.deepseek_v3.layers
  • nemo_automodel.components.models.deepseek_v3.rope_utils
  • nemo_automodel.components.models.deepseek_v3.state_dict_adapter

previous

nemo_automodel.components.models.llama.state_dict_adapter

next

nemo_automodel.components.models.deepseek_v3.model

On this page
  • Submodules
NVIDIA NVIDIA
Privacy Policy | Manage My Privacy | Do Not Sell or Share My Data | Terms of Service | Accessibility | Corporate Policies | Product Security | Contact

Copyright © 2026, NVIDIA Corporation.

Last updated on 2026-03-01.