Model Training with Nemotron Steps#
This section documents how to run supervised fine-tuning (SFT), parameter-efficient fine-tuning (PEFT), reinforcement learning (RL) alignment, and post-training optimization with Nemotron steps.
Each step packages a training approach, configuration files, and entry logic that you invoke through the nemotron steps CLI.
For the definitions of step, configuration, and environment profile that apply across every domain, see Nemotron Steps Basics.
If you are new to fine-tuning, start with Training Basics, which defines the training-specific terms the rest of this section uses.
Capabilities at a Glance#
Area |
Step Names |
Role |
|---|---|---|
SFT |
|
Supervised fine tuning from chat-formatted JSON Lines (JSONL) or packed Apache Parquet |
PEFT |
|
Adapter training with a smaller trainable surface |
RL |
|
Alignment after a supervised fine tuning (SFT) policy exists |
Optimize |
|
Compression and quality recovery |
Limitations and Restrictions#
The Nemotron steps for data preparation and model training do not support local training, such as on a developer workstation.
These steps require access to at least two nodes, each equipped with 8 x NVIDIA A100 80 GB or better GPUs. These steps support the following environments:
Slurm
NVIDIA DGX Cloud Lepton
NVIDIA Run:ai
For assistance with configuring access to one of the supported computing environments, refer to Env Profile Generator or run the nemotron-env-toml skill with your agent.
Learning Path#
This page defines supervised fine-tuning, parameter-efficient fine-tuning, reinforcement learning alignment, quantization, tokenizers, the chat dataset format, and checkpoints.
This guide covers installation expectations, the environment profile, and how to run your first sample job with the tiny configuration.
These guides are task-focused. They explain how to pick a backend, wire data, and run optimization steps.
This material explains the basics, the artifact graph, and how the training libraries differ.
This material is for lookup. It lists the step catalog, parameters, and configuration conventions.
Use the customize skill with a YAML-first plan: repo steps, then configs, then code only for gaps.
Quick Links#
Model Training with Agents describes how to work with the
nemotron-customizeskill for multi-stage training plans and YAML-first deliverables.Getting Started describes how to verify the CLI and run a tiny configuration.
Execution through NeMo Run describes profiles, attached and detached runs, and clusters.
Nemotron CLI Overview describes how the wider CLI relates to configuration and overrides.