Training Reference

This section is the lookup reference for every supervised fine-tuning (SFT), parameter-efficient fine-tuning (PEFT), reinforcement learning (RL), model optimization, and checkpoint conversion step packaged under src/nemotron/steps/. Use these pages to find the exact CLI syntax, configuration file layout, parameters, and supported overrides for each step.

Reference Sections

Nemotron Steps CLI Reference

Shared command-line syntax, options, dotlist overrides, and passthrough arguments for the nemotron steps command group.

Env Profile Generator

The packaged env/env_toml step that generates the environment profile file every other training step consumes.

Step Catalog

Every step identifier, manifest path, and per-step reference link.

Configuration Conventions

Per-step config/ layout, CLI configuration resolution, dotlist override rules, and environment-variable expansion.


Per-Category Step References

Supervised Fine-Tuning Steps

The sft/automodel and sft/megatron_bridge references.

Parameter-Efficient Fine-Tuning Steps

The peft/automodel and peft/megatron_bridge references.

Reinforcement Learning Steps

The rl/nemo_rl/dpo, rl/nemo_rl/rlvr, and rl/nemo_rl/rlhf references.

Model Optimization Steps

The optimize/modelopt/quantize, optimize/modelopt/prune, and optimize/modelopt/distill references.

Checkpoint Conversion Steps

The convert/hf_to_megatron, convert/megatron_to_hf, and convert/merge_lora references.
