Training Reference
This section provides lookup material for every supervised fine-tuning (SFT), parameter-efficient fine-tuning (PEFT), reinforcement learning (RL), optimization, and conversion step packaged under src/nemotron/steps/.
Use these pages to find the exact CLI syntax, configuration file layout, parameters, and configuration overrides for each step.
Reference Sections
Shared command-line syntax, options, dotlist overrides, and passthrough arguments for the nemotron steps command group.
The packaged env/env_toml step that generates the environment profile file every other training step consumes.
Every step identifier, manifest path, and per-step reference link.
Per-step config/ layout, CLI configuration resolution, dotlist override rules, and environment-variable expansion.
Per-Category Step References
The sft/automodel and sft/megatron_bridge references.
The peft/automodel and peft/megatron_bridge references.
The rl/nemo_rl/dpo, rl/nemo_rl/rlvr, and rl/nemo_rl/rlhf references.
The optimize/modelopt/quantize, optimize/modelopt/prune, and optimize/modelopt/distill references.
The convert/hf_to_megatron, convert/megatron_to_hf, and convert/merge_lora references.
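The step identifiers named on this page follow a category/... naming pattern, so they can be grouped by their first path component. The snippet below is a minimal sketch of that grouping, not part of the nemotron package: the STEP_IDS list is copied by hand from the identifiers above, and group_by_category is a hypothetical helper introduced only for illustration.

```python
from collections import defaultdict

# Step identifiers copied by hand from this page (not read from the package).
STEP_IDS = [
    "env/env_toml",
    "sft/automodel",
    "sft/megatron_bridge",
    "peft/automodel",
    "peft/megatron_bridge",
    "rl/nemo_rl/dpo",
    "rl/nemo_rl/rlvr",
    "rl/nemo_rl/rlhf",
    "optimize/modelopt/quantize",
    "optimize/modelopt/prune",
    "optimize/modelopt/distill",
    "convert/hf_to_megatron",
    "convert/megatron_to_hf",
    "convert/merge_lora",
]


def group_by_category(step_ids):
    """Group step identifiers by the text before the first '/'.

    Hypothetical helper for illustration; it only splits strings and
    does not inspect src/nemotron/steps/ on disk.
    """
    groups = defaultdict(list)
    for step_id in step_ids:
        category, _, _ = step_id.partition("/")
        groups[category].append(step_id)
    return dict(groups)


if __name__ == "__main__":
    for category, ids in group_by_category(STEP_IDS).items():
        print(f"{category}: {', '.join(ids)}")
```

The grouping mirrors the per-category layout of the references above; the authoritative list of identifiers and manifest paths remains the step catalog page.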