Model Training How-To Guides#
This section provides task-focused guides for common training workflows. For your first run, start with Getting Started with Training Steps.
If you are new to fine-tuning concepts, read Training Concepts for definitions of fine-tuning approaches, data formats, and checkpoints before working through these tasks.
Configure tiny.yaml for your own JSONL data, run the step, verify the checkpoint output, and resolve common issues.
Pick between sft/automodel and sft/megatron_bridge based on checkpoint format and scale.
Pick between peft/automodel and peft/megatron_bridge based on base checkpoint format and data path.
Pick between DPO, RLVR, and RLHF based on how the reward signal enters training.
Apply quantization, pruning, or distillation with Model Optimizer after a trained model passes your quality bar.
Configure NeMo Run environment profiles for local, Slurm, and Lepton execution.
Understand the JSONL, Parquet, and checkpoint types that training steps declare in step.toml.
Run a conversion step when one step produces a checkpoint layout that the next step cannot consume directly.