Model Training How-To Guides#

This section provides task-focused guides for common training workflows. For your first run, start with Getting Started with Training Steps.

If you are new to fine-tuning concepts, read Training Concepts for definitions of fine-tuning approaches, data formats, and checkpoints before working through these tasks.

Run SFT with AutoModel on Custom Data

Configure tiny.yaml for your own JSONL data, run the step, verify the checkpoint output, and resolve common issues.

Run SFT with AutoModel on Custom Data
Choose an SFT Backend

Pick between sft/automodel and sft/megatron_bridge based on checkpoint format and scale.

Choose an SFT Backend
Choose a PEFT Backend

Pick between peft/automodel and peft/megatron_bridge based on base checkpoint format and data path.

Choose a PEFT Backend
Choose an RL Alignment Step

Pick between DPO, RLVR, and RLHF based on how the reward signal enters training.

Choose an RL Alignment Step
Run Post-Training Optimization

Apply quantization, pruning, or distillation with Model Optimizer after a trained model passes your quality bar.

Run Post-Training Optimization
Environment Profiles and Executors

Configure NeMo Run environment profiles for local, Slurm, and Lepton execution.

Environment Profiles and Executors
Data and Checkpoint Formats

Understand the JSONL, Parquet, and checkpoint types that training steps declare in step.toml.

Data and Checkpoint Formats
Convert Checkpoints Between Training Steps

Run a conversion step when one step produces a checkpoint layout that the next step cannot consume directly.

Convert Checkpoints Between Training Steps