Training Tutorials
These hands-on tutorials show how to train with NeMo Gym environments using each supported training framework. To integrate another training framework, see the Training Framework Integration Guide.
RL (GRPO)
NeMo RL
Tutorial series: GRPO training to improve multi-step tool calling on the Workplace Assistant environment, scaling from single-node to multi-node training.
OpenRLHF
Review the agent executor for using NeMo Gym environments with OpenRLHF.
TRL
GRPO training on the Workplace Assistant and Reasoning Gym environments.
Unsloth
GRPO training on instruction following and reasoning environments.
NeMo Customizer
Coming soon
VeRL
Coming soon
SFT & DPO
Offline Training with Rollouts
Transform rollouts into training data for supervised fine-tuning (SFT) and direct preference optimization (DPO).
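As a rough illustration of what such a transformation can look like, the sketch below turns scored rollouts into SFT and DPO records. It is not the NeMo Gym implementation: the rollout fields (`prompt`, `response`, `reward`), the helper names `to_sft` and `to_dpo`, and the `min_reward` threshold are all hypothetical, chosen only to show the general idea. See the linked tutorial for the actual data format.

```python
from collections import defaultdict

# Hypothetical rollout records; these field names are assumptions for illustration,
# not the NeMo Gym rollout schema.
rollouts = [
    {"prompt": "What is 2 + 2?", "response": "4", "reward": 1.0},
    {"prompt": "What is 2 + 2?", "response": "5", "reward": 0.0},
    {"prompt": "Name a prime.", "response": "7", "reward": 1.0},
]


def to_sft(rollouts, min_reward=1.0):
    """Keep only rollouts at or above min_reward as prompt/completion pairs."""
    return [
        {"prompt": r["prompt"], "completion": r["response"]}
        for r in rollouts
        if r["reward"] >= min_reward
    ]


def to_dpo(rollouts):
    """Pair the best and worst rollout per prompt when their rewards differ."""
    by_prompt = defaultdict(list)
    for r in rollouts:
        by_prompt[r["prompt"]].append(r)

    pairs = []
    for prompt, group in by_prompt.items():
        best = max(group, key=lambda r: r["reward"])
        worst = min(group, key=lambda r: r["reward"])
        # Skip prompts whose rollouts all scored the same: no preference signal.
        if best["reward"] > worst["reward"]:
            pairs.append(
                {
                    "prompt": prompt,
                    "chosen": best["response"],
                    "rejected": worst["response"],
                }
            )
    return pairs


sft_data = to_sft(rollouts)
dpo_data = to_dpo(rollouts)
```

In this sketch, SFT keeps only successful rollouts as demonstrations, while DPO needs at least two rollouts of differing quality for the same prompt, so prompts with a single rollout contribute no preference pair.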