Overview
The hands-on tutorials below show how to train with NeMo Gym environments using each supported training framework. To integrate another training framework, see the Training Framework Integration Guide.
See Training Approaches for a refresher on when to use GRPO, SFT, or DPO.
RL (GRPO)
NeMo RL
Tutorial series: GRPO training to improve multi-step tool calling on the Workplace Assistant environment, scaling from single-node to multi-node training.
Tags: nemo rl, grpo, 3-5 hours
OpenRLHF
Review the agent executor for using NeMo Gym environments with OpenRLHF.
Tags: openrlhf
Unsloth
GRPO training on instruction following and reasoning environments.
Tags: unsloth, single-gpu, 30 min
NeMo Customizer
Coming soon
Tags: nemo customizer, in progress
VeRL
Coming soon
Tags: verl, in progress