Before running GRPO training, you need to configure how NeMo RL connects to NeMo Gym. The training config file contains Gym-specific parameters that control data loading, environment interaction, and validation.
Goal: Understand the Gym configuration parameters for RL training.
Time: ~10 minutes (read)
In this section, you will learn:
The full training configuration file is located at:
There are two Gym-specific sections in the NeMo RL training config: data and env.
The vllm_model_for_training.yaml config is required for NeMo RL training integration.
With the Gym configuration understood, learn about the GRPO training parameters:
Continue to NeMo RL Configuration →