Gym Configuration#
Before running GRPO training, you need to configure how NeMo RL connects to NeMo Gym. The training config file contains Gym-specific parameters that control data loading, environment interaction, and validation.
Goal: Understand the Gym configuration parameters for RL training.
In this section, you will learn:
How to configure data paths for training and validation
How to enable and configure NeMo Gym in NeMo RL
← Previous: About Workplace Assistant
Configuration File Location#
The full training configuration file is located at:
examples/nemo_gym/grpo_workplace_assistant_nemotron_nano_v2_9b.yaml
Gym Configuration Sections#
There are two Gym-specific sections in the NeMo RL training config: data and env.
Data Section#
data:
train_jsonl_fpath: 3rdparty/Gym-workspace/Gym/data/workplace_assistant/train.jsonl
validation_jsonl_fpath: 3rdparty/Gym-workspace/Gym/data/workplace_assistant/validation.jsonl
Parameter |
Description |
|---|---|
|
Path to training dataset (prepared in Setup) |
|
Path to validation dataset |
Environment Section#
env:
should_use_nemo_gym: true
nemo_gym:
config_paths:
- responses_api_models/vllm_model/configs/vllm_model_for_training.yaml
- resources_servers/workplace_assistant/configs/workplace_assistant.yaml
workplace_assistant_simple_agent:
responses_api_agents:
simple_agent:
max_steps: 6
Parameter |
Description |
|---|---|
|
Set to |
|
Everything under this key is a Gym config |
|
Gym config files: vLLM model config and Workplace Assistant agent/resources config |
|
Maximum tool-calling steps per task (6 for Workplace Assistant) |
Important
The vllm_model_for_training.yaml config is required for NeMo RL training integration.