For AI agents: a documentation index is available at the root level at /llms.txt and /llms-full.txt. Append /llms.txt to any URL for a page-level index, or .md for the markdown version of any page.
DocumentationAPI Reference
DocumentationAPI Reference
  • Documentation
    • Home
  • About
    • Concepts
    • Ecosystem
  • Get Started
    • Quickstart
    • Detailed Setup Guide
    • Install from PyPI
    • Rollout Collection
  • Agent Server
  • Model Server
    • vLLM
  • Resources Server
  • Data
    • Prepare and Validate
    • Download from Hugging Face
    • Prompt Config
  • Environment Tutorials
    • Single-Step Environment
    • Multi-Step Environment
    • Stateful Environment
    • Real-World Environment
    • Integrate external libraries
    • Aggregate Metrics
    • LLM-as-Judge Verification
  • Benchmarks
    • Run benchmarks
    • Add a benchmark
    • Design a customer evaluation
  • Training Tutorials
    • NeMo RL
      • About Workplace Assistant
      • Gym Configuration
      • NeMo RL Configuration
      • Setup
      • Single Node Training
      • Multi-Node Training
    • Unsloth
    • Multi-Environment Training
    • Offline Training (SFT/DPO)
  • Model Recipes
    • Nemotron 3 Nano
    • Nemotron 3 Super
  • Infrastructure
    • Deployment Topology
    • Engineering Notes
  • Reference
    • Configuration
    • RL Framework Compatibility
    • CLI Commands
    • FAQ
  • Troubleshooting
    • Configuration Errors
  • Contribute
    • Development Setup
    • Environments
    • Integrate RL Frameworks
NVIDIANVIDIA
Developer-friendly docs for your API
Privacy Policy | Manage My Privacy | Do Not Sell or Share My Data | Terms of Service | Accessibility | Corporate Policies | Product Security | Contact

Copyright © 2026, NVIDIA Corporation.

LogoLogoNeMo Gym
On this page
  • Prerequisites
  • Configuration File Location
  • Gym Configuration Sections
  • Data Section
  • Environment Section
  • Next Steps
Training TutorialsNeMo RL

Gym Configuration

||View as Markdown|
Previous

About Workplace Assistant

Next

NeMo RL Configuration

Before running GRPO training, you need to configure how NeMo RL connects to NeMo Gym. The training config file contains Gym-specific parameters that control data loading, environment interaction, and validation.

Goal: Understand the Gym configuration parameters for RL training.

Time: ~10 minutes (read)

In this section, you will learn:

  1. How to configure data paths for training and validation
  2. How to enable and configure NeMo Gym in NeMo RL
← Previous: About Workplace Assistant

Prerequisites

  • Read About Workplace Assistant to understand the training environment

Configuration File Location

The full training configuration file is located at:

examples/nemo_gym/grpo_workplace_assistant_nemotron_nano_v2_9b.yaml

Gym Configuration Sections

There are two Gym-specific sections in the NeMo RL training config: data and env.

Data Section

1data:
2 train_jsonl_fpath: 3rdparty/Gym-workspace/Gym/data/workplace_assistant/train.jsonl
3 validation_jsonl_fpath: 3rdparty/Gym-workspace/Gym/data/workplace_assistant/validation.jsonl
ParameterDescription
train_jsonl_fpathPath to training dataset (prepared in Setup)
validation_jsonl_fpathPath to validation dataset

Environment Section

1env:
2 should_use_nemo_gym: true
3 nemo_gym:
4 config_paths:
5 - responses_api_models/vllm_model/configs/vllm_model_for_training.yaml
6 - resources_servers/workplace_assistant/configs/workplace_assistant.yaml
7 workplace_assistant_simple_agent:
8 responses_api_agents:
9 simple_agent:
10 max_steps: 6
ParameterDescription
should_use_nemo_gymSet to true to enable Gym
nemo_gymEverything under this key is a Gym config
nemo_gym.config_pathsGym config files: vLLM model config and Workplace Assistant agent/resources config
max_stepsMaximum tool-calling steps per task (6 for Workplace Assistant)

The vllm_model_for_training.yaml config is required for NeMo RL training integration.


Next Steps

With the Gym configuration understood, learn about the GRPO training parameters:

Continue to NeMo RL Configuration →