Integrate RL Frameworks

View as Markdown

These guides cover how to integrate NeMo Gym into a new RL training framework. Use them if you are:

  • A training framework maintainer adding NeMo Gym support
  • Contributing NeMo Gym integration for a training framework that does not have one yet

Just want to train models? See Training Tutorials for supported frameworks.

Prerequisites

Before integrating Gym into your training framework, ensure you have:

  • An RL training framework with policy optimization support (PPO, GRPO, or similar)
  • A generation backend (vLLM, SGLang, or equivalent)
  • Familiarity with OpenAI-compatible HTTP server APIs

Integration Components

Gym integration requires implementing the following components in your training framework:

Integration Workflow

The typical integration workflow follows this sequence:

StepComponentDescription
1Generation backendExpose your generation engine, such as vLLM or SGLang, as an OpenAI-compatible HTTP server
2On-policy correctionsImplement token ID fixes to prevent re-tokenization and re-templating issues
3Gym integrationConnect Gym to your training loop using the rollout orchestration APIs
4ValidationVerify integration using the success criteria benchmarks