RL Environment Ecosystem#

We’re building NeMo Gym to integrate with a broad set of RL training frameworks and environment libraries.

We would love your contribution! Open a PR to add an integration, or file an issue to share what would be valuable for you.


Training Framework Integrations#

We have hands-on tutorials with supported training frameworks to help you train with NeMo Gym environments. If you’re interested in integrating another training framework, see the Training Framework Integration Guide.

  • NeMo RL - GRPO training to improve multi-step tool calling on the Workplace Assistant environment

  • OpenRLHF - example agent executor for RL training

  • TRL - GRPO training on Workplace Assistant and Reasoning Gym environments

  • Unsloth - GRPO training on instruction following and reasoning environments

  • NeMo Customizer - (In progress)

  • VeRL - (In progress)


Environment Library Integrations#

NeMo Gym integrates with external environment libraries and benchmarks. See the README for the full list—here are a few examples:

  • Reasoning Gym - reasoning environments spanning computation, cognition, logic and more

  • Aviary - environments spanning math, knowledge, biological sequences, scientific literature search, and protein stability

  • Verifiers - (In progress) - environments spanning coding, data & ML, science & reasoning, tool use and more

  • BrowserGym - (In progress) - environments for web task automation