RL Environment Ecosystem#
We’re building NeMo Gym to integrate with a broad set of RL training frameworks and environment libraries.
We would love your contribution! Open a PR to add an integration, or file an issue to share what would be valuable for you.
Training Framework Integrations#
We have hands-on tutorials with supported training frameworks to help you train with NeMo Gym environments. If you’re interested in integrating another training framework, see the Training Framework Integration Guide.
NeMo RL - GRPO training to improve multi-step tool calling on the Workplace Assistant environment
OpenRLHF - example agent executor for RL training
TRL - GRPO training on Workplace Assistant and Reasoning Gym environments
Unsloth - GRPO training on instruction following and reasoning environments
NeMo Customizer - (In progress)
VeRL - (In progress)
Environment Library Integrations#
NeMo Gym integrates with external environment libraries and benchmarks. See the README for the full list—here are a few examples:
Reasoning Gym - reasoning environments spanning computation, cognition, logic and more
Aviary - environments spanning math, knowledge, biological sequences, scientific literature search, and protein stability
Verifiers - (In progress) - environments spanning coding, data & ML, science & reasoning, tool use and more
BrowserGym - (In progress) - environments for web task automation