NeMo Gym v0.3.0 ships alongside the NVIDIA Nemotron 3 Ultra model release, open sourcing the environments and corresponding datasets used during training.
Highlights:
We welcomed 30+ new contributors to this release! Here are a few highlights:
Thank you to all the new contributors for helping make NeMo Gym better!
Added 70+ new environments including novel datasets and integrations of popular benchmarks. New coverage spans:
See the Available Environments table for the full list.
max_concurrent_requests on the OpenAI model server to cap in-flight API calls — useful for rate-limited external endpoints when rollout concurrency is highng_aggregate_rollouts command to merge rollout shards collected independently across multiple nodes, enabling distributed eval without requiring a single coordinated collection jobdocs/ directory is no longer used for publishing.Fixed PyPI package distribution that was broken in v0.2.0. No functional changes — all features and fixes from v0.2.0 apply.
NeMo Gym v0.2.0 ships alongside the NVIDIA Nemotron 3 Super model release, open sourcing the RL environments and corresponding datasets used during training. This release adds 17 new training environments across coding, math, science, reasoning, agentic tasks, and safety, plus integrations with Aviary, Reasoning Gym, and Verifiers to combine additional environments. You can now run end-to-end rollout collection locally with vLLM and install directly from PyPI.
Added 17 new resources servers spanning:
Added 5 new agent servers: Aviary agent, proof refinement agent, SWE agents, tool simulation agent, and verifiers agent.
Environment library integrations: Future House Aviary, Open-Thought Reasoning Gym, Prime Intellect Verifiers.
ng_reward_profile command to compute per-task pass rates and aggregate metricspip install nemo-gymng_run +dryrun=true to validate configs and install environments without starting serversng_status command to list running servers and their healthresponses_create_params overrides not merging correctly from CLIng_prepare_data failing when multiple environments define overlapping metricsng_dump_config now redacts API key values from outputWe’d like to highlight the following first-time contributors:
Initial public release of NeMo Gym.