For AI agents: a documentation index is available at the root level at /llms.txt and /llms-full.txt. Append /llms.txt to any URL for a page-level index, or .md for the markdown version of any page.
DocumentationAPI Reference
DocumentationAPI Reference
  • About
    • Concepts
    • Architecture
    • Ecosystem
    • Release Notes
  • Get Started
    • Prerequisites
    • Installation
    • Quickstart
  • Agent Server
  • Model Server
    • vLLM
  • Resources Server
  • Data
    • Prepare and Validate
    • Download from Hugging Face
    • Prompt Config
  • Environment Tutorials
    • Single-Step Environment
    • Multi-Step Environment
    • Stateful Environment
    • Real-World Environment
    • Integrate external libraries
    • Add a benchmark
    • Verification Patterns
    • Aggregate Metrics
  • Training Tutorials
    • NeMo RL
    • Unsloth
    • Multi-Environment Training
    • Training with VeRL
    • Offline Training (SFT/DPO)
  • Model Recipes
    • Nemotron 3 Nano
    • Nemotron 3 Super
  • Infrastructure
    • Deployment Topology
    • Engineering Notes
  • Reference
    • Configuration
    • RL Framework Compatibility
    • CLI Commands
    • FAQ
  • Troubleshooting
    • Configuration Errors
  • Contribute
    • Development Setup
    • Environments
    • Integrate RL Frameworks
NVIDIANVIDIA
Developer-friendly docs for your API
Privacy Policy | Your Privacy Choices | Terms of Service | Accessibility | Corporate Policies | Product Security | Contact

Copyright © 2026, NVIDIA Corporation.

LogoLogoNeMo Gym
On this page
  • When to Use NeMo Gym
  • What NeMo Gym Provides
  • Integrations
  • Next Steps

NeMo Gym

||View as Markdown|
Next

Concepts

NeMo Gym is a library for evaluating and improving models and agents using environments. NeMo Gym provides infrastructure to develop environments, scalably run evaluation and training, and a collection of popular benchmarks and training environments.

When to Use NeMo Gym

  • You need to evaluate models or agents in stateful environments (for example, code execution, tool calling, sandboxes).
  • You want reproducible evaluation across teams using shared environments and verifiers.
  • You need to use environments at scale — multiple repeats per task, or thousands of concurrent requests for training.
  • You want to seamlessly transition between evaluation, agent optimization, and training.

If you are scoring model outputs with a stateless check and do not need scale or training, a script is probably sufficient.

What NeMo Gym Provides

  • Modular, extensible interfaces for agents, environments, tasks, and verifiers
  • Environment hub of popular benchmarks and training environments
  • Use your own agents or choose from built-in harnesses
  • Scale to thousands of concurrent environments
  • Train with the RL framework of your choice
  • Battle-tested in production Nemotron training

NeMo Gym Product Overview

Integrations

NeMo Gym integrates with the broader agentic ecosystem:

  • Environment libraries: Seamlessly combine environments and benchmarks from other libraries alongside NeMo Gym environments.
  • Training framework libraries: Use environments for SFT and RL training.
  • Agent harnesses: Popular agent harnesses for evaluation and training available out of the box.
  • Agent framework libraries: Use your custom agent built with agent frameworks in NeMo Gym environments.
  • Sandboxes: Isolate agent runtime execution.

Next Steps

Get Started

Install NeMo Gym and run your first evaluation.

Browse Environments

Browse available environments for evaluation and training.

Agents

Explore available agent harnesses and learn how to integrate your own agent.

Training

Improve your agent or model with RL or fine-tuning.

Build Custom Environments

Create your own evaluation or training environments.