Quickstart
Quickstart
See Installation if you need to install NeMo Gym.
Configure Your Model
Create an env.yaml file in the project root with your model endpoint credentials:
This quickstart uses OpenAI. NeMo Gym supports local and hosted inference — see Configure Model for vLLM, Fireworks, OpenRouter, and others.
Run Evaluation
Run your agent on a set of tasks and score the results. This example uses a simple tool calling agent simple_agent with the mcqa (multiple-choice Q&A) environment and its included example data.
1. Start servers
NeMo Gym uses local servers to coordinate your model, agent, and task verification. Start them first:
You should see three server instances starting:
2. Evaluate your agent
In a new terminal, run your agent on a single task to verify everything works:
You should see a progress bar followed by aggregate metrics:
For per-task pass rates, see ng_reward_profile in the CLI Reference.
Explore
Now that you have a working setup, explore what’s available.
NeMo Gym ships with environments across many domains. You can use these existing environments in addition to building your own.
This lists benchmarks with pre-configured agents. For the full set of environments (including training environments), see the Available Environments table.
Every CLI command supports +h=true or +help=true for detailed usage information: