# Evaluation Tutorials

> Run benchmark-specific evaluation workflows with NeMo Gym.

These hands-on walkthroughs cover running benchmarks, collecting rollouts, and reading the outputs. They assume familiarity with the evaluation concepts in [Evaluation](/about/concepts/evaluation) and the evaluation workflow in [Evaluation](/evaluation).

Run the EvalPlus coding benchmark and inspect rollout and aggregate metric outputs.

Browse the built-in benchmark and training environments.

Understand the aggregate metrics written after rollout collection.