About NVIDIA NeMo Gym#

NeMo Gym generates training data for reinforcement learning by capturing how AI agents interact with tools and environments.

Core Components#

Three components work together to generate and evaluate agent interactions:

  • Agents: Orchestrate multi-turn interactions between models and resources. Handle conversation flow, tool routing, and response formatting.

  • Models: LLM inference endpoints (OpenAI-compatible or vLLM). Handle single-turn text generation and tool-calling decisions.

  • Resources: Provide tools (functions agents call) + verifiers (logic to score performance). Examples: math environments, code sandboxes, web search.