Concepts

View as Markdown

NeMo Gym concepts explain the mental model behind building RL training environments: when to use RL over SFT, how environment components work together, and how verification signals drive learning. Use this page as a compass to decide which explanation to read next.

New to RL for LLMs? Start with training-approaches for context on SFT, RL, and RLVR, or refer to Key Terminology for a quick glossary.


Concept Highlights

Each explainer below covers one foundational idea and links to deeper material.