Configure Models
Configure your model with the inference backend of your choice.
NeMo Gym uses the Responses API as its native schema because it natively represents tool calls, multi-turn conversations, and structured outputs. NeMo Gym provides middleware to automatically convert other protocols, for example, Chat Completions, into Responses format.
For training, use a backend that returns token IDs and log probabilities (marked with training below).
Available Backends
Use OpenAI models.
cloudUse models hosted on Azure OpenAI deployments.
cloudUse hosted providers like Fireworks, Together.ai, OpenRouter, and more.
cloudConnect to a vLLM server you start and manage yourself.
self-hostedtrainingLet NeMo Gym launch and manage the vLLM server for you.
self-hostedgym-managedtrainingServe multiple request-time configs (e.g. reasoning on/off) from one Local vLLM deployment.
self-hostedgym-managedtraining