About Inference Routing | NVIDIA NemoClaw

NemoClaw gives agents one managed inference route while OpenShell handles the selected upstream provider on the host. This design keeps provider selection and credentials outside the sandbox.

Request Path

The agent sends inference requests to inference.local inside the sandbox. It does not connect to the upstream provider directly. OpenShell intercepts the request on the host and forwards it to the provider and model selected during onboarding.

Sandbox agent -> inference.local -> OpenShell -> selected provider and model

Host-side services such as Model Router and the OpenRouter runtime adapter remain behind the OpenShell route. The sandbox continues to use inference.local instead of calling their host ports directly.

Credential Boundary

Provider credentials stay on the host and flow through the OpenShell provider system. The sandbox does not receive the raw upstream API key.

Local Ollama and local vLLM routes do not require the host OPENAI_API_KEY. NemoClaw uses provider-specific local tokens for those routes. Rebuilds of legacy local-inference sandboxes migrate away from stale OpenAI credential requirements. When a rebuild reuses an automatically bridged compatible-endpoint route without a host API key, NemoClaw reapplies the config-only bridge rewrite without reading or passing the credential stored in OpenShell.

Choose an Inference Provider compares the upstream routes that OpenShell can use.
View the Active Inference Route shows the provider and model currently selected.
Verify the Sandbox Inference Route tests the configured path.

Request Path

Credential Boundary

Related Topics