Reasoning Parsing (Engine Fallback)
When Dynamo's registry does not list a reasoning parser for your model, fall back to the upstream engine's parser by swapping the chat processor. The swap keeps tokenization in the frontend and preserves KV routing.
For Dynamo-native parsers, see Reasoning Parsing (Dynamo). For the equivalent tool-call fallback, see Tool Call Parsing (Engine Fallback).
Known Issue: Engine-fallback reasoning parsing does not currently work with disaggregated serving. Use the Dynamo-native reasoning parser for disaggregated deployments.
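The choice between the two paths can be summarized as a small decision function. This is only an illustrative sketch of the guidance above, not Dynamo code: `ParserPath` and `choose_parser_path` are hypothetical names, and the two boolean inputs stand in for checks you would make by hand against the Dynamo registry and your deployment topology.

```python
from enum import Enum


class ParserPath(Enum):
    """Hypothetical labels for the two reasoning-parsing paths described on this page."""
    DYNAMO_NATIVE = "dynamo-native"
    ENGINE_FALLBACK = "engine-fallback"


def choose_parser_path(parser_in_dynamo_registry: bool, disaggregated: bool) -> ParserPath:
    """Pick a reasoning-parsing path following the guidance above.

    Both arguments are assumptions supplied by the caller; this sketch does
    not query Dynamo or the engine.
    """
    # Prefer the Dynamo-native parser whenever the registry lists one.
    if parser_in_dynamo_registry:
        return ParserPath.DYNAMO_NATIVE
    # Known issue: engine fallback does not currently work with disaggregated serving.
    if disaggregated:
        raise ValueError(
            "No Dynamo-native parser is listed, and engine-fallback reasoning "
            "parsing does not currently work with disaggregated serving."
        )
    return ParserPath.ENGINE_FALLBACK
```

For example, `choose_parser_path(False, False)` selects the engine fallback, while `choose_parser_path(False, True)` raises because neither path is currently available for that combination.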
Configurations
Upstream parser names come from the engine's registry and may differ from
Dynamo's name for the same model (e.g., vLLM's nemotron_v3 vs Dynamo's
nemotron3). Upstream names are pinned to the engine version shipped in the
Dynamo container, so check them against that version's registry.
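If you move between Dynamo-native and engine-fallback configurations, it can help to record the name differences explicitly. The sketch below is a hypothetical helper, not part of Dynamo or vLLM: the table contains only the one pair mentioned above, and any further entries would need to be verified against the engine version in your container.

```python
from typing import Optional

# Hypothetical lookup table. Only this pair comes from this page; verify any
# additions against the registry of the engine version in your container.
DYNAMO_TO_VLLM_PARSER = {
    "nemotron3": "nemotron_v3",
}


def upstream_parser_name(dynamo_name: str) -> Optional[str]:
    """Return the vLLM parser name recorded for a Dynamo parser name.

    Returns None when no mapping is recorded, so the caller is prompted to
    look the name up in the engine's registry rather than guess.
    """
    return DYNAMO_TO_VLLM_PARSER.get(dynamo_name)


if __name__ == "__main__":
    print(upstream_parser_name("nemotron3"))
```

Returning `None` for unknown names is a deliberate choice in this sketch: because upstream names may or may not match Dynamo's, silently reusing the Dynamo name could select the wrong parser.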
Examples
See Also
- Reasoning Parsing (Dynamo) — Dynamo-native parsers and common pairings
- Tool Call Parsing (Engine Fallback) — Equivalent fallback for tool-call parsers
- vLLM Chat Processor — vLLM chat-processor details
- SGLang Chat Processor — SGLang chat-processor details
- Frontend Configuration Reference — Full CLI flag reference