Tool Call Parsing (Engine Fallback)
Tool Call Parsing (Engine Fallback)
Use upstream vLLM or SGLang tool-call parsers when Dynamo does not ship one
When Dynamo’s registry does not list a tool-call parser for your model, fall back to the upstream engine’s parser via a chat-processor swap, which keeps frontend tokenization and KV routing.
For Dynamo-native parsers, see Tool Call Parsing (Dynamo). For the equivalent reasoning fallback, see Reasoning Parsing (Engine Fallback).
Known Issue: Engine-fallback tool call parsing does not currently work with disaggregated serving. Use the Dynamo-native tool call parser for disaggregated deployments.
Configurations
Upstream parser names come from the engine’s registry and may differ from
Dynamo’s name for the same model (e.g., SGLang’s deepseekv3 vs Dynamo’s
deepseek_v3). They are pinned to the engine version shipped in the Dynamo
container.
Examples
See Also
- Tool Call Parsing (Dynamo) — Dynamo-native parsers and request examples
- Reasoning Parsing (Engine Fallback) — Equivalent fallback for reasoning
- vLLM Chat Processor — vLLM chat-processor details
- SGLang Chat Processor — SGLang chat-processor details
- Frontend Configuration Reference — Full CLI flag reference