For AI agents: a documentation index is available at the root level at /llms.txt and /llms-full.txt. Append /llms.txt to any URL for a page-level index, or .md for the markdown version of any page.
Digest
  • Getting Started
    • Quickstart
    • Introduction
    • Local Installation
    • Building from Source
    • Kubernetes Deployment
    • Contribution Guide
  • Resources
    • Support Matrix
    • Feature Matrix
    • Release Artifacts
    • Examples
    • Glossary
  • Digest
    • Dynamo Day 0 support for TokenSpeed
    • Multi-Turn Agentic Harnesses
    • Full-Stack Optimizations for Agentic Inference
    • Flash Indexer: Inter-Galactic KV Routing
  • Kubernetes Deployment
  • User Guides
    • Disaggregated Serving
    • KV Cache Aware Routing
    • KV Cache Offloading
    • Tool Calling
    • Reasoning
      • Reasoning Parsing (Dynamo)
      • Reasoning Parsing (Engine Fallback)
    • Agents
    • Multimodal
    • Diffusion
    • LoRA Adapters
    • Observability (Local)
    • Fault Tolerance
    • Benchmarking
    • Writing Python Workers
  • Backends
    • SGLang
    • TensorRT-LLM
    • vLLM
  • Components
    • Frontend
    • Router
    • Planner
    • Profiler
    • KVBM
  • Integrations
    • LMCache
    • SGLang HiCache
    • FlexKV
    • KV Events for Custom Engines
  • Design Docs
    • Overall Architecture
    • Architecture Flow
    • Disaggregated Serving
    • Distributed Runtime
  • Documentation
    • Dynamo Docs Guide
NVIDIANVIDIA
Developer-friendly docs for your API
Privacy Policy | Your Privacy Choices | Terms of Service | Accessibility | Corporate Policies | Product Security | Contact

Copyright © 2026, NVIDIA Corporation.

LogoLogoDocumentation
Digest
On this page
  • Choose a parsing path
  • See Also
User Guides

Reasoning

Separate reasoning content from assistant output for chain-of-thought models

||View as Markdown|

Some models emit reasoning or thinking content separately from their final response. Dynamo can split that output into reasoning_content and normal assistant content by configuring a reasoning parser. As with tool calling, there are two ways to do this to ensure wide coverage and day0 model support.

Choose a parsing path

PathWhen to usePage
DynamoDynamo ships a Rust parser for the model’s reasoning format. Lowest latency, the default path.Reasoning Parsing (Dynamo)
Engine FallbackUse the framework’s implementation (vLLM or SGLang) for pre/post processing, including tool call and reasoning parsing - ensure consistency with framework behavior.Reasoning Parsing (Engine Fallback)

Start with the Dynamo path. Fall back to the engine path only when Dynamo’s registry does not list a parser for your model.

See Also

  • Tool Calling — parse tool calls out of model output. Several models need both a reasoning parser and a tool-call parser configured together.
  • Frontend Configuration Reference — full CLI flag reference.
Previous

Troubleshooting Tool Calls

Next

Reasoning Parsing (Dynamo)