For AI agents: a documentation index is available at the root level at /llms.txt and /llms-full.txt. Append /llms.txt to any URL for a page-level index, or .md for the markdown version of any page.
Digest
  • Getting Started
    • Quickstart
    • Introduction
    • Local Installation
    • Building from Source
    • Contribution Guide
  • Resources
    • Support Matrix
    • Feature Matrix
    • Release Artifacts
    • Examples
  • Kubernetes Deployment
    • Deployment Guide
  • User Guides
    • KV Cache Aware Routing
    • Disaggregated Serving
    • KV Cache Offloading
    • Dynamo Benchmarking
    • Multimodal
    • Diffusion (Preview)
      • FastVideo
      • SGLang Diffusion
      • TRT-LLM Diffusion
      • vLLM-Omni
    • Tool Calling
    • LoRA Adapters
    • Agents
    • Observability (Local)
    • Fault Tolerance
    • Writing Python Workers
  • Backends
    • SGLang
    • TensorRT-LLM
    • vLLM
  • Components
    • Frontend
    • Router
    • Planner
    • Profiler
    • KVBM
  • Integrations
    • LMCache
    • SGLang HiCache
    • FlexKV
    • KV Events for Custom Engines
  • Design Docs
    • Overall Architecture
    • Architecture Flow
    • Disaggregated Serving
    • Distributed Runtime
    • Blog
  • Documentation
    • Dynamo Docs Guide
NVIDIANVIDIA
Developer-friendly docs for your API
Privacy Policy | Your Privacy Choices | Terms of Service | Accessibility | Corporate Policies | Product Security | Contact

Copyright © 2026, NVIDIA Corporation.

LogoLogoDocumentation
Digest
On this page
  • Overview
  • Support Matrix
  • Backend Documentation
User Guides

Diffusion

Deploy diffusion models for text-to-image, text-to-video, and more in Dynamo

||View as Markdown|

Overview

Dynamo supports serving diffusion models across multiple backends, enabling generation of images and video from text prompts. Backends expose diffusion capabilities through the same Dynamo pipeline infrastructure used for LLM inference, including frontend routing, scaling, and observability.

Support Matrix

ModalityvLLM-OmniSGLangTRT-LLM
Text-to-Text✅✅❌
Text-to-Image✅✅❌
Text-to-Video✅✅✅
Image-to-Video✅❌❌

Status: ✅ Supported | ❌ Not supported

Backend Documentation

For deployment guides, configuration, and examples for each backend:

  • vLLM-Omni
  • SGLang Diffusion
  • TRT-LLM Diffusion
  • FastVideo (custom worker)
Edit this page
Previous

Multimodal KV Routing

Next

FastVideo