
Copyright © 2026, NVIDIA Corporation.


Quickstart


Get started with Dynamo locally in just a few commands.

1. Install Dynamo

# Install uv (recommended Python package manager)
curl -LsSf https://astral.sh/uv/install.sh | sh

# Create a virtual environment and install Dynamo
uv venv venv
source venv/bin/activate
# Use the prerelease flag to install RC versions of flashinfer and/or other dependencies
uv pip install --prerelease=allow "ai-dynamo[sglang]" # or [vllm], [trtllm]

2. Start etcd/NATS

# Fetch and start etcd and NATS using Docker Compose
VERSION=$(uv pip show ai-dynamo | grep Version | cut -d' ' -f2)
curl -fsSL -o docker-compose.yml https://raw.githubusercontent.com/ai-dynamo/dynamo/refs/tags/v${VERSION}/deploy/docker-compose.yml
docker compose -f docker-compose.yml up -d
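
If `ai-dynamo` is not installed in the active environment, the version lookup above yields an empty string and the download fails with an unhelpful 404. A minimal sketch of a guard follows; `parse_version` and `compose_url` are hypothetical helper names, not part of Dynamo, and the URL pattern is copied from the command above.

```shell
# Sketch: resolve the installed ai-dynamo version and build the compose-file URL,
# failing early when the version is empty. Helper names are hypothetical.

# Read `uv pip show` output on stdin and print the value of the "Version:" field.
parse_version() {
  awk '$1 == "Version:" { print $2; exit }'
}

# Print the docker-compose.yml URL for the given release version.
# Returns non-zero (and prints nothing on stdout) when the version is empty.
compose_url() {
  if [ -z "$1" ]; then
    echo "ai-dynamo version is empty; is the virtual environment active?" >&2
    return 1
  fi
  printf 'https://raw.githubusercontent.com/ai-dynamo/dynamo/refs/tags/v%s/deploy/docker-compose.yml\n' "$1"
}
```

One way to use it: `url=$(compose_url "$(uv pip show ai-dynamo | parse_version)") && curl -fsSL -o docker-compose.yml "$url"`. The download then runs only when a version was found.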

3. Run Dynamo

# Start the OpenAI compatible frontend (default port is 8000)
python -m dynamo.frontend

# In another terminal, start an SGLang worker
python -m dynamo.sglang --model-path Qwen/Qwen3-0.6B
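
The frontend and worker take a moment to come up, so a request sent immediately may fail. The sketch below polls an HTTP endpoint until it answers. It assumes the frontend serves the OpenAI-style `/v1/models` route on the default port 8000; that route is an assumption here, not something this page confirms, and `wait_for_http` is a hypothetical helper name.

```shell
# wait_for_http URL [TRIES]: poll URL about once per second until it answers
# with a successful HTTP status; give up after TRIES attempts (default 60).
wait_for_http() {
  url=$1
  tries=${2:-60}
  while [ "$tries" -gt 0 ]; do
    # -f makes curl return non-zero on HTTP errors; --max-time bounds each attempt.
    if curl -fsS -o /dev/null --max-time 2 "$url" 2>/dev/null; then
      return 0
    fi
    tries=$((tries - 1))
    if [ "$tries" -gt 0 ]; then
      sleep 1
    fi
  done
  return 1
}
```

For example, `wait_for_http http://localhost:8000/v1/models && echo "frontend is answering"`. A successful response only shows that the frontend is listening; the worker may still be loading the model.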

4. Test your deployment

curl localhost:8000/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{"model": "Qwen/Qwen3-0.6B",
       "messages": [{"role": "user", "content": "Hello!"}],
       "max_tokens": 50}'
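
The request above returns a JSON document. Assuming the response follows the OpenAI chat completions schema (which the "OpenAI compatible" frontend suggests, but this page does not show), the reply text can be pulled out as sketched below. `extract_reply` is a hypothetical helper name, and `python3` is used only as a JSON parser.

```shell
# extract_reply: read a chat completion JSON document on stdin and print the
# first choice's message content. Assumes the OpenAI response layout
# choices[0].message.content; fails with a Python error if the field is absent.
extract_reply() {
  python3 -c 'import json, sys; print(json.load(sys.stdin)["choices"][0]["message"]["content"])'
}
```

To use it, add `-s` to the curl command in this step and pipe its output into `extract_reply`.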

Next Steps

  • Installation - Detailed installation options
  • Examples - More deployment examples
  • Support Matrix - Supported models and configurations