Dynamo Release Artifacts#
This document inventories all Dynamo release artifacts: container images, Python wheels, Helm charts, and Rust crates.
See also: Support Matrix for hardware and platform compatibility | Feature Matrix for backend feature support
Release history in this document begins at v0.6.0.
Current Release: Dynamo v0.8.1#
Patch Release: v0.8.1.post1 (Jan 23, 2026)#
v0.8.1.post1 is a patch release for PyPI wheels and TRT-LLM container only (no GitHub release). All other artifacts remain at v0.8.1.
Container Images#
| Image:Tag | Description | Backend | CUDA | Arch | NGC | Notes |
|---|---|---|---|---|---|---|
| vllm-runtime:0.8.1 | Runtime container for vLLM backend | vLLM | | AMD64/ARM64 | | |
| vllm-runtime:0.8.1-cuda13 | Runtime container for vLLM backend (CUDA 13) | vLLM | | AMD64/ARM64* | — | Fails to launch |
| sglang-runtime:0.8.1 | Runtime container for SGLang backend | SGLang | | AMD64/ARM64 | | |
| sglang-runtime:0.8.1-cuda13 | Runtime container for SGLang backend (CUDA 13) | SGLang | | AMD64/ARM64* | | Experimental |
| tensorrtllm-runtime:0.8.1.post1 | Runtime container for TensorRT-LLM backend | TRT-LLM | | AMD64/ARM64 | | |
| dynamo-frontend:0.8.1 | API gateway with Endpoint Prediction Protocol (EPP) | — | — | AMD64/ARM64 | | |
| kubernetes-operator:0.8.1 | Kubernetes operator for Dynamo deployments | — | — | AMD64/ARM64 | | |
* Multimodal inference on CUDA 13 images: works on AMD64 for all backends; works on ARM64 only for TensorRT-LLM (vllm-runtime:*-cuda13 and sglang-runtime:*-cuda13 do not support multimodality on ARM64).
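The footnote above amounts to a small lookup rule. As a minimal sketch, the helper below encodes that rule for use in deployment scripts. The function name `cuda13_multimodal_supported` and the lowercase backend and architecture spellings are assumptions made for illustration; nothing here is shipped with Dynamo.

```shell
# Hypothetical helper (not part of Dynamo): reports whether multimodal inference
# is expected to work on a CUDA 13 image, following the footnote above.
# Usage: cuda13_multimodal_supported <backend> <arch>
cuda13_multimodal_supported() {
  backend="$1"
  arch="$2"
  # Only the three backends named in this document are recognized.
  case "$backend" in
    vllm|sglang|tensorrtllm) ;;
    *) echo "unknown"; return 0 ;;
  esac
  case "$arch" in
    amd64)
      # AMD64: works for all backends.
      echo "yes" ;;
    arm64)
      # ARM64: works only for TensorRT-LLM.
      if [ "$backend" = "tensorrtllm" ]; then echo "yes"; else echo "no"; fi ;;
    *)
      echo "unknown" ;;
  esac
}

cuda13_multimodal_supported sglang arm64        # prints: no
cuda13_multimodal_supported tensorrtllm arm64   # prints: yes
```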
Python Wheels#
We recommend using the TensorRT-LLM NGC container instead of the ai-dynamo[trtllm] wheel. See the NGC container collection for supported images.
| Package | Description | Python | Platform | PyPI |
|---|---|---|---|---|
| ai-dynamo | Main package with backend integrations (vLLM, SGLang, TRT-LLM) | | Linux (glibc | |
| ai-dynamo-runtime | Core Python bindings for Dynamo runtime | | Linux (glibc | |
| kvbm | KV Block Manager for disaggregated KV cache | | Linux (glibc | |
Helm Charts#
Rust Crates#
| Crate | Description | MSRV (Rust) | crates.io |
|---|---|---|---|
| dynamo-runtime@0.8.1 | Core distributed runtime library | | |
| dynamo-llm@0.8.1 | LLM inference engine | | |
| dynamo-async-openai@0.8.1 | Async OpenAI-compatible API client | | |
| dynamo-parsers@0.8.1 | Protocol parsers (SSE, JSON streaming) | | |
| dynamo-memory@0.8.1 | Memory management utilities | | |
| dynamo-config@0.8.1 | Configuration management | | |
Quick Install Commands#
Container Images (NGC)#
For detailed run instructions, see the Container README or backend-specific guides: vLLM | SGLang | TensorRT-LLM
# Runtime containers
docker pull nvcr.io/nvidia/ai-dynamo/vllm-runtime:0.8.1
docker pull nvcr.io/nvidia/ai-dynamo/sglang-runtime:0.8.1
docker pull nvcr.io/nvidia/ai-dynamo/tensorrtllm-runtime:0.8.1.post1
# CUDA 13 variants (experimental)
# vLLM CUDA 13 image fails to launch (known issue)
# docker pull nvcr.io/nvidia/ai-dynamo/vllm-runtime:0.8.1-cuda13
docker pull nvcr.io/nvidia/ai-dynamo/sglang-runtime:0.8.1-cuda13
# Infrastructure containers
docker pull nvcr.io/nvidia/ai-dynamo/dynamo-frontend:0.8.1
docker pull nvcr.io/nvidia/ai-dynamo/kubernetes-operator:0.8.1
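When scripting these pulls, it can help to derive every image reference from one pinned version. The sketch below only prints the references; the variable names `DYNAMO_VERSION`, `TRTLLM_TAG`, and `REGISTRY` and the function `dynamo_images` are illustrative choices, not Dynamo settings. It leaves out the CUDA 13 variants.

```shell
# Sketch: print the image references for one release so they can be pulled in a loop.
# All variable and function names here are hypothetical.
DYNAMO_VERSION="0.8.1"
TRTLLM_TAG="${DYNAMO_VERSION}.post1"   # the TRT-LLM container was re-released as a patch
REGISTRY="nvcr.io/nvidia/ai-dynamo"

dynamo_images() {
  for name in vllm-runtime sglang-runtime dynamo-frontend kubernetes-operator; do
    printf '%s/%s:%s\n' "$REGISTRY" "$name" "$DYNAMO_VERSION"
  done
  printf '%s/tensorrtllm-runtime:%s\n' "$REGISTRY" "$TRTLLM_TAG"
}

dynamo_images
# To pull them (requires Docker and access to nvcr.io):
#   dynamo_images | while read -r image; do docker pull "$image"; done
```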
Python Wheels (PyPI)#
For detailed installation instructions, see the Local Quick Start in the README.
# Install Dynamo with a specific backend (Recommended)
uv pip install "ai-dynamo[vllm]==0.8.1.post1"
uv pip install "ai-dynamo[sglang]==0.8.1.post1"
# TensorRT-LLM requires the NVIDIA PyPI index and pip
pip install --pre --extra-index-url https://pypi.nvidia.com "ai-dynamo[trtllm]==0.8.1.post1"
# Install Dynamo core only
uv pip install ai-dynamo==0.8.1.post1
# Install standalone KVBM (Python 3.12 only)
uv pip install kvbm==0.8.1
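To keep the pin reproducible across machines, the version can be recorded in a requirements file instead of being retyped. This is a minimal sketch; the file name `requirements-dynamo.txt` is an arbitrary choice, and the vLLM extra is used only as an example.

```shell
# Sketch: write the pinned requirement to a file (the file name is hypothetical).
cat > requirements-dynamo.txt <<'EOF'
ai-dynamo[vllm]==0.8.1.post1
EOF

cat requirements-dynamo.txt
# Then install from the file, for example:
#   uv pip install -r requirements-dynamo.txt
```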
Helm Charts (NGC)#
For Kubernetes deployment instructions, see the Kubernetes Installation Guide.
helm install dynamo-crds oci://helm.ngc.nvidia.com/nvidia/ai-dynamo/charts/dynamo-crds --version 0.8.1
helm install dynamo-platform oci://helm.ngc.nvidia.com/nvidia/ai-dynamo/charts/dynamo-platform --version 0.8.1
helm install dynamo-graph oci://helm.ngc.nvidia.com/nvidia/ai-dynamo/charts/dynamo-graph --version 0.8.1
Rust Crates (crates.io)#
For API documentation, see each crate on docs.rs. To build Dynamo from source, see Building from Source.
cargo add dynamo-runtime@0.8.1
cargo add dynamo-llm@0.8.1
cargo add dynamo-async-openai@0.8.1
cargo add dynamo-parsers@0.8.1
cargo add dynamo-memory@0.8.1
cargo add dynamo-config@0.8.1
CUDA and Driver Requirements#
For detailed CUDA toolkit versions and minimum driver requirements for each container image, see the Support Matrix.
Known Issues#
For a complete list of known issues, refer to the release notes for each patch:
Known Artifact Issues#
| Version | Artifact | Issue | Status |
|---|---|---|---|
| v0.8.1 | | Container fails to launch. | Known issue |
| v0.8.1 | | Multimodality not expected to work on ARM64. Works on AMD64. | Known limitation |
| v0.8.0 | | cuDNN installation issue caused PyTorch | Fixed in v0.8.1 (#5461) |
Release History#
- v0.8.1.post1 Patch: Updated TRT-LLM to v1.2.0rc6.post2 (PyPI wheels and TRT-LLM container only)
- Standalone Frontend Container: dynamo-frontend added in v0.8.0
- CUDA 13 Runtimes: Experimental CUDA 13 runtime for vLLM and SGLang in v0.8.0
- New Rust Crates: dynamo-memory and dynamo-config added in v0.8.0
GitHub Releases#
Container Images#
NGC Collection: ai-dynamo
To access a specific version, append ?version=TAG to the container URL:
https://catalog.ngc.nvidia.com/orgs/nvidia/teams/ai-dynamo/containers/{container}?version={tag}
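As a small illustration of the URL pattern above, the sketch below fills in the container name and tag. The function name `ngc_container_url` is hypothetical.

```shell
# Hypothetical helper: build the NGC catalog URL for one container and tag,
# following the pattern given above.
# Usage: ngc_container_url <container> <tag>
ngc_container_url() {
  printf 'https://catalog.ngc.nvidia.com/orgs/nvidia/teams/ai-dynamo/containers/%s?version=%s\n' "$1" "$2"
}

ngc_container_url sglang-runtime 0.8.1-cuda13
```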
vllm-runtime#
| Image:Tag | vLLM | Arch | CUDA | Notes |
|---|---|---|---|---|
| | | AMD64/ARM64 | | |
| | | AMD64/ARM64 | | |
| | | AMD64/ARM64 | | Experimental |
| | | AMD64/ARM64 | | Patch |
| | | AMD64/ARM64 | | |
| | | AMD64/ARM64 | | Patch |
| | | AMD64/ARM64 | | |
| | | AMD64/ARM64 | | Patch |
| | | AMD64/ARM64 | | |
| | | AMD64 | | |
sglang-runtime#
| Image:Tag | SGLang | Arch | CUDA | Notes |
|---|---|---|---|---|
| | | AMD64/ARM64 | | |
| | | AMD64/ARM64 | | Experimental |
| | | AMD64/ARM64 | | |
| | | AMD64/ARM64 | | Experimental |
| | | AMD64/ARM64 | | |
| | | AMD64/ARM64 | | Patch |
| | | AMD64/ARM64 | | |
| | | AMD64/ARM64 | | Patch |
| | | AMD64/ARM64 | | |
| | | AMD64 | | |
tensorrtllm-runtime#
| Image:Tag | TRT-LLM | Arch | CUDA | Notes |
|---|---|---|---|---|
| | | AMD64/ARM64 | | Patch |
| | | AMD64/ARM64 | | |
| | | AMD64/ARM64 | | |
| | | AMD64/ARM64 | | Patch |
| | | AMD64/ARM64 | | |
| | | AMD64/ARM64 | | Patch |
| | | AMD64/ARM64 | | |
| | | AMD64/ARM64 | | Experimental |
| | | AMD64/ARM64 | | Patch |
| | | AMD64/ARM64 | | |
| | | AMD64/ARM64 | | |
dynamo-frontend#
| Image:Tag | Arch | Notes |
|---|---|---|
| | AMD64/ARM64 | |
| | AMD64/ARM64 | Initial |
kubernetes-operator#
| Image:Tag | Arch | Notes |
|---|---|---|
| | AMD64/ARM64 | |
| | AMD64/ARM64 | |
| | AMD64/ARM64 | |
| | AMD64/ARM64 | Patch |
| | AMD64/ARM64 | |
| | AMD64/ARM64 | |
| | AMD64/ARM64 | |
Python Wheels#
PyPI: ai-dynamo | ai-dynamo-runtime | kvbm
To access a specific version:
https://pypi.org/project/{package}/{version}/
ai-dynamo (wheel)#
| Package | Python | Platform | Notes |
|---|---|---|---|
| | | Linux (glibc | TRT-LLM |
| | | Linux (glibc | |
| | | Linux (glibc | |
| | | Linux (glibc | |
| | | Linux (glibc | |
| | | Linux (glibc | |
| | | Linux (glibc | |
ai-dynamo-runtime (wheel)#
| Package | Python | Platform | Notes |
|---|---|---|---|
| | | Linux (glibc | TRT-LLM |
| | | Linux (glibc | |
| | | Linux (glibc | |
| | | Linux (glibc | |
| | | Linux (glibc | |
| | | Linux (glibc | |
| | | Linux (glibc | |
kvbm (wheel)#
| Package | Python | Platform | Notes |
|---|---|---|---|
| | | Linux (glibc | |
| | | Linux (glibc | |
| | | Linux (glibc | |
| | | Linux (glibc | Initial |
Helm Charts#
NGC Helm Registry: ai-dynamo
Direct download:
https://helm.ngc.nvidia.com/nvidia/ai-dynamo/charts/{chart}-{version}.tgz
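The direct-download pattern above can be filled in per chart and version. The sketch below prints the archive URL for each of the three charts; the function name `helm_chart_url` is hypothetical, and whether the download needs NGC credentials is not covered here.

```shell
# Hypothetical helper: build the direct-download URL for a chart archive,
# following the pattern given above.
# Usage: helm_chart_url <chart> <version>
helm_chart_url() {
  printf 'https://helm.ngc.nvidia.com/nvidia/ai-dynamo/charts/%s-%s.tgz\n' "$1" "$2"
}

for chart in dynamo-crds dynamo-platform dynamo-graph; do
  helm_chart_url "$chart" 0.8.1
done
# To download one archive (requires network access):
#   curl -fLO "$(helm_chart_url dynamo-crds 0.8.1)"
```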
dynamo-crds (Helm chart)#
| Chart | Notes |
|---|---|
| | |
| | |
| | |
| | |
| | |
| | |
dynamo-platform (Helm chart)#
| Chart | Notes |
|---|---|
| | |
| | |
| | |
| | |
| | |
| | |
dynamo-graph (Helm chart)#
| Chart | Notes |
|---|---|
| | |
| | |
| | |
| | |
| | |
| | |
Rust Crates#
crates.io: dynamo-runtime | dynamo-llm | dynamo-async-openai | dynamo-parsers | dynamo-memory | dynamo-config
To access a specific version:
https://crates.io/crates/{crate}/{version}
dynamo-runtime (crate)#
| Crate | MSRV (Rust) | Notes |
|---|---|---|
| | | |
| | | |
| | | |
| | | |
| | | |
| | | |
dynamo-llm (crate)#
| Crate | MSRV (Rust) | Notes |
|---|---|---|
| | | |
| | | |
| | | |
| | | |
| | | |
| | | |
dynamo-async-openai (crate)#
| Crate | MSRV (Rust) | Notes |
|---|---|---|
| | | |
| | | |
| | | |
| | | |
| | | |
| | | |
dynamo-parsers (crate)#
| Crate | MSRV (Rust) | Notes |
|---|---|---|
| | | |
| | | |
| | | |
| | | |
| | | |
| | | |
dynamo-memory (crate)#
| Crate | MSRV (Rust) | Notes |
|---|---|---|
| | | |
| | | Initial |
dynamo-config (crate)#
| Crate | MSRV (Rust) | Notes |
|---|---|---|
| | | |
| | | Initial |