Release Artifacts


This document inventories all Dynamo release artifacts: container images, Python wheels, Helm charts, and Rust crates.

See also: Support Matrix for hardware and platform compatibility | Feature Matrix for backend feature support

Release history in this document begins at v0.6.0.

Current Release: Dynamo v1.0.1

Experimental: v1.1.0-dev.1 is available as an experimental preview. See Pre-Release Artifacts for container images, wheels, and Helm charts.

Container Images

| Image:Tag | Description | Backend | CUDA | Arch | NGC | Notes |
|---|---|---|---|---|---|---|
| vllm-runtime:1.0.1 | Runtime container for vLLM backend | vLLM v0.16.0 | v12.9 | AMD64/ARM64 | link | |
| vllm-runtime:1.0.1-cuda13 | Runtime container for vLLM backend (CUDA 13) | vLLM v0.16.0 | v13.0 | AMD64/ARM64* | link | |
| vllm-runtime:1.0.1-efa-amd64 | Runtime container for vLLM with AWS EFA | vLLM v0.16.0 | v12.9 | AMD64 | link | Experimental |
| sglang-runtime:1.0.1 | Runtime container for SGLang backend | SGLang v0.5.9 | v12.9 | AMD64/ARM64 | link | |
| sglang-runtime:1.0.1-cuda13 | Runtime container for SGLang backend (CUDA 13) | SGLang v0.5.9 | v13.0 | AMD64/ARM64* | link | |
| tensorrtllm-runtime:1.0.1 | Runtime container for TensorRT-LLM backend | TRT-LLM v1.3.0rc5.post1 | v13.1 | AMD64/ARM64 | link | |
| tensorrtllm-runtime:1.0.1-efa-amd64 | Runtime container for TensorRT-LLM with AWS EFA | TRT-LLM v1.3.0rc5.post1 | v13.1 | AMD64 | link | Experimental |
| dynamo-frontend:1.0.1 | API gateway with Endpoint Prediction Protocol (EPP) | | | AMD64/ARM64 | link | |
| kubernetes-operator:1.0.1 | Kubernetes operator for Dynamo deployments | | | AMD64/ARM64 | link | |
| snapshot-agent:1.0.1 | Snapshot agent for fast GPU worker recovery via CRIU | | | AMD64/ARM64 | link | Preview |

* Multimodal inference on CUDA 13 images: works on AMD64 for all backends; works on ARM64 only for TensorRT-LLM (vllm-runtime:*-cuda13 and sglang-runtime:*-cuda13 do not support multimodality on ARM64).
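The footnote above can be encoded as a small lookup for use in deployment scripts. The following is a minimal sketch, not part of Dynamo: the function name `cuda13_multimodal_supported` and its argument spellings (`vllm`, `sglang`, `tensorrtllm`, `amd64`, `arm64`) are hypothetical and chosen only for illustration.

```shell
# Hypothetical helper: reports whether multimodal inference is expected to
# work on a CUDA 13 image, per the footnote above.
# Usage: cuda13_multimodal_supported <backend> <arch>
cuda13_multimodal_supported() {
  backend="$1"
  arch="$2"
  case "$arch" in
    amd64)
      # AMD64: works for all backends.
      echo yes ;;
    arm64)
      # ARM64: works only for TensorRT-LLM.
      case "$backend" in
        tensorrtllm) echo yes ;;
        *) echo no ;;
      esac ;;
    *)
      # Architectures not covered by the footnote.
      echo unknown ;;
  esac
}

cuda13_multimodal_supported vllm arm64
cuda13_multimodal_supported tensorrtllm arm64
```

A script could call this before scheduling a multimodal workload onto an ARM64 node and fall back to an AMD64 node when the answer is `no`.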

Python Wheels

We recommend using the TensorRT-LLM NGC container instead of the ai-dynamo[trtllm] wheel. See the NGC container collection for supported images.

| Package | Description | Python | Platform | PyPI |
|---|---|---|---|---|
| ai-dynamo==1.0.1 | Main package with backend integrations (vLLM, SGLang, TRT-LLM) | 3.10-3.12 | Linux (glibc v2.28+) | link |
| ai-dynamo-runtime==1.0.1 | Core Python bindings for Dynamo runtime | 3.10-3.12 | Linux (glibc v2.28+) | link |
| kvbm==1.0.1 | KV Block Manager for disaggregated KV cache | 3.12 | Linux (glibc v2.28+) | link |

Helm Charts

| Chart | Description | NGC |
|---|---|---|
| dynamo-platform-1.0.1 | Platform services (etcd, NATS) and Dynamo Operator for Dynamo cluster | link |
| snapshot-1.0.1 | Snapshot DaemonSet for fast GPU worker recovery | link |

The dynamo-crds Helm chart is deprecated as of v1.0.0; CRDs are now managed by the Dynamo Operator. The dynamo-graph Helm chart is deprecated as of v0.9.0.

Rust Crates

| Crate | Description | MSRV (Rust) | crates.io |
|---|---|---|---|
| dynamo-runtime@1.0.1 | Core distributed runtime library | v1.82 | link |
| dynamo-llm@1.0.1 | LLM inference engine | v1.82 | link |
| dynamo-protocols@1.0.1 | Async OpenAI-compatible API client | v1.82 | link |
| dynamo-parsers@1.0.1 | Protocol parsers (SSE, JSON streaming) | v1.82 | link |
| dynamo-memory@1.0.1 | Memory management utilities | v1.82 | link |
| dynamo-config@1.0.1 | Configuration management | v1.82 | link |
| dynamo-tokens@1.0.1 | Tokenizer bindings for LLM inference | v1.82 | link |
| dynamo-mocker@1.0.1 | Inference engine simulator for benchmarking | v1.82 | link |
| dynamo-kv-router@1.0.1 | KV-aware request routing library | v1.82 | link |

Quick Install Commands

Container Images (NGC)

For detailed run instructions, see the backend-specific guides: vLLM | SGLang | TensorRT-LLM

# Runtime containers
$ docker pull nvcr.io/nvidia/ai-dynamo/vllm-runtime:1.0.1
$ docker pull nvcr.io/nvidia/ai-dynamo/sglang-runtime:1.0.1
$ docker pull nvcr.io/nvidia/ai-dynamo/tensorrtllm-runtime:1.0.1

# CUDA 13 variants
$ docker pull nvcr.io/nvidia/ai-dynamo/vllm-runtime:1.0.1-cuda13
$ docker pull nvcr.io/nvidia/ai-dynamo/sglang-runtime:1.0.1-cuda13

# EFA variants (AWS, AMD64 only, experimental)
$ docker pull nvcr.io/nvidia/ai-dynamo/vllm-runtime:1.0.1-efa-amd64
$ docker pull nvcr.io/nvidia/ai-dynamo/tensorrtllm-runtime:1.0.1-efa-amd64

# Infrastructure containers
$ docker pull nvcr.io/nvidia/ai-dynamo/dynamo-frontend:1.0.1
$ docker pull nvcr.io/nvidia/ai-dynamo/kubernetes-operator:1.0.1
$ docker pull nvcr.io/nvidia/ai-dynamo/snapshot-agent:1.0.1

Python Wheels (PyPI)

For detailed installation instructions, see the Local Quick Start in the README.

# Install Dynamo with a specific backend (recommended)
$ uv pip install "ai-dynamo[vllm]==1.0.1"
$ uv pip install "ai-dynamo[sglang]==1.0.1"
# TensorRT-LLM requires the NVIDIA PyPI index and pip
$ pip install --pre --extra-index-url https://pypi.nvidia.com "ai-dynamo[trtllm]==1.0.1"

# Install Dynamo core only
$ uv pip install ai-dynamo==1.0.1

# Install standalone KVBM (Python 3.12 only)
$ uv pip install kvbm==1.0.1

Helm Charts (NGC)

For Kubernetes deployment instructions, see the Kubernetes Installation Guide.

$ helm install dynamo-platform oci://helm.ngc.nvidia.com/nvidia/ai-dynamo/charts/dynamo-platform --version 1.0.1
$ helm install snapshot oci://helm.ngc.nvidia.com/nvidia/ai-dynamo/charts/snapshot --version 1.0.1

Rust Crates (crates.io)

For API documentation, see each crate on docs.rs. To build Dynamo from source, see Building from Source.

$ cargo add dynamo-runtime@1.0.1
$ cargo add dynamo-llm@1.0.1
$ cargo add dynamo-protocols@1.0.1
$ cargo add dynamo-parsers@1.0.1
$ cargo add dynamo-memory@1.0.1
$ cargo add dynamo-config@1.0.1
$ cargo add dynamo-tokens@1.0.1
$ cargo add dynamo-mocker@1.0.1
$ cargo add dynamo-kv-router@1.0.1

CUDA and Driver Requirements: For detailed CUDA toolkit versions and minimum driver requirements for each container image, see the Support Matrix.

Known Issues

For a complete list of known issues, refer to the release notes for each version.

Known Artifact Issues

| Version | Artifact | Issue | Status |
|---|---|---|---|
| v0.9.0 | dynamo-platform-0.9.0 | Helm chart sets operator image to 0.7.1 instead of 0.9.0. | Fixed in v0.9.0.post1 |
| v0.8.1 | vllm-runtime:0.8.1-cuda13 | Container fails to launch. | Known issue |
| v0.8.1 | sglang-runtime:0.8.1-cuda13, vllm-runtime:0.8.1-cuda13 | Multimodality not expected to work on ARM64. Works on AMD64. | Known limitation |
| v0.8.0 | sglang-runtime:0.8.0-cuda13 | CuDNN installation issue caused PyTorch v2.9.1 compatibility problems with nn.Conv3d, resulting in performance degradation and excessive memory usage in multimodal workloads. | Fixed in v0.8.1 (#5461) |

Release History

  • v1.1.0-dev.1 (experimental): Preview release. SGLang v0.5.9, TRT-LLM v1.3.0rc5.post1, vLLM v0.17.1, NIXL v0.10.1. Not recommended for production use.
  • v1.0.1: Patch release. Same backend versions as v1.0.0: SGLang v0.5.9, TRT-LLM v1.3.0rc5.post1, vLLM v0.16.0, NIXL v0.10.1.
  • v1.0.0: First major release. SGLang v0.5.9, TRT-LLM v1.3.0rc5.post1 (CUDA 13.1), vLLM v0.16.0, NIXL v0.10.1. New snapshot-agent container and snapshot Helm chart (Preview). New EFA container variants for vLLM and TRT-LLM (Experimental, AMD64 only). New dynamo-mocker and dynamo-kv-router Rust crates. Deprecated dynamo-crds Helm chart (CRDs now managed by the Operator). v1alpha1 CRDs deprecated.
  • v0.9.1: Updated TRT-LLM to v1.3.0rc3. All other backend versions unchanged from v0.9.0.
  • v0.9.0.post1: Fixed dynamo-platform Helm chart operator image tag (Helm chart only, NGC)
  • v0.9.0: Updated vLLM to v0.14.1, SGLang to v0.5.8, TRT-LLM to v1.3.0rc1, NIXL to v0.9.0. New dynamo-tokens Rust crate. Deprecated dynamo-graph Helm chart.
  • v0.8.1.post1/.post2/.post3 Patches: Experimental patch releases updating TRT-LLM only (PyPI wheels and TRT-LLM container). No other artifacts changed.
  • Standalone Frontend Container: dynamo-frontend added in v0.8.0
  • EFA Runtimes: Experimental AWS EFA variants for vLLM and TRT-LLM (AMD64 only) in v1.0.0
  • CUDA 13 Runtimes: Experimental CUDA 13 runtime for SGLang and vLLM in v0.8.0
  • New Rust Crates: dynamo-memory and dynamo-config added in v0.8.0

GitHub Releases

| Version | Release Date | GitHub | Docs | Notes |
|---|---|---|---|---|
| v1.1.0-dev.1 | Mar 17, 2026 | Branch | | Experimental |
| v1.0.1 | Mar 16, 2026 | Release | Docs | |
| v1.0.0 | Mar 12, 2026 | Release | Docs | |
| v0.9.1 | Mar 4, 2026 | Release | Docs | |
| v0.9.0 | Feb 11, 2026 | Release | Archived docs unavailable | |
| v0.8.1 | Jan 23, 2026 | Release | Archived docs unavailable | |
| v0.8.0 | Jan 15, 2026 | Release | Archived docs unavailable | |
| v0.7.1 | Dec 15, 2025 | Release | Archived docs unavailable | |
| v0.7.0 | Nov 26, 2025 | Release | Archived docs unavailable | |
| v0.6.1 | Nov 6, 2025 | Release | | |
| v0.6.0 | Oct 28, 2025 | Release | | |

Container Images

NGC Collection: ai-dynamo

To access a specific version, append ?version=TAG to the container URL: https://catalog.ngc.nvidia.com/orgs/nvidia/teams/ai-dynamo/containers/{container}?version={tag}
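The URL pattern above can be filled in with a small shell function. This is a minimal sketch: the function name `ngc_container_url` is hypothetical, and it only formats the string from the pattern shown here without contacting NGC.

```shell
# Hypothetical helper: builds the NGC catalog URL for a container and tag
# using the pattern documented above.
# Usage: ngc_container_url <container> <tag>
ngc_container_url() {
  container="$1"
  tag="$2"
  printf 'https://catalog.ngc.nvidia.com/orgs/nvidia/teams/ai-dynamo/containers/%s?version=%s\n' \
    "$container" "$tag"
}

ngc_container_url vllm-runtime 1.0.1-cuda13
# prints https://catalog.ngc.nvidia.com/orgs/nvidia/teams/ai-dynamo/containers/vllm-runtime?version=1.0.1-cuda13
```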

vllm-runtime

| Image:Tag | vLLM | Arch | CUDA | Notes |
|---|---|---|---|---|
| vllm-runtime:1.0.1 | v0.16.0 | AMD64/ARM64 | v12.9 | |
| vllm-runtime:1.0.1-cuda13 | v0.16.0 | AMD64/ARM64* | v13.0 | |
| vllm-runtime:1.0.1-efa-amd64 | v0.16.0 | AMD64 | v12.9 | Experimental |
| vllm-runtime:1.0.0 | v0.16.0 | AMD64/ARM64 | v12.9 | |
| vllm-runtime:1.0.0-cuda13 | v0.16.0 | AMD64/ARM64* | v13.0 | |
| vllm-runtime:1.0.0-efa-amd64 | v0.16.0 | AMD64 | v12.9 | Experimental |
| vllm-runtime:0.9.1 | v0.14.1 | AMD64/ARM64 | v12.9 | |
| vllm-runtime:0.9.1-cuda13 | v0.14.1 | AMD64/ARM64* | v13.0 | Experimental |
| vllm-runtime:0.9.0 | v0.14.1 | AMD64/ARM64 | v12.9 | |
| vllm-runtime:0.9.0-cuda13 | v0.14.1 | AMD64/ARM64* | v13.0 | Experimental |
| vllm-runtime:0.8.1 | v0.12.0 | AMD64/ARM64 | v12.9 | |
| vllm-runtime:0.8.0 | v0.12.0 | AMD64/ARM64 | v12.9 | |
| vllm-runtime:0.8.0-cuda13 | v0.12.0 | AMD64/ARM64 | v13.0 | Experimental |
| vllm-runtime:0.7.0.post2 | v0.11.2 | AMD64/ARM64 | v12.8 | Patch |
| vllm-runtime:0.7.1 | v0.11.0 | AMD64/ARM64 | v12.8 | |
| vllm-runtime:0.7.0.post1 | v0.11.0 | AMD64/ARM64 | v12.8 | Patch |
| vllm-runtime:0.7.0 | v0.11.0 | AMD64/ARM64 | v12.8 | |
| vllm-runtime:0.6.1.post1 | v0.11.0 | AMD64/ARM64 | v12.8 | Patch |
| vllm-runtime:0.6.1 | v0.11.0 | AMD64/ARM64 | v12.8 | |
| vllm-runtime:0.6.0 | v0.11.0 | AMD64 | v12.8 | |

sglang-runtime

| Image:Tag | SGLang | Arch | CUDA | Notes |
|---|---|---|---|---|
| sglang-runtime:1.0.1 | v0.5.9 | AMD64/ARM64 | v12.9 | |
| sglang-runtime:1.0.1-cuda13 | v0.5.9 | AMD64/ARM64* | v13.0 | |
| sglang-runtime:1.0.0 | v0.5.9 | AMD64/ARM64 | v12.9 | |
| sglang-runtime:1.0.0-cuda13 | v0.5.9 | AMD64/ARM64* | v13.0 | |
| sglang-runtime:0.9.1 | v0.5.8 | AMD64/ARM64 | v12.9 | |
| sglang-runtime:0.9.1-cuda13 | v0.5.8 | AMD64/ARM64* | v13.0 | Experimental |
| sglang-runtime:0.9.0 | v0.5.8 | AMD64/ARM64 | v12.9 | |
| sglang-runtime:0.9.0-cuda13 | v0.5.8 | AMD64/ARM64* | v13.0 | Experimental |
| sglang-runtime:0.8.1 | v0.5.6.post2 | AMD64/ARM64 | v12.9 | |
| sglang-runtime:0.8.1-cuda13 | v0.5.6.post2 | AMD64/ARM64 | v13.0 | Experimental |
| sglang-runtime:0.8.0 | v0.5.6.post2 | AMD64/ARM64 | v12.9 | |
| sglang-runtime:0.8.0-cuda13 | v0.5.6.post2 | AMD64/ARM64 | v13.0 | Experimental |
| sglang-runtime:0.7.1 | v0.5.4.post3 | AMD64/ARM64 | v12.9 | |
| sglang-runtime:0.7.0.post1 | v0.5.4.post3 | AMD64/ARM64 | v12.9 | Patch |
| sglang-runtime:0.7.0 | v0.5.4.post3 | AMD64/ARM64 | v12.9 | |
| sglang-runtime:0.6.1.post1 | v0.5.3.post2 | AMD64/ARM64 | v12.9 | Patch |
| sglang-runtime:0.6.1 | v0.5.3.post2 | AMD64/ARM64 | v12.9 | |
| sglang-runtime:0.6.0 | v0.5.3.post2 | AMD64 | v12.8 | |

tensorrtllm-runtime

| Image:Tag | TRT-LLM | Arch | CUDA | Notes |
|---|---|---|---|---|
| tensorrtllm-runtime:1.0.1 | v1.3.0rc5.post1 | AMD64/ARM64 | v13.1 | |
| tensorrtllm-runtime:1.0.1-efa-amd64 | v1.3.0rc5.post1 | AMD64 | v13.1 | Experimental |
| tensorrtllm-runtime:1.0.0 | v1.3.0rc5.post1 | AMD64/ARM64 | v13.1 | |
| tensorrtllm-runtime:1.0.0-efa-amd64 | v1.3.0rc5.post1 | AMD64 | v13.1 | Experimental |
| tensorrtllm-runtime:0.9.1 | v1.3.0rc3 | AMD64/ARM64 | v13.0 | |
| tensorrtllm-runtime:0.9.0 | v1.3.0rc1 | AMD64/ARM64 | v13.0 | |
| tensorrtllm-runtime:0.8.1.post3 | v1.2.0rc6.post3 | AMD64/ARM64 | v13.0 | Patch |
| tensorrtllm-runtime:0.8.1.post1 | v1.2.0rc6.post2 | AMD64/ARM64 | v13.0 | Patch |
| tensorrtllm-runtime:0.8.1 | v1.2.0rc6.post1 | AMD64/ARM64 | v13.0 | |
| tensorrtllm-runtime:0.8.0 | v1.2.0rc6.post1 | AMD64/ARM64 | v13.0 | |
| tensorrtllm-runtime:0.7.0.post2 | v1.2.0rc2 | AMD64/ARM64 | v13.0 | Patch |
| tensorrtllm-runtime:0.7.1 | v1.2.0rc3 | AMD64/ARM64 | v13.0 | |
| tensorrtllm-runtime:0.7.0.post1 | v1.2.0rc3 | AMD64/ARM64 | v13.0 | Patch |
| tensorrtllm-runtime:0.7.0 | v1.2.0rc2 | AMD64/ARM64 | v13.0 | |
| tensorrtllm-runtime:0.6.1-cuda13 | v1.2.0rc1 | AMD64/ARM64 | v13.0 | Experimental |
| tensorrtllm-runtime:0.6.1.post1 | v1.1.0rc5 | AMD64/ARM64 | v12.9 | Patch |
| tensorrtllm-runtime:0.6.1 | v1.1.0rc5 | AMD64/ARM64 | v12.9 | |
| tensorrtllm-runtime:0.6.0 | v1.1.0rc5 | AMD64/ARM64 | v12.9 | |

dynamo-frontend

| Image:Tag | Arch | Notes |
|---|---|---|
| dynamo-frontend:1.0.1 | AMD64/ARM64 | |
| dynamo-frontend:1.0.0 | AMD64/ARM64 | |
| dynamo-frontend:0.9.1 | AMD64/ARM64 | |
| dynamo-frontend:0.9.0 | AMD64/ARM64 | |
| dynamo-frontend:0.8.1 | AMD64/ARM64 | |
| dynamo-frontend:0.8.0 | AMD64/ARM64 | Initial |

kubernetes-operator

| Image:Tag | Arch | Notes |
|---|---|---|
| kubernetes-operator:1.0.1 | AMD64/ARM64 | |
| kubernetes-operator:1.0.0 | AMD64/ARM64 | |
| kubernetes-operator:0.9.1 | AMD64/ARM64 | |
| kubernetes-operator:0.9.0 | AMD64/ARM64 | |
| kubernetes-operator:0.8.1 | AMD64/ARM64 | |
| kubernetes-operator:0.8.0 | AMD64/ARM64 | |
| kubernetes-operator:0.7.1 | AMD64/ARM64 | |
| kubernetes-operator:0.7.0.post1 | AMD64/ARM64 | Patch |
| kubernetes-operator:0.7.0 | AMD64/ARM64 | |
| kubernetes-operator:0.6.1 | AMD64/ARM64 | |
| kubernetes-operator:0.6.0 | AMD64/ARM64 | |

snapshot-agent

| Image:Tag | Arch | Notes |
|---|---|---|
| snapshot-agent:1.0.1 | AMD64/ARM64 | Preview |
| snapshot-agent:1.0.0 | AMD64/ARM64 | Preview |

Python Wheels

PyPI: ai-dynamo | ai-dynamo-runtime | kvbm

To access a specific version: https://pypi.org/project/{package}/{version}/

ai-dynamo (wheel)

| Package | Python | Platform | Notes |
|---|---|---|---|
| ai-dynamo==1.0.1 | 3.10-3.12 | Linux (glibc v2.28+) | |
| ai-dynamo==1.0.0 | 3.10-3.12 | Linux (glibc v2.28+) | |
| ai-dynamo==0.9.1 | 3.10-3.12 | Linux (glibc v2.28+) | |
| ai-dynamo==0.9.0 | 3.10-3.12 | Linux (glibc v2.28+) | |
| ai-dynamo==0.8.1.post3 | 3.10-3.12 | Linux (glibc v2.28+) | TRT-LLM v1.2.0rc6.post3 |
| ai-dynamo==0.8.1.post1 | 3.10-3.12 | Linux (glibc v2.28+) | TRT-LLM v1.2.0rc6.post2 |
| ai-dynamo==0.8.1 | 3.10-3.12 | Linux (glibc v2.28+) | |
| ai-dynamo==0.8.0 | 3.10-3.12 | Linux (glibc v2.28+) | |
| ai-dynamo==0.7.1 | 3.10-3.12 | Linux (glibc v2.28+) | |
| ai-dynamo==0.7.0 | 3.10-3.12 | Linux (glibc v2.28+) | |
| ai-dynamo==0.6.1 | 3.10-3.12 | Linux (glibc v2.28+) | |
| ai-dynamo==0.6.0 | 3.10-3.12 | Linux (glibc v2.28+) | |

ai-dynamo-runtime (wheel)

| Package | Python | Platform | Notes |
|---|---|---|---|
| ai-dynamo-runtime==1.0.1 | 3.10-3.12 | Linux (glibc v2.28+) | |
| ai-dynamo-runtime==1.0.0 | 3.10-3.12 | Linux (glibc v2.28+) | |
| ai-dynamo-runtime==0.9.1 | 3.10-3.12 | Linux (glibc v2.28+) | |
| ai-dynamo-runtime==0.9.0 | 3.10-3.12 | Linux (glibc v2.28+) | |
| ai-dynamo-runtime==0.8.1.post3 | 3.10-3.12 | Linux (glibc v2.28+) | TRT-LLM v1.2.0rc6.post3 |
| ai-dynamo-runtime==0.8.1.post1 | 3.10-3.12 | Linux (glibc v2.28+) | TRT-LLM v1.2.0rc6.post2 |
| ai-dynamo-runtime==0.8.1 | 3.10-3.12 | Linux (glibc v2.28+) | |
| ai-dynamo-runtime==0.8.0 | 3.10-3.12 | Linux (glibc v2.28+) | |
| ai-dynamo-runtime==0.7.1 | 3.10-3.12 | Linux (glibc v2.28+) | |
| ai-dynamo-runtime==0.7.0 | 3.10-3.12 | Linux (glibc v2.28+) | |
| ai-dynamo-runtime==0.6.1 | 3.10-3.12 | Linux (glibc v2.28+) | |
| ai-dynamo-runtime==0.6.0 | 3.10-3.12 | Linux (glibc v2.28+) | |

kvbm (wheel)

| Package | Python | Platform | Notes |
|---|---|---|---|
| kvbm==1.0.1 | 3.12 | Linux (glibc v2.28+) | |
| kvbm==1.0.0 | 3.12 | Linux (glibc v2.28+) | |
| kvbm==0.9.1 | 3.12 | Linux (glibc v2.28+) | |
| kvbm==0.9.0 | 3.12 | Linux (glibc v2.28+) | |
| kvbm==0.8.1 | 3.12 | Linux (glibc v2.28+) | |
| kvbm==0.8.0 | 3.12 | Linux (glibc v2.28+) | |
| kvbm==0.7.1 | 3.12 | Linux (glibc v2.28+) | |
| kvbm==0.7.0 | 3.12 | Linux (glibc v2.28+) | Initial |

Helm Charts

NGC Helm Registry: ai-dynamo

Direct download: https://helm.ngc.nvidia.com/nvidia/ai-dynamo/charts/{chart}-{version}.tgz
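As an illustration of the download pattern above, the following sketch formats the chart archive URL from a chart name and version. The function name `helm_chart_url` is hypothetical; the sketch only builds the string and does not verify that the archive exists or whether the registry requires authentication.

```shell
# Hypothetical helper: builds the direct-download URL for a chart archive
# using the {chart}-{version}.tgz pattern documented above.
# Usage: helm_chart_url <chart> <version>
helm_chart_url() {
  chart="$1"
  version="$2"
  printf 'https://helm.ngc.nvidia.com/nvidia/ai-dynamo/charts/%s-%s.tgz\n' \
    "$chart" "$version"
}

helm_chart_url dynamo-platform 1.0.1
# prints https://helm.ngc.nvidia.com/nvidia/ai-dynamo/charts/dynamo-platform-1.0.1.tgz
```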

dynamo-crds (Helm chart) — Deprecated

The dynamo-crds Helm chart is deprecated as of v1.0.0. CRDs are now managed by the Dynamo Operator.

| Chart | Notes |
|---|---|
| dynamo-crds-0.9.1 | Last release |
| dynamo-crds-0.9.0 | |
| dynamo-crds-0.8.1 | |
| dynamo-crds-0.8.0 | |
| dynamo-crds-0.7.1 | |
| dynamo-crds-0.7.0 | |
| dynamo-crds-0.6.1 | |
| dynamo-crds-0.6.0 | |

dynamo-platform (Helm chart)

| Chart | Notes |
|---|---|
| dynamo-platform-1.0.1 | |
| dynamo-platform-1.0.0 | |
| dynamo-platform-0.9.1 | |
| dynamo-platform-0.9.0-post1 | Helm fix: operator image tag |
| dynamo-platform-0.9.0 | |
| dynamo-platform-0.8.1 | |
| dynamo-platform-0.8.0 | |
| dynamo-platform-0.7.1 | |
| dynamo-platform-0.7.0 | |
| dynamo-platform-0.6.1 | |
| dynamo-platform-0.6.0 | |

snapshot (Helm chart)

| Chart | Notes |
|---|---|
| snapshot-1.0.1 | Preview |
| snapshot-1.0.0 | Preview |

dynamo-graph (Helm chart) — Deprecated

The dynamo-graph Helm chart is deprecated as of v0.9.0.

| Chart | Notes |
|---|---|
| dynamo-graph-0.8.1 | Last release |
| dynamo-graph-0.8.0 | |
| dynamo-graph-0.7.1 | |
| dynamo-graph-0.7.0 | |
| dynamo-graph-0.6.1 | |
| dynamo-graph-0.6.0 | |

Rust Crates

crates.io: dynamo-runtime | dynamo-llm | dynamo-protocols | dynamo-parsers | dynamo-memory | dynamo-config | dynamo-tokens | dynamo-mocker | dynamo-kv-router

To access a specific version: https://crates.io/crates/{crate}/{version}

dynamo-runtime (crate)

| Crate | MSRV (Rust) | Notes |
|---|---|---|
| dynamo-runtime@1.0.1 | v1.82 | |
| dynamo-runtime@1.0.0 | v1.82 | |
| dynamo-runtime@0.9.1 | v1.82 | |
| dynamo-runtime@0.9.0 | v1.82 | |
| dynamo-runtime@0.8.1 | v1.82 | |
| dynamo-runtime@0.8.0 | v1.82 | |
| dynamo-runtime@0.7.1 | v1.82 | |
| dynamo-runtime@0.7.0 | v1.82 | |
| dynamo-runtime@0.6.1 | v1.82 | |
| dynamo-runtime@0.6.0 | v1.82 | |

dynamo-llm (crate)

| Crate | MSRV (Rust) | Notes |
|---|---|---|
| dynamo-llm@1.0.1 | v1.82 | |
| dynamo-llm@1.0.0 | v1.82 | |
| dynamo-llm@0.9.1 | v1.82 | |
| dynamo-llm@0.9.0 | v1.82 | |
| dynamo-llm@0.8.1 | v1.82 | |
| dynamo-llm@0.8.0 | v1.82 | |
| dynamo-llm@0.7.1 | v1.82 | |
| dynamo-llm@0.7.0 | v1.82 | |
| dynamo-llm@0.6.1 | v1.82 | |
| dynamo-llm@0.6.0 | v1.82 | |

dynamo-protocols (crate)

| Crate | MSRV (Rust) | Notes |
|---|---|---|
| dynamo-protocols@1.0.1 | v1.82 | |
| dynamo-protocols@1.0.0 | v1.82 | |
| dynamo-protocols@0.9.1 | v1.82 | |
| dynamo-protocols@0.9.0 | v1.82 | |
| dynamo-protocols@0.8.1 | v1.82 | |
| dynamo-protocols@0.8.0 | v1.82 | |
| dynamo-protocols@0.7.1 | v1.82 | |
| dynamo-protocols@0.7.0 | v1.82 | |
| dynamo-protocols@0.6.1 | v1.82 | |
| dynamo-protocols@0.6.0 | v1.82 | |

dynamo-parsers (crate)

| Crate | MSRV (Rust) | Notes |
|---|---|---|
| dynamo-parsers@1.0.1 | v1.82 | |
| dynamo-parsers@1.0.0 | v1.82 | |
| dynamo-parsers@0.9.1 | v1.82 | |
| dynamo-parsers@0.9.0 | v1.82 | |
| dynamo-parsers@0.8.1 | v1.82 | |
| dynamo-parsers@0.8.0 | v1.82 | |
| dynamo-parsers@0.7.1 | v1.82 | |
| dynamo-parsers@0.7.0 | v1.82 | |
| dynamo-parsers@0.6.1 | v1.82 | |
| dynamo-parsers@0.6.0 | v1.82 | |

dynamo-memory (crate)

| Crate | MSRV (Rust) | Notes |
|---|---|---|
| dynamo-memory@1.0.1 | v1.82 | |
| dynamo-memory@1.0.0 | v1.82 | |
| dynamo-memory@0.9.1 | v1.82 | |
| dynamo-memory@0.9.0 | v1.82 | |
| dynamo-memory@0.8.1 | v1.82 | |
| dynamo-memory@0.8.0 | v1.82 | Initial |

dynamo-config (crate)

| Crate | MSRV (Rust) | Notes |
|---|---|---|
| dynamo-config@1.0.1 | v1.82 | |
| dynamo-config@1.0.0 | v1.82 | |
| dynamo-config@0.9.1 | v1.82 | |
| dynamo-config@0.9.0 | v1.82 | |
| dynamo-config@0.8.1 | v1.82 | |
| dynamo-config@0.8.0 | v1.82 | Initial |

dynamo-tokens (crate)

| Crate | MSRV (Rust) | Notes |
|---|---|---|
| dynamo-tokens@1.0.1 | v1.82 | |
| dynamo-tokens@1.0.0 | v1.82 | |
| dynamo-tokens@0.9.1 | v1.82 | |
| dynamo-tokens@0.9.0 | v1.82 | Initial |

dynamo-mocker (crate)

| Crate | MSRV (Rust) | Notes |
|---|---|---|
| dynamo-mocker@1.0.1 | v1.82 | |
| dynamo-mocker@1.0.0 | v1.82 | Initial |

dynamo-kv-router (crate)

| Crate | MSRV (Rust) | Notes |
|---|---|---|
| dynamo-kv-router@1.0.1 | v1.82 | |
| dynamo-kv-router@1.0.0 | v1.82 | Initial |

Pre-Release Artifacts

Pre-release artifacts do not go through QA validation. They are experimental previews intended for early testing and feedback, and may contain bugs, breaking changes, or incomplete features. Use stable releases for production workloads.

v1.1.0-dev.1

Container Images

| Image:Tag | Backend | CUDA | Arch |
|---|---|---|---|
| vllm-runtime:1.1.0-dev.1 | vLLM v0.17.1 | v12.9 | AMD64/ARM64 |
| vllm-runtime:1.1.0-dev.1-cuda13 | vLLM v0.17.1 | v13.0 | AMD64/ARM64* |
| vllm-runtime:1.1.0-dev.1-efa-amd64 | vLLM v0.17.1 | v12.9 | AMD64 |
| sglang-runtime:1.1.0-dev.1 | SGLang v0.5.9 | v12.9 | AMD64/ARM64 |
| sglang-runtime:1.1.0-dev.1-cuda13 | SGLang v0.5.9 | v13.0 | AMD64/ARM64* |
| tensorrtllm-runtime:1.1.0-dev.1 | TRT-LLM v1.3.0rc5.post1 | v13.1 | AMD64/ARM64 |
| tensorrtllm-runtime:1.1.0-dev.1-efa-amd64 | TRT-LLM v1.3.0rc5.post1 | v13.1 | AMD64 |
| dynamo-frontend:1.1.0-dev.1 | | | AMD64/ARM64 |
| kubernetes-operator:1.1.0-dev.1 | | | AMD64/ARM64 |
| snapshot-agent:1.1.0-dev.1 | | | AMD64/ARM64 |

Python Wheels

Available from pypi.nvidia.com (pre-release index):

$ uv pip install --pre --extra-index-url https://pypi.nvidia.com/ ai-dynamo==1.1.0.dev1
$ uv pip install --pre --extra-index-url https://pypi.nvidia.com/ ai-dynamo-runtime==1.1.0.dev1
$ uv pip install --pre --extra-index-url https://pypi.nvidia.com/ kvbm==1.1.0.dev1
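Note that the pre-release is spelled differently across artifact types: container images and Helm charts use `1.1.0-dev.1`, while the wheels use `1.1.0.dev1`. The sketch below converts the first form into the second. The function name `to_wheel_version` is hypothetical, and the rule is inferred only from the single pre-release listed here, so treat it as an assumption for other versions.

```shell
# Hypothetical helper: rewrites a container/Helm-style pre-release version
# (for example 1.1.0-dev.1) into the wheel spelling used above (1.1.0.dev1).
# Versions without a "-dev.N" suffix pass through unchanged.
to_wheel_version() {
  printf '%s\n' "$1" | sed -E 's/-dev\.([0-9]+)$/.dev\1/'
}

to_wheel_version 1.1.0-dev.1
# prints 1.1.0.dev1
```

A script that pins both an image tag and a wheel from a single release variable could use this to keep the two spellings in sync.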

Helm Charts

| Chart | NGC |
|---|---|
| dynamo-platform-1.1.0-dev.1 | link |
| snapshot-1.1.0-dev.1 | link |

Rust Crates

Not shipped for pre-release versions.