Skip to main content
Ctrl+K
Dynamo - Home

Dynamo

  • GitHub
Dynamo - Home

Dynamo

  • GitHub

Table of Contents

  • Welcome to Dynamo
  • Support Matrix

Architecture & Features

  • High Level Architecture
  • Distributed Runtime
  • Disaggregated Serving
  • KV Block Manager
    • Motivation
    • KVBM Architecture
    • Understanding KVBM components
    • KVBM Further Reading
  • KV Cache Routing
  • Planner
    • Pre-Deployment Profiling
    • Load-based Planner
    • SLA-based Planner
  • Dynamo Architecture Flow

Using Dynamo

  • Writing Python Workers in Dynamo
  • Disaggregation and Performance Tuning
  • Working with Dynamo Kubernetes Operator

Deployment Guides

  • Dynamo Deploy Quickstart
  • Dynamo Cloud Kubernetes Platform
  • Manual Helm Deployment
  • Minikube Setup Guide
  • Model Caching with Fluid

Examples

  • Hello World
  • LLM Deployment Examples using VLLM
  • LLM Deployment Examples using SGLang
  • Multinode Examples using SGLang
  • Planner Benchmark Example
  • LLM Deployment Examples using TensorRT-LLM

Reference

  • Glossary
  • NIXL Connect API
  • KVBM Reading
  • dynamo.nixl_...

dynamo.nixl_connect.DeviceKind(IntEnum)#

Represents the kind of device a Device object represents.

Values#

CUDA#

CUDA addressable device (GPU) memory.

HOST#

System (CPU) memory.

Related Classes#

  • Connector

  • Descriptor

  • Device

  • OperationStatus

  • RdmaMetadata

  • ReadOperation

  • WritableOperation

  • WriteOperation

On this page
  • Values
    • CUDA
    • HOST
  • Related Classes
NVIDIA NVIDIA
Privacy Policy | Manage My Privacy | Do Not Sell or Share My Data | Terms of Service | Accessibility | Corporate Policies | Product Security | Contact

Copyright © 2025-2025, NVIDIA Corporation.