Additional ResourcesTemplates

Backend README

View as Markdown

Feature Matrix

FeatureStatusNotes
Disaggregated Serving
KV-Aware Routing
SLA-Based Planner
MultimodalVision models
LoRA🚧Experimental

Quick Start

Prerequisites

Usage

$# Add minimal usage example from existing backend docs
$# Example pattern (vLLM):
$# python -m dynamo.vllm --model <model-name>
$# Example pattern (SGLang):
$# python -m dynamo.sglang --model <model-name>

Kubernetes

1# Add DGDR example - use apiVersion: nvidia.com/v1beta1
2# See recipes/ folder for production examples

Configuration

ParameterDefaultDescription

Next Steps

DocumentPathDescription
<Backend> Guide<backend>_guide.mdAdvanced configuration
Backend Comparison../README.mdCompare backends