📚 Official Documentation: vLLM Metrics Design
This document describes how vLLM Prometheus metrics are exposed in Dynamo.
When running vLLM through Dynamo, vLLM engine metrics are automatically passed through and exposed on Dynamo’s /metrics endpoint (default port 8081). This allows you to access both vLLM engine metrics (prefixed with vllm:) and Dynamo runtime metrics (prefixed with dynamo_*) from a single worker backend endpoint.
For the complete and authoritative list of all vLLM metrics, always refer to the official documentation linked above.
Dynamo runtime metrics are documented in docs/observability/metrics.md.
The official documentation includes:
model_name, finished_reason, scheduling_event)vLLM provides metrics in the following categories (all prefixed with vllm:):
Note: Specific metrics are subject to change between vLLM versions. Always refer to the official documentation or inspect the /metrics endpoint for your vLLM version.
vLLM metrics are automatically exposed when running vLLM through Dynamo with metrics enabled.
To see the actual metrics available in your vLLM version:
Metrics will be available at: http://localhost:8081/metrics
Note: The specific metrics shown below are examples and may vary depending on your vLLM version. Always inspect your actual /metrics endpoint for the current list.
prometheus_client.multiprocessPROMETHEUS_MULTIPROC_DIR: (optional). By default, Dynamo automatically manages this environment variable, setting it to a temporary directory where multiprocess metrics are stored as memory-mapped files. Each worker process writes its metrics to separate files in this directory, which are aggregated when /metrics is scraped. Users only need to set this explicitly where complete control over the metrics directory is required.MultiProcessCollector to aggregate metrics from all worker processesvllm: and lmcache: prefixes before being exposed (when LMCache is enabled)register_engine_metrics_callback() function with the global REGISTRYdynamo_* for runtime, components, endpoints, and namespaces
lib/runtime/src/metrics.rs (Rust runtime metrics)lib/runtime/src/metrics/prometheus_names.rs (metric name constants)/metrics endpoint alongside vLLM metricscomponents/src/dynamo/common/utils/prometheus.py - Prometheus utilities and callback registration