LLM Request Router Metrics
The LLM Request Router serves Prometheus metrics from
llm-request-router:9090/metrics when
llmRequestRouter.metrics.enabled is true.
The self-managed stack maps global.observability.metrics.enabled to this chart
value. The request-router chart runs Stargate with
--metrics-prefix=llm_request_router_, so deployed metric names use the
llm_request_router_ prefix instead of the upstream default stargate_ prefix.
The chart also sets the trace service name with
--otel-service-name=llm-request-router.
Label Boundaries
Use bounded labels only. Keep routing_key, model, inference_server_id,
status, result, and reason to bounded service dimensions. Do not add
request IDs, session IDs, function IDs, organization IDs, project IDs, raw URLs,
raw prompts, authorization values, or other unbounded request fields as metric
labels.