For AI agents: a documentation index is available at the root level at /llms.txt and /llms-full.txt. Append /llms.txt to any URL for a page-level index, or .md for the markdown version of any page.
    • Overview
    • Quickstart
  • Before You Deploy
    • Infrastructure Sizing
    • Manifest
  • Deployment
    • Installation Overview
    • Image Mirroring
    • Helmfile Installation
  • GPU Cluster Setup
    • GPU Cluster Setup
    • Self-Managed Clusters
  • Configuration
    • Optional Enhancements
    • LLM Function Enablement
    • Gateway Routing
    • Third-Party Registries
    • Registry Allowlist
    • Cluster Configuration
    • KAI Scheduler
  • Using Cloud Functions
    • API
    • Service Keys
    • Function Creation
    • LLM Gateway
    • Generic HTTP Function Invocation
    • gRPC Function Invocation
    • Container Functions
    • Helm Functions
    • Streaming Functions
    • Configure Autoscaling
    • CLI
  • Function Autoscaling
    • Function Autoscaling Overview
    • Architecture
    • Operations
    • Observability
  • Observability
    • Observability
    • Example Dashboards
      • Metrics Overview
      • Cassandra
      • ESS
      • Init Container
      • Invocation Service
      • LLM API Gateway
      • LLM Function Invocation Metrics Report
      • LLM Request Router
      • NVCF API
      • SIS/Spot
      • State Metrics
      • Utils Container
      • Vault/OpenBao
  • Operations
    • Control Plane Operations
    • Cluster Monitoring
    • Troubleshooting
  • Runbooks
    • Runbooks
    • Key Rotation
  • Reference
    • Cluster Reference
    • gRPC Load Testing
    • gRPC Load Test SLI Guide
    • HTTP Load Testing
    • HTTP Load Test SLI Guide
    • HTTP Soak Testing
  • Development
    • Architecture Overview
    • Local Development
    • Fake GPU Operator
    • Release Process
  • Managed (Legacy)
    • Function Lifecycle
    • Observability
NVIDIANVIDIA
Developer-friendly docs for your API
Privacy Policy | Your Privacy Choices | Terms of Service | Accessibility | Corporate Policies | Product Security | Contact

Copyright © 2026, NVIDIA Corporation.

LogoLogoCloud Functions
On this page
  • Control plane services
  • LLM services
  • Per-function containers
  • Datastores
  • See also
ObservabilityMetrics

Metrics Overview

||View as Markdown|
Previous

Example Dashboards

Next

Cassandra

Per-service metrics reference for the NVCF self-hosted control plane. Each linked page lists metric names, types, sources, descriptions, and the labels and filters that make the metric useful in queries and dashboards.

Control plane services

  • NVCF API: request rates, response status codes, and log event counts for the NVCF API service.
  • Invocation Service: HTTP request counts, durations, and invocation error metrics for the invocation path.
  • ESS: template rendering counters and HTTP client metrics for the Encrypted Secrets Service.
  • State Metrics Service: per-function instance count, stage durations, request latency, and function metadata.
  • SIS/Spot: HTTP client metrics for the Spot Instance Service.
  • Function Autoscaler: OpenTelemetry metrics emitted by the function autoscaler service.

LLM services

  • LLM API Gateway: request and routing metrics for the LLM API gateway.
  • LLM Function Invocation Metrics Report: end-to-end LLM invocation path report.
  • LLM Request Router: request router metrics for LLM traffic.

Per-function containers

  • Init Container: restart counts and termination reasons for function init containers.
  • Utils Container: restart counts, termination reasons, and worker service response metrics for function utils containers.

Datastores

  • Cassandra: client request latency, timeouts, authentication failures, and endpoint connection metrics.
  • Vault/OpenBao: pointer to upstream OpenBao telemetry documentation.

See also

  • Observability for logging, tracing, and overall observability configuration.
  • Example Dashboards for reference Grafana dashboards.