For AI agents: a documentation index is available at the root level at /llms.txt and /llms-full.txt. Append /llms.txt to any URL for a page-level index, or .md for the markdown version of any page.
    • Overview
  • Before You Deploy
    • Infrastructure Sizing
    • Manifest
  • Deployment
    • Installation Overview
    • Image Mirroring
    • Terraform Installation
    • Helmfile Installation
  • GPU Cluster Setup
    • GPU Cluster Setup
    • Self-Managed Clusters
  • Configuration
    • Optional Enhancements
    • Gateway Routing
    • Third-Party Registries
    • Cluster Configuration
    • KAI Scheduler
  • Using Cloud Functions
    • API
    • Function Creation
    • Container Functions
    • Helm Functions
    • Streaming Functions
    • CLI
  • Observability
    • Observability
    • Example Dashboards
  • Operations
    • Control Plane Operations
    • Cluster Monitoring
    • Troubleshooting
  • Runbooks
    • Runbooks
    • Key Rotation
  • Reference
    • Cluster Reference
    • gRPC Load Testing
    • HTTP Soak Testing
  • Development
    • Local Development
    • Fake GPU Operator
NVIDIANVIDIA
Developer-friendly docs for your API
Privacy Policy | Your Privacy Choices | Terms of Service | Accessibility | Corporate Policies | Product Security | Contact

Copyright © 2026, NVIDIA Corporation.

LogoLogoCloud Functions

NVIDIA Cloud Functions

||View as Markdown|

This guide provides information for deploying and operating NVCF in self-managed environments.

  • Deployment : Install the NVCF control plane and connect GPU clusters.
  • Configuration : Configure gateway routing, registries, and optional enhancements.
  • Using Cloud Functions : Create and invoke functions using the NVCF API and CLI.

Decoupled control plane deployments (GPU cluster separate from the control plane cluster) are not available in Early Access. All EA deployments use a co-located architecture where the control plane and GPU workloads run in the same cluster.

Next

Infrastructure Sizing