Overview#

NVIDIA Cloud Functions (NVCF) is a serverless API to deploy & manage AI workloads on GPUs, which provides security, scale and reliability to your workloads. The API to access the workloads is un-opinionated and supports HTTP polling, HTTP streaming & gRPC.

Cloud Functions is available via the NGC Portal. To gain access to Cloud Functions, talk to your NVIDIA Account Manager.

Cloud Functions is primarily suited for shorter running, preemptable workloads such as inferencing and fine-tuning.

See Function Lifecycle for key concepts and terminology.

Function Types#

Cloud Functions supports two different workload and deployment types for defining functions - “Container” and “Helm Chart” function types.

The workloads are ephemeral and preemptable, so ensure you are not running long tasks, such as those spanning several hours, without expecting to save your work to the local disk.

Cloud Function models, containers, helm charts, and any additional resources are hosted by and pulled from NGC Private Registry.

See Function Creation for an in-depth overview of each function type.

Getting Started#

The best way to get started with Cloud Functions is to follow along with the Functions Quickstart.