Deployment Guide

The NVIDIA Cloud Native Service Add-on Pack is a set of packaged components for AI Workflows. These include enterprise-ready implementation examples for authentication, monitoring, reporting, and load balancing, while leaving a path for you to deviate where your requirements differ.

These packaged components follow guidelines for enterprise production requirements and serve as standard building blocks, compatible with NVIDIA’s AI frameworks, for building and deploying AI solutions as microservices. The guidelines generally fall within the following categories:

  • Deployment and Orchestration

    • OCI-Compliant Container Images

    • Liveness/Readiness/Startup Probes

    • Security and Vulnerability Scanning/Patching

  • Security

    • OIDC/OAuth2 User Authentication

    • External Secrets Management

    • Secure API Endpoints

  • Networking

    • Ingress Control

    • Proxy Sidecar

  • Logging and Reporting

    • OpenTelemetry Protocol (OTLP) monitoring

    • OTLP support within application containers

    • Log Aggregation

The packaged components within AI Workflows also include the AI framework specific to your use case, which is delivered as an OCI-compliant base container image. The following graphic illustrates the additional opinionated components which are included within NVIDIA AI Workflows to meet the above guideline requirements:

[Figure: opinionated components included within NVIDIA AI Workflows]

Keycloak

Keycloak is an open-source identity provider that supplies standard OIDC/OAuth2 user management and authentication. It can also federate with other identity providers, allowing customers to connect their existing environments to the Workflows with minimal additional development.

Cert-Manager

Cert-Manager is deployed with a custom Certificate Authority that generates and rotates the HTTPS certificates used by applications and AI Workflow components.

Trust Manager

Trust Manager is deployed to inject the Certificate Authority’s public key into AI Workflow or application namespaces.

Ingress Controller

An HAProxy ingress controller is deployed with a wildcard DNS certificate to manage access to the services deployed within the cluster.
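
As an illustrative sketch, an application could expose itself under the wildcard domain with a standard Kubernetes Ingress resource like the following; the ingressClassName, host, and service values here are assumptions for this example, not part of the add-on pack:

# Illustrative only: route a hostname under the wildcard domain to a service
apiVersion: networking.k8s.io/v1
kind: Ingress
metadata:
  name: my-app
  namespace: my-namespace
spec:
  ingressClassName: haproxy
  rules:
    - host: my-app.my-cluster.my-domain.com
      http:
        paths:
          - path: /
            pathType: Prefix
            backend:
              service:
                name: my-app
                port:
                  number: 8080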

Prometheus

A Prometheus operator and a centralized Prometheus service have been deployed on the cluster to scrape metrics from the application services and provide an OTLP-compliant monitoring system and database.
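
As a sketch, an application service can surface its metrics to this Prometheus instance through the standard Prometheus Operator ServiceMonitor resource. The names, labels, and scrape port below are illustrative, and whether the centralized instance selects ServiceMonitors from your namespace depends on its configuration:

# Illustrative only: scrape a service that exposes /metrics on a named port
apiVersion: monitoring.coreos.com/v1
kind: ServiceMonitor
metadata:
  name: my-app-metrics
  namespace: my-namespace
spec:
  selector:
    matchLabels:
      app: my-app
  endpoints:
    - port: metrics   # named port on the Service that serves /metrics
      interval: 30s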

Grafana

A Grafana operator and a centralized Grafana service are deployed and can be used to create and host dashboards visualizing the appropriate metrics and monitoring data for the particular use case or AI Workflow. The centralized Grafana service is connected to the centralized Prometheus server by default.

Postgres Operator

The Postgres Operator from Crunchy Data has been deployed for creating relational databases. One Postgres database is instantiated and used to back Keycloak.
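
As a rough sketch, an additional database could be requested through the operator’s PostgresCluster custom resource along these lines; the names and sizes are illustrative, and the exact required fields vary by operator version:

# Illustrative only: request a small Postgres cluster from the operator
apiVersion: postgres-operator.crunchydata.com/v1beta1
kind: PostgresCluster
metadata:
  name: my-db
  namespace: my-namespace
spec:
  postgresVersion: 14
  instances:
    - name: instance1
      dataVolumeClaimSpec:
        accessModes:
          - ReadWriteOnce
        resources:
          requests:
            storage: 1G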

Elastic Operator

The Elastic Operator for Elasticsearch, Kibana, and other Elastic tools has been deployed on the cluster. No Elastic services are configured out of the box, but they can be provisioned by applications or AI Workflows if needed.
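
For example, a minimal single-node Elasticsearch cluster could be requested through the operator’s custom resource roughly as follows; the version, sizing, and names are illustrative:

# Illustrative only: request a single-node Elasticsearch cluster
apiVersion: elasticsearch.k8s.elastic.co/v1
kind: Elasticsearch
metadata:
  name: my-logs
  namespace: my-namespace
spec:
  version: 8.6.0
  nodeSets:
    - name: default
      count: 1
      config:
        node.store.allow_mmap: false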

Note

Each user is responsible for checking the content and the applicable licenses of third party software and determining if they are suitable for the intended use.

To deploy NVIDIA Cloud Native Service Add-on Pack, the following requirements must be met:

DNS

A wildcard DNS A record must be created for the system along with the DNS A record for the system itself. Reverse lookup PTR records should also exist for both entries. Both DNS records should be resolvable within and outside of the system.
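
A quick way to sanity-check the records, assuming the example domain used later in this guide (203.0.113.10 stands in for your system’s actual address):

# Forward lookups: the host itself and an arbitrary name under the wildcard
dig +short my-cluster.my-domain.com
dig +short anything.my-cluster.my-domain.com

# Reverse lookup for the system's address
dig +short -x 203.0.113.10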


Kubernetes

The Cloud Native Service Add-on Pack requires a Kubernetes (K8s) cluster to deploy to. A K8s distribution such as NVIDIA Cloud Native Stack must be available before deployment.

Networking

This guide assumes that the cluster will be externally accessible through ports 22 and 443. Additional ports may be required for your specific use case.
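
To verify external reachability, a check along these lines can be run from a machine outside the cluster (the hostname is illustrative):

# Confirm SSH and HTTPS are reachable from outside the cluster
nc -zv my-cluster.my-domain.com 22
nc -zv my-cluster.my-domain.com 443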

Storage

A storage class must be available on the K8S cluster for the Cloud Native Service Add-on Pack to be configured to use. A simple local storage solution such as Local Path Provisioner can be used. Alternatively, storage classes provided by third-party Kubernetes-based platforms such as Red Hat OpenShift or VMware Tanzu will also work.
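
As a sketch, Local Path Provisioner can be installed and the resulting storage class confirmed as follows; the manifest URL reflects the upstream project and may change:

kubectl apply -f https://raw.githubusercontent.com/rancher/local-path-provisioner/master/deploy/local-path-storage.yaml
kubectl get storageclass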

NVIDIA AI Enterprise

Since NVIDIA AI Workflows are available on NVIDIA NGC for NVIDIA AI Enterprise software customers, you must have an NVIDIA AI Enterprise entitlement to pull down the resources required for the workflow:

Warning

NVIDIA AI Enterprise licensing is required for accessing AI Workflow resources.

Note

Cloud service providers may include licenses through on-demand NVIDIA AI Enterprise instances.

  1. Ensure that the prerequisite requirements are met. Instructions to provision an example cluster based on NVIDIA Cloud Native Stack can be found in the AI Workflow Guides, or NVIDIA Cloud Native Stack can be manually installed using the guides found here: https://github.com/NVIDIA/cloud-native-stack.

  2. Ensure that a kubeconfig is available and set via the KUBECONFIG environment variable or your user’s default location (.kube/config).
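
    For example, a minimal sketch (the path shown is the default location):

    export KUBECONFIG=$HOME/.kube/config
    kubectl get nodes   # confirm the cluster is reachable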

  3. Download the NVIDIA Cloud Native Service Add-on Pack from the Enterprise Catalog (available here) onto the instance you have provisioned.

    ngc registry resource download-version "nvaie/nvidia_cnpack:0.2.1"

    Note

    If you have not yet installed and set up the NGC CLI with your API key, please do so before downloading the resource. Instructions can be found here.
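
    As a rough sketch, first-time CLI configuration typically looks like the following, which interactively prompts for your API key, org, and team:

    ngc config set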


  4. Navigate to the installer’s directory using the following command:

    cd nvidia_cnpack_v0.2.1


  5. Create a config file for the installation using the following template. Ensure that the environment-specific fields, such as wildcardDomain and storageClassName, are customized for your specific instance.

    nano config.yaml

    apiVersion: v1alpha1
    kind: NvidiaPlatform
    spec:
      platform:
        wildcardDomain: "*.my-cluster.my-domain.com"
        externalPort: 443
      ingress:
        enabled: true
      postgres:
        enabled: true
      certManager:
        enabled: true
      trustManager:
        enabled: true
      keycloak:
        databaseStorage:
          accessModes:
            - ReadWriteOnce
          resources:
            requests:
              storage: 1G
          storageClassName: local-path
          volumeMode: Filesystem
      prometheus:
        storage:
          accessModes:
            - ReadWriteOnce
          resources:
            requests:
              storage: 1G
          storageClassName: local-path
          volumeMode: Filesystem
      grafana:
        enabled: true
      elastic:
        enabled: true

    Note

    If you installed local-path-provisioner, the storageClassName can be left as shown: local-path


  6. Make the installer executable via the following command:

    chmod +x ./nvidia-cnpack-linux-x86_64


  7. Run the following command on the instance to set up NVIDIA Cloud Native Service Add-on Pack:

    ./nvidia-cnpack-linux-x86_64 create -f config.yaml


  8. Once the install is complete, check that all the pods are healthy via the following command:

    kubectl get pods -A


    The output should look similar to the screenshot below:

    [Screenshot: example kubectl get pods -A output]

  9. As part of the installation, the installer creates the nvidia-platform and nvidia-monitoring namespaces, which contain most of the components and information required for interacting with the deployed services.

    • The default Keycloak instance URL is at: https://auth.my-cluster.my-domain.com

    • Default admin credentials can be found within the nvidia-platform namespace, in a secret called keycloak-initial-admin via the following commands:

    kubectl get secret keycloak-initial-admin -n nvidia-platform -o jsonpath='{.data.username}' | base64 -d
    kubectl get secret keycloak-initial-admin -n nvidia-platform -o jsonpath='{.data.password}' | base64 -d

    • The default Grafana instance URL is at: https://dashboards.my-cluster.my-domain.com

    • The default Grafana credentials can be found within the nvidia-monitoring namespace, in a secret called grafana-admin-credentials via the following commands:

    kubectl get secret grafana-admin-credentials -n nvidia-monitoring -o jsonpath='{.data.GF_SECURITY_ADMIN_USER}' | base64 -d
    kubectl get secret grafana-admin-credentials -n nvidia-monitoring -o jsonpath='{.data.GF_SECURITY_ADMIN_PASSWORD}' | base64 -d
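
    If preferred, these secret values can be captured into shell variables for scripting; the variable names below are illustrative:

    KEYCLOAK_ADMIN_PASSWORD=$(kubectl get secret keycloak-initial-admin -n nvidia-platform -o jsonpath='{.data.password}' | base64 -d)
    GRAFANA_ADMIN_PASSWORD=$(kubectl get secret grafana-admin-credentials -n nvidia-monitoring -o jsonpath='{.data.GF_SECURITY_ADMIN_PASSWORD}' | base64 -d)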


  10. You can configure the components and services installed on the cluster as required for your use case. Specific examples can be found in the NVIDIA AI Workflow Guides.

Installer Flags and Commands

  • Available flags:

    -h, --help

    -v, --version

  • Available Commands:

    • Completion - Generates the autocompletion script for ./nvidia-cnpack-linux-x86_64 for the specified shell. See each sub-command’s help for details on how to use the generated script.

      Usage:

      ./nvidia-cnpack-linux-x86_64 completion [command]

      Available Commands:

      bash - Generate the autocompletion script for bash

      fish - Generate the autocompletion script for fish

      powershell - Generate the autocompletion script for powershell

      zsh - Generate the autocompletion script for zsh

      Flags:

      -h, --help - Help for completion

    • Create/Install - Creates the NVIDIA cloud-native platform.

      -d, --directory - String, if non-empty, write working files to this directory. (default ".")

      -f, --filename - String, the path to a file that contains the configuration to apply.

      -h, --help - Help for create

      --kubeconfig - String, the path to the kubeconfig file to use for CLI requests. By default, the installer will look for a KUBECONFIG environment variable to determine the location of the kubeconfig, followed by the default $HOME/.kube/config location, unless the kubeconfig location is specified manually via this flag.

      -v, --verbose - Enables more detailed logging for debugging purposes.

    • Delete - Deletes the NVIDIA cloud-native platform.

      Usage:

      ./nvidia-cnpack-linux-x86_64 delete [flags]

      Aliases:

      delete, destroy

      Flags:

      -d, --directory - String, if non-empty, write working files to this directory. (default ".")

      -h, --help - Help for delete

      --kubeconfig - String, the path to the kubeconfig file to use for CLI requests. By default, the installer will look for a KUBECONFIG environment variable to determine the location of the kubeconfig, followed by the default $HOME/.kube/config location, unless the kubeconfig location is specified manually via this flag.

      -v, --verbose - Increase the verbosity.

Enabling/Disabling Components

Most of the components deployed by the installer can be enabled or disabled by setting the corresponding "enabled" value within the configuration YAML created earlier.

Use the following template as an example, where Grafana and Elasticsearch have been disabled:

apiVersion: v1alpha1
kind: NvidiaPlatform
spec:
  platform:
    wildcardDomain: "*.my-cluster.my-domain.com"
    externalPort: 443
  ingress:
    enabled: true
  postgres:
    enabled: true
  certManager:
    enabled: true
  trustManager:
    enabled: true
  keycloak:
    databaseStorage:
      accessModes:
        - ReadWriteOnce
      resources:
        requests:
          storage: 1G
      storageClassName: local-path
      volumeMode: Filesystem
  prometheus:
    storage:
      accessModes:
        - ReadWriteOnce
      resources:
        requests:
          storage: 1G
      storageClassName: local-path
      volumeMode: Filesystem
  grafana:
    enabled: false
  elastic:
    enabled: false

Ingress Controller Default Certificate Configuration

As part of the HAProxy ingress controller deployment, a secret called nvidia-ingress-kubernetes-ingress-default-cert is created in the nvidia-platform namespace; it contains the TLS certificate and TLS key used for the wildcard domain name. This certificate can be replaced by a certificate of the user’s choosing that is signed for the wildcard domain name *.my-cluster.my-domain.com.
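
As a sketch, the secret could be replaced with your own certificate and key along these lines; the file names are illustrative, and the ingress controller may need to be restarted to pick up the change:

# Illustrative only: swap in your own signed wildcard certificate
kubectl delete secret nvidia-ingress-kubernetes-ingress-default-cert -n nvidia-platform
kubectl create secret tls nvidia-ingress-kubernetes-ingress-default-cert \
  -n nvidia-platform \
  --cert=wildcard.crt \
  --key=wildcard.key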

© Copyright 2022-2023, NVIDIA. Last updated on Mar 20, 2023.