Prerequisites#

This document covers all prerequisites needed before deploying CuOpt with the NIM Operator.

Kubernetes Cluster Setup#

You need a Kubernetes cluster with GPU-enabled nodes. Choose one of the following installation methods:

Standard Kubernetes installation using kubeadm. Follow the official Kubernetes documentation.

Use NVIDIA Cloud Native Stack Ansible playbooks for automated setup including GPU Operator:

git clone https://github.com/NVIDIA/cloud-native-stack.git
cd cloud-native-stack/playbooks
# Follow the playbook instructions

This method automatically deploys necessary operators including the GPU Operator.

For local development and testing:

minikube start --driver=docker --gpus all

If not using Cloud Native Stack, install the GPU Operator manually.

helm repo add nvidia https://helm.ngc.nvidia.com/nvidia
helm repo update

helm install --wait --generate-name \
   -n gpu-operator --create-namespace \
   nvidia/gpu-operator

This typically takes 3-5 minutes to install the driver and set up the cloud native stack for GPU usage.

kubectl get pods -n gpu-operator

All pods should be in Running state.

CuOpt requires persistent storage. Deploy a storage provisioner if your cluster doesn’t have one.

kubectl apply -f https://raw.githubusercontent.com/rancher/local-path-provisioner/v0.0.31/deploy/local-path-storage.yaml

Wait for the provisioner to be ready:

kubectl rollout status deployment/local-path-provisioner -n local-path-storage --timeout=120s

kubectl patch storageclass local-path -p '{"metadata": {"annotations":{"storageclass.kubernetes.io/is-default-class":"true"}}}'

kubectl get storageclass

You should see local-path marked as (default).

For production deployments, consider:

kubectl create namespace nim-operator

helm upgrade --install nim-operator nvidia/k8s-nim-operator \
    -n nim-operator \
    --version=3.0.2

kubectl get pods -n nim-operator
kubectl get crd | grep nvidia

You should see the nimservices.apps.nvidia.com CRD registered.

You need an NGC API key to pull NVIDIA container images.

export NGC_API_KEY=<your-api-key>

For persistent configuration, add to your shell profile:

echo 'export NGC_API_KEY=<your-api-key>' >> ~/.bashrc
source ~/.bashrc

Before proceeding with CuOpt deployment, verify:

Once all prerequisites are met, proceed to deployment.