NIM Operator Deployment#

This guide walks you through deploying NVIDIA cuOpt as a Kubernetes service using the NIM Operator.

Overview#

The NIM Operator simplifies the deployment and management of NVIDIA NIM microservices on Kubernetes. This deployment method provides:

Automated lifecycle management - The operator handles scaling, updates, and health checks
Native Kubernetes integration - Uses standard K8s patterns (CRDs, services, ingress)
GPU resource management - Leverages the GPU Operator for optimal GPU utilization
Built-in monitoring - Prometheus metrics and service monitors

The following NVIDIA GPUs are supported:

GPU	Architecture	Notes
A100	Ampere	Recommended for production
H100	Hopper	High performance
H200	Hopper	High performance with extended memory
B200	Blackwell	Latest generation
RTX 6000 Pro	Ada Lovelace	Workstation GPU

Verify the deployment:

kubectl get nimservice -n nim-service
kubectl get pods -n nim-service