NIM Operator Deployment#
This guide walks you through deploying NVIDIA cuOpt as a Kubernetes service using the NIM Operator.
Overview#
The NIM Operator simplifies the deployment and management of NVIDIA NIM microservices on Kubernetes. This deployment method provides:
Automated lifecycle management - The operator handles scaling, updates, and health checks
Native Kubernetes integration - Uses standard K8s patterns (CRDs, services, ingress)
GPU resource management - Leverages the GPU Operator for optimal GPU utilization
Built-in monitoring - Prometheus metrics and service monitors
Supported GPUs#
The following NVIDIA GPUs are supported:
GPU |
Architecture |
Notes |
|---|---|---|
A100 |
Ampere |
Recommended for production |
H100 |
Hopper |
High performance |
H200 |
Hopper |
High performance with extended memory |
B200 |
Blackwell |
Latest generation |
RTX 6000 Pro |
Ada Lovelace |
Workstation GPU |
Quick Start#
Complete all prerequisites
Set your NGC API Key:
export NGC_API_KEY=<your-ngc-api-key>
Run the deployment script:
./deploy.sh
Verify the deployment:
kubectl get nimservice -n nim-service kubectl get pods -n nim-service