Kubernetes Deployment#
NIM LLM 2.0 supports deployment on Kubernetes using Helm charts, NIM Operator, and other orchestration platforms. Select your deployment or platform below for detailed setup instructions.
Note
For CSP-managed Kubernetes environments (such as GKE, EKS, and AKS), refer to the CSP Deployment guides.
Deployment Options#
Use one of the following guides to deploy NVIDIA NIM for LLMs on Kubernetes:
Deploy NVIDIA NIM for LLMs on Kubernetes by using the NIM Helm chart.
Deploy NVIDIA NIM for LLMs on KServe in a Kubernetes environment.
Deploy NVIDIA NIM for LLMs on Red Hat OpenShift, including GPU operator setup and verification.
Deploy NVIDIA NIM for LLMs on Run:ai for inference workloads.
Deploy NVIDIA NIM for LLMs by using the NVIDIA NIM Operator on Kubernetes.