Set Up Using Deployment Scripts#
Start a minikube cluster and install the NeMo microservices platform using the automated deployment scripts provided by NVIDIA.
Note
This minikube cluster setup tutorial is designed for the Beginner Platform Tutorials that run small workloads of fine-tuning, evaluating, and running inference on smaller LLMs such as llama-3.1-8b-instruct and meta-llama/llama-3.2-1b-instruct. If you want to run AI workloads at a larger scale, set up the NeMo microservices platform on a larger Kubernetes cluster. For more information, refer to About Admin Setup.
Before You Begin#
Check the Demo Cluster Requirements before you begin.
To Start Minikube Cluster and Install NeMo Microservices Platform#
Follow the steps below to start a minikube cluster and install the NeMo microservices platform using the automated deployment scripts provided by NVIDIA.
- Download the following files in a directory on your local machine. - create-nmp-deployment.sh: NeMo platform deployment script. Use this script to set up a minikube cluster and install the NeMo microservices platform on it.
- destroy-nmp-deployment.sh: Clean-up script. Use this script to delete the minikube cluster and the local files used during the deployment.- Note - The default values in the NeMo Microservices Helm Chart are for a minimal setup of the NeMo microservices platform on a single node cluster that can run the Beginner Platform Tutorials. For production-grade deployments, you need additional configurations for: - Database disaster recovery. 
- Multi-node training support in Volcano. 
- Persistent volume claims (PVC) with proper storage classes. 
 - For production-grade deployments, refer to the Admin Setup section. - Note - The NeMo Data Store microservice, configured with the demo values, includes a 2GB persistent volume that can accommodate roughly twenty LoRA adapters. Attempting to run multiple LoRA fine-tuning sessions or even a single full supervised fine-tuning (SFT) job within this demo setup will result in failure. 
 
- Make the scripts executable. - chmod +x create-nmp-deployment.sh destroy-nmp-deployment.sh 
- Create an NGC API key following the instructions at Generating NGC API Keys. 
- Export the NGC API key into your shell environment using the following command: - export NGC_API_KEY=<your-ngc-api-key> 
- Go to build.nvidia.com and generate an NVIDIA API key. 
- Export the NVIDIA API key into your shell environment using the following command: - export NVIDIA_API_KEY=<your-nvidia-api-key> 
- Run the - create-nmp-deployment.shscript to set up the minikube cluster and the NeMo microservices platform.- ./create-nmp-deployment.sh 
- Move onto the Beginner Platform Tutorials to learn how to use the capabilities of the NeMo microservices. 
- After you’re done with the Beginner Platform Tutorials, run the - destroy-nmp-deployment.shscript to delete everything.- ./destroy-nmp-deployment.sh 
NeMo Platform Deployment Script Overview#
The create-nmp-deployment.sh script automates the deployment of the NeMo microservices platform on a minikube cluster. Here’s a detailed breakdown of its functionality:
| Phase | Description | 
|---|---|
| Phase 0: Run pre-flight checks | 
 | 
| Phase 1: Set up minikube | 
 | 
| Phase 2: Create secrets | 
 | 
| Phase 3: Install the Helm chart | 
 | 
| Phase 4: Verify pod health | 
 | 
| Phase 5: Configure DNS | 
 | 
| Phase 6: Deploy Meta LLaMA NIM | 
 | 
| Phase 7: Wait for NIM readiness | 
 | 
| Phase 8: Verify the NIM endpoint | 
 | 
Script Usage#
./create-nmp-deployment.sh [OPTIONS]
Options:
  --helm-chart-url URL    [Required] Helm chart URL
  --values-file FILE      [Required] Specify values file(s)
  --help                  Show help message
Deploy to a Production-Grade Kubernetes Cluster#
If you have completed this minikube tutorial and want to deploy the NeMo microservices platform to a production-grade Kubernetes cluster, proceed to the Admin Setup section.