Requirements

The following requirements are specific to setting up the entire NeMo microservices platform on a minikube cluster for running the Beginner Platform Tutorials.

Adjust these requirements to match your environment, the combination of NeMo microservices you deploy, and the scale of your AI workload.

System Requirements

The following are the common requirements for running the getting started tutorials.

  • A single-node NVIDIA GPU cluster on a Linux host with cluster-admin permissions.

  • At least 300 GB of free disk space.

  • Two NVIDIA GPUs (B200 80B, A100 80 GB, or H100 80 GB) with no other workloads running on them:

    • One GPU for machine learning model fine-tuning.

    • One GPU for a meta/llama-3.1-8b-instruct NIM microservice for inference.
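
The disk and GPU requirements above can be checked before you create the cluster. The following is a minimal sketch, not an official preflight tool: the function names and the 80,000 MiB memory threshold are assumptions made for illustration, and it relies on `nvidia-smi` being on the `PATH` of the Linux host.

```python
import shutil
import subprocess

MIN_FREE_GB = 300          # free disk space from the list above
MIN_GPU_COUNT = 2          # one GPU for fine-tuning, one for the NIM
MIN_GPU_MEM_MIB = 80_000   # assumed cutoff for an "80 GB" class GPU


def free_disk_gb(path="/"):
    """Return free space on the file system that holds `path`, in GB (10**9 bytes)."""
    return shutil.disk_usage(path).free / 10**9


def parse_gpu_csv(text):
    """Parse 'name, memory.total' CSV rows into (name, MiB) tuples."""
    gpus = []
    for line in text.strip().splitlines():
        # Split on the last comma so a comma inside a GPU name is preserved.
        name, mem = line.rsplit(",", 1)
        gpus.append((name.strip(), int(mem.strip())))
    return gpus


def query_gpus():
    """Ask nvidia-smi for the name and total memory (MiB) of each GPU."""
    out = subprocess.run(
        ["nvidia-smi", "--query-gpu=name,memory.total",
         "--format=csv,noheader,nounits"],
        capture_output=True, text=True, check=True,
    ).stdout
    return parse_gpu_csv(out)


def meets_gpu_requirement(gpus):
    """Return True if enough GPUs reach the assumed memory threshold."""
    large = [g for g in gpus if g[1] >= MIN_GPU_MEM_MIB]
    return len(large) >= MIN_GPU_COUNT
```

On the host, you might call `free_disk_gb("/") >= MIN_FREE_GB` and `meets_gpu_requirement(query_gpus())`. The sketch does not check whether other workloads are already running on the GPUs, so verify that separately.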

Software Requirements

Choose one of the following options to set up a demo cluster with the NeMo microservices platform.

NVIDIA developed and tested this tutorial on a minikube cluster that meets the following prerequisites.

The minikube cluster setup tutorial uses the following minikube features:

  • minikube ingress.

  • Standard storage class using host path volumes provided by the default storage provisioner.

    The host file system for the host path volumes must support file locking. During customization with NeMo Customizer, NeMo Operator starts an entity handler pod that runs the Hugging Face CLI. The CLI requires a file system, such as EXT4, that supports file locking.
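
To confirm that a directory sits on a file system with working file locks, you can try to take a lock on a temporary file there. The following is a minimal sketch under one assumption: an exclusive advisory `flock` is representative of the locking that the Hugging Face CLI needs. The function name `supports_file_locking` is hypothetical, and the `fcntl` module is available only on Unix-like hosts.

```python
import fcntl
import os
import tempfile


def supports_file_locking(directory):
    """Return True if an exclusive advisory lock can be taken on a file in `directory`."""
    fd, path = tempfile.mkstemp(dir=directory)
    try:
        # LOCK_NB makes the call fail immediately instead of blocking.
        fcntl.flock(fd, fcntl.LOCK_EX | fcntl.LOCK_NB)
        fcntl.flock(fd, fcntl.LOCK_UN)
        return True
    except OSError:
        # File systems without lock support typically raise an OSError here.
        return False
    finally:
        os.close(fd)
        os.remove(path)
```

Pass the host directory that backs your host path volumes. A result of `False` suggests that the volumes should be placed on a different file system, such as EXT4.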