Helm deployment#

This guide covers deploying VSS developer profiles on Kubernetes using Helm. It is an alternative to the Docker Compose method used in Quickstart and the Agent Workflows guides.

Install steps, values files, GPU sizing, ingress setup, and troubleshooting for each profile are documented in the chart README.md under deploy/helm/developer-profiles/ in the video-search-and-summarization GitHub repository.

Available developer profiles#

base
alerts
search
lvs

Prerequisites#

A Kubernetes cluster (version 1.34 or greater) you can reach with kubectl
Helm 3.x
NVIDIA GPU Operator on the cluster
NVIDIA NIM Operator when deploying in-cluster LLM and VLM models
StorageClass - for PVC provisioning
NGC API key — for image pull and NIM model access

GPU requirements depend on the profile and values you choose (for example, in-cluster NIMs vs remote LLM/VLM endpoints). See the GPU requirements section in the chart README for the profile you are deploying.

Steps to deploy a profile#

Clone the blueprint repository:

git clone https://github.com/NVIDIA-AI-Blueprints/video-search-and-summarization.git
cd video-search-and-summarization
git checkout tags/v3.2.0
git lfs install
git lfs pull

Change into the Helm developer-profiles directory:
```
cd ./deploy/helm/developer-profiles
```

Open the chart README.md for the profile you are deploying and follow the installation and uninstall instructions there:

Workflow	Developer Profile	Chart documentation
Q&A and Report Generation	`base`	dev-profile-base
Alerts	`alerts`	dev-profile-alerts
Search	`search`	dev-profile-search
Video Summarization	`lvs`	dev-profile-lvs

Note

After helm upgrade --install, wait until all pods in the release namespace are in Running state and show Ready (for example, kubectl get pods -n <NAMESPACE>) before opening the UI or sending requests. Image pulls, persistent volume binding, and in-cluster NIM model download and warm-up add startup time. For some profiles, bringing every pod to Ready can take 20–25 minutes depending on cluster resources and network speed.