Introduction

NVIDIA Fleet Command deploys container-based applications on GPU-accelerated Kubernetes clusters using Helm charts. This guide will provide the information necessary to build applications compatible with Fleet Command and includes information on setting up a development environment.

Containerization involves bundling an application with all of its related configuration files, libraries, and dependencies required to run efficiently for portability. While containers are an excellent way to bundle and run applications, it is essential to manage the containers that run the applications and ensure no downtime in a deployment environment.

Kubernetes is an open-source platform for managing containerized applications. For example, if a container goes down, another container needs to start. Kubernetes helps in maintaining container life cycles without any human intervention.

Fleet Command leverages these technologies to provide a fully supported cloud-native platform that securely deploys, manages, and scales your applications across a distributed edge infrastructure.

Note

Fleet Command Software Stack 2.0 is available in Fleet Command version 1.3.0 and above.

Applications deployed on NVIDIA Fleet Command must run on the following software stack:

  • Ubuntu 22.04 LTS

  • CUDA 11.7

    • NVIDIA Driver 525.85.12 (Pre-compiled signed driver)

  • Containerd 1.6.18-1

  • Kubernetes 1.25.9

  • NVIDIA Container Toolkit 1.12.0-1

  • Helm 3.11.0

  • NVIDIA MIG Manager

  • NVIDIA GPU Operator 22.9.2 (includes the NVIDIA Driver libraries)

Instructions on setting up this software stack manually for development purposes are provided later in this guide, in the Development Environment section.

© Copyright 2022-2023, NVIDIA. Last updated on Jun 12, 2023.