Create Endpoints from NVIDIA NIM

Learn how to create dedicated endpoints from NVIDIA NIM.

For enhanced performance and seamless compatibility, NVIDIA‑optimized models from the NIM container registry are available on DGX Cloud Lepton.

Prerequisites

These models require an NVIDIA account with access to the NIM container registry.

NVIDIA Registry

You must have an NVIDIA account with access to the NIM container registry and configure the registry auth key on DGX Cloud Lepton.

Refer to this guide for details. Once the registry auth key is created, add a private registry via Settings > Registries > New Registry Auth.

Create registry auth

Choose NVIDIA as the registry type and paste the registry auth key in the API Key field.

Create registry auth

NGC API Key

Besides the registry auth key, you also need an NGC API key. Navigate to the NGC API key creation page and click Generate Personal Key.

In the Service Included field, select Public API Endpoints.

NGC API key 0.6x

Store the NGC API key on DGX Cloud Lepton as a secret.

Create endpoint from NVIDIA NIM

Navigate to the Create Endpoint page on the dashboard.

For Endpoint name, enter nim-endpoint or any name you prefer.

For Resource, choose an appropriate resource based on the model size.

For NIM configuration:

  • Select a model image from the list of built‑in models, or enter a custom model image.
  • Select the NVIDIA registry auth you created (see registry auth).
  • Select the NGC API key you saved as a secret in your workspace.

For other endpoint‑related configurations, refer to this guide.

For NIM engine‑related configurations, refer to this guide. Configure the NIM engine by setting the relevant environment variables.

When finished, click Create Endpoint to create the endpoint.

Copyright @ 2025, NVIDIA Corporation.