Manage NIM Deployments#

Use the NeMo Deployment Management service to deploy and manage NIM microservices on your Kubernetes cluster.

The Deployment Management service provides APIs to create, update, and delete NIM deployments and their configurations.


Task Guides#

Perform common deployment management tasks.

Deploy NIM

Deploy a new NIM instance by specifying the model and configuration via a POST API call.

Deploy NIM Microservice
Get NIM Details

Retrieve details for a specific NIM deployment, including its status and configuration.

Get NIM Deployment Details
List Deployments

View all NIM deployments in your cluster and their current status.

List Deployments
Update Deployment

Modify an existing NIM deployment’s configuration and settings.

Update Deployment
Delete NIM

Remove a deployed NIM instance using the DELETE API endpoint.

Delete NIM Deployment
Create Config

Create a new configuration for NIM deployments with specific resource requirements and settings.

Create Configuration
Get Config

Retrieve details of a specific configuration.

Get Configuration
List Configs

View all available configurations for NIM deployments.

List Configurations
Update Config

Modify an existing configuration to change resource requirements or settings.

Update Configuration
Delete Config

Remove a configuration using the DELETE API endpoint.

Delete Configuration