API Reference#

OpenAPI Schema#

The OpenAPI spec details the endpoints for NVIDIA NIM for LLMs:

/v1/health/ready - Health endpoint
/v1/models - Show available models
/v1/chat/completions - Chat Completions Endpoint
/v1/completions - Completions Endpoint

The /v1/completions and /v1/chat/completions endpoints can be found in the NIM OpenAPI Schema.

Experimental APIs#

Experimental support for Llama Stack (LS) API#

/experimental/ls/inference/chat_completion
/experimental/ls/inference/completion

The /experimental/ls/inference/chat_completion and /experimental/ls/inference/completion endpoints can be found in the NIM OpenAPI Schema.

Reference#

NVIDIA NIM for LLMs