API Reference for NVIDIA NIM for LLMs#

This documentation contains the API reference for NVIDIA NIM for LLMs.

OpenAPI Schema#

The OpenAPI spec details the endpoints for NVIDIA NIM for LLMs:

  • /v1/health/ready - Health endpoint

  • /v1/models - Models endpoint

  • /v1/chat/completions - Chat completions endpoint

  • /v1/completions - Completions endpoint

Refer to the NIM OpenAPI schema for /v1/completions and /v1/chat/completions.

Experimental APIs#

Experimental Support for Llama Stack (LS) API#

  • /experimental/ls/inference/chat_completion

  • /experimental/ls/inference/completion

Refer to the NIM OpenAPI schema for /experimental/ls/inference/chat_completion and /experimental/ls/inference/completion.

Experimental Support for the OpenAI Responses API#

  • /v1/responses

For more information, refer to the OpenAI API reference.

Reference#