API Reference for NVIDIA NIM for LLMs#
This documentation contains the API reference for NVIDIA NIM for LLMs.
OpenAPI Schema#
The OpenAPI spec details the endpoints for NVIDIA NIM for LLMs:
/v1/health/ready - Health endpoint
/v1/models - Models endpoint
/v1/chat/completions - Chat completions endpoint
/v1/completions - Completions endpoint
Refer to the NIM OpenAPI schema for /v1/completions
and /v1/chat/completions
.
Experimental APIs#
Experimental Support for Llama Stack (LS) API#
/experimental/ls/inference/chat_completion
/experimental/ls/inference/completion
Refer to the NIM OpenAPI schema for /experimental/ls/inference/chat_completion
and /experimental/ls/inference/completion
.
Experimental Support for the OpenAI Responses API#
/v1/responses
For more information, refer to the OpenAI API reference.