Large Language Models (Latest)
Large Language Models (Latest)

API Reference

The OpenAPI spec details the endpoints for NVIDIA NIM for LLMs:

  • /v1/health/ready - Health endpoint

  • /v1/models - Show available models

  • /v1/chat/completions - Chat Completions Endpoint

  • /v1/completions - Completions Endpoint

The /v1/completions and /v1/chat/completions endpoints can be found in the NIM OpenAPI Schema.

Experimental support for Llama Stack (LS) API

  • /experimental/ls/inference/chat_completion

  • /experimental/ls/inference/completion

The /experimental/ls/inference/chat_completion and /experimental/ls/inference/completion endpoints can be found in the NIM OpenAPI Schema.

Previous Support Matrix
Next Function Calling
© Copyright © 2024, NVIDIA Corporation. Last updated on Oct 10, 2024.