Large Language Models (Latest)
Large Language Models (Latest)

API Reference

The OpenAPI spec details the endpoints for NVIDIA NIM for LLMs:

  • /v1/health/ready - Health endpoint

  • /v1/models - Show available models

  • /v1/chat/completions - Chat Completions Endpoint

  • /v1/completions - Completions Endpoint

The /v1/completions and /v1/chat/completions endpoints can be found in the NIM OpenAPI Schema.

Experimental support of LS API

  • /experimental/ls/inference/chat_completion

  • /experimental/ls/inference/completion

The /experimental/ls/inference/chat_completion and /experimental/ls/inference/completion endpoints can be found in the NIM OpenAPI Schema.

Previous Support Matrix
Next Function Calling
© Copyright © 2024, NVIDIA Corporation. Last updated on Jul 26, 2024.