Large Language Models (1.0.0)
Large Language Models (1.0.0)

API Reference

The OpenAPI spec details the endpoints for NVIDIA NIM for LLMs:

  • /v1/health/ready - Health endpoint

  • /v1/models - Show available models

  • /v1/chat/completions - Chat Completions Endpoint

  • /v1/completions - Completions Endpoint

The /v1/completions and /v1/chat/completions endpoints can be found in the NIM OpenAPI Schema.

Experimental support of LS API

  • /experimental/ls/inference/chat_completion

  • /experimental/ls/inference/completion

The /experimental/ls/inference/chat_completion and /experimental/ls/inference/completion endpoints can be found in the NIM OpenAPI Schema.

Previous Support Matrix
Next Observability
© Copyright © 2024, NVIDIA Corporation. Last updated on Jul 22, 2024.