Skip to main content
country_code
Ctrl+K
📢 Notice: This is Early Access NIM LLM documentation. For more information, refer to the GA NIM LLM or NIM VLM documentation.
NVIDIA NIM for Large Language Models - Home NVIDIA NIM for Large Language Models - Home

NVIDIA NIM for Large Language Models

  • Documentation Home
NVIDIA NIM for Large Language Models - Home NVIDIA NIM for Large Language Models - Home

NVIDIA NIM for Large Language Models

  • Documentation Home

Table of Contents

  • Overview
  • Release Notes

Get Started with Turbo VLMs

  • Get Started with Kimi-K2.6
  • Get Started with Kimi-K2.5

Get Started with Turbo LLMs

  • Get Started with Nemotron-3-Ultra
  • Get Started with Nemotron-3-Super
  • Get Started with GPT-OSS-120b-Turbo
  • Get Started with Nemotron-3-Super-120B-A12B
  • Overview
Is this page helpful?

Overview#

NIM Turbo delivers validated best-in-class inference performance for top models on NVIDIA Hardware. It’s free for use in production deployments.

For more information, please refer to the following NIM documentation sites:

  • LLM: https://docs.nvidia.com/nim/large-language-models/latest/about-nim-llm/overview.html

  • VLM: https://docs.nvidia.com/nim/vision-language-models/latest/index.html

previous

NVIDIA NIM for Large Language Models Documentation

next

Release Notes

NVIDIA NVIDIA
Privacy Policy | Your Privacy Choices | Terms of Service | Accessibility | Corporate Policies | Product Security | Contact

Copyright © 2024-2026, NVIDIA Corporation.

Last updated on Jun 09, 2026.