Skip to main content
Ctrl+K
NVIDIA NIM for Large Language Models 2.0 - Home NVIDIA NIM for Large Language Models 2.0 - Home

NVIDIA NIM for Large Language Models 2.0

  • Documentation Home
NVIDIA NIM for Large Language Models 2.0 - Home NVIDIA NIM for Large Language Models 2.0 - Home

NVIDIA NIM for Large Language Models 2.0

  • Documentation Home

Table of Contents

About NVIDIA NIM for LLMs

  • Overview
  • Enterprise-Grade Inference Software Stack
  • Release Notes

Get Started

  • About Get Started
  • Prerequisites
  • Installation
  • Configuration
  • Quickstart

Deployment

  • Model Profiles and Selection
  • Model Download
  • Model-Free NIM
  • Kubernetes Deployment
    • Helm and Kubernetes
    • KServe
    • OpenShift
    • Run:ai
    • NIM Operator Deployment
  • Cloud Service Provider (CSP) Deployment
    • Google Cloud
    • AWS
    • Azure
    • Oracle
  • Air-Gap Deployment
  • Multi-Node Deployment
  • vGPU Deployment

Advanced Use Cases

  • Fine-Tuning with LoRA
  • Custom Logits Processing
  • Prompt Embeddings

Reference

  • Architecture
  • Environment Variables
  • API Reference
  • CLI Reference
  • Advanced Configuration
  • Logging and Observability
  • 1.x Migration Guide
  • Support Matrix

Resources

  • Support and FAQ
  • Related Products
  • Legal
  • Legal

Legal#

This page contains the primary legal references for NVIDIA NIM for LLMs.

NVIDIA AI Product Agreement#

By using this NIM, you acknowledge that you have read and agreed to the NVIDIA AI Product Agreement.

Open Source Software License Acknowledgements#

NVIDIA NIM for LLMs includes open source software components. For the acknowledgements that apply to a specific container, refer to the NVIDIA OSS archive.

previous

Related Software

On this page
  • NVIDIA AI Product Agreement
  • Open Source Software License Acknowledgements
NVIDIA NVIDIA
Privacy Policy | Your Privacy Choices | Terms of Service | Accessibility | Corporate Policies | Product Security | Contact

Copyright © 2024-2026, NVIDIA Corporation.

Last updated on Mar 12, 2026.