Skip to main content

Ctrl+K

NVIDIA RAG blueprint

GitHub

NVIDIA RAG blueprint

GitHub

Table of Contents

NVIDIA RAG Blueprint

Release Notes
Support Matrix

Get Started

Get an API Key
Get Started with the RAG Blueprint
Web User Interface
Use the RAG Python Package
Notebooks

Deployment Options for RAG Blueprint

Deploy with Docker (Self-Hosted Models)
Deploy with Docker (NVIDIA-Hosted Models)
Deploy on Kubernetes with Helm
Deploy on Kubernetes with Helm from the repository
Deploy on Kubernetes with Helm and MIG Support
Deploy on OpenShift with Helm
Deploy Retrieval-Only Mode

Common configurations

Best Practices for Common Settings
Agentic RAG
Change the Model
Nemotron 3 Super Deployment
Customize Parameters
Customize Prompts
Model Profiles
Multi-Collection Retrieval
Multi-Turn Conversation Support
Reasoning
Self-reflection
Summarization

Data Ingestion and Processing

Audio Ingestion Support
Continuous Ingestion from Object Storage
Custom metadata Support
Data Catalog for Collections and Documents
File System Access to Results
Multimodal Retriever (VLM Embedding & VLM Reranker) for NVIDIA RAG Blueprint
OCR Configuration Guide
Enhanced PDF Extraction
Standalone NeMo Retriever Library
Text-Only Ingestion
MCP Server Usage

Vector Database and Retrieval

Vector Database Configuration for NVIDIA RAG Blueprint
Hybrid Search
Milvus Configuration
Elasticsearch Configuration
Query Decomposition

Multimodal and Advanced Generation

Image Captioning
Multimodal Query Support
VLM-based Inferencing

Evaluation

Evaluate Your RAG System
RAG Accuracy Benchmarks
Benchmark RAG Performance

Governance

NeMo Guardrails

Observability and Telemetry

Observability
Query-to-Answer Pipeline

Troubleshoot RAG Blueprint

Troubleshoot
RAG Pipeline Debugging Guide
Migration Guide

Reference

Milvus Collection Schema
Service Port and GPU Reference
API - Ingestor Server Schema
API - RAG Server Schema

API - RAG Server Schema

API - RAG Server Schema#

This documentation contains the OpenAPI reference for the RAG server.

Tip

To view this documentation on docs.nvidia.com, browse to https://docs.nvidia.com/rag/latest/api-rag.

Related Topics#

API - Ingestor Server Schema
NVIDIA RAG Blueprint Documentation

previous

API - Ingestor Server Schema

On this page

Related Topics

Privacy Policy | Your Privacy Choices | Terms of Service | Accessibility | Corporate Policies | Product Security | Contact

Copyright © 2025, NVIDIA CORPORATION & AFFILIATES.