Minimum System Requirements for NVIDIA RAG Blueprint#

This documentation contains the system requirements for the NVIDIA RAG Blueprint.

Important

You can deploy the RAG Blueprint with Docker, Helm, or NIM Operator, and target dedicated hardware or a Kubernetes cluster. Some requirements are different depending on your target system and deployment method.

Disk Space Requirements#

Important

Ensure that you have at least 200GB of available disk space before you deploy the RAG Blueprint. This space is required for the following:

NIM model downloads and caching (largest component, ~100-150GB)
Container images (~20-30GB)
Vector database data and indices
Application logs and temporary files

Insufficient disk space causes deployment failures during model downloads or runtime operations.

Operating System#

For the RAG Blueprint you need the following operating system:

Ubuntu 22.04 OS

Driver Versions#

For the RAG Blueprint you need the following drivers:

GPU Driver - 560 or later
CUDA version - 12.9 or later

For details, see NVIDIA NIM for LLMs Software.

Hardware Requirements (Docker)#

By default, the RAG Blueprint deploys the NIM microservices locally (self-hosted). You need one of the following:

2 x H100
2 x B200
3 x A100 SXM
2 x RTX PRO 6000

Tip

You can also modify the RAG Blueprint to use NVIDIA-hosted NIM microservices.

Tip

No GPU Available? Try the Containerless Deployment (Lite Mode) which requires no GPU hardware and uses NVIDIA cloud APIs for all processing.

Hardware Requirements (Kubernetes)#

To install the RAG Blueprint on Kubernetes, you need one of the following:

8 x H100-80GB
8 x B200
9 x A100-80GB SXM
8 x RTX PRO 6000
3 x H100 (with Multi-Instance GPU)

Hardware requirements for self-hosting all NVIDIA NIM microservices#

The following are requirements and recommendations for the individual components of the RAG Blueprint:

Pipeline operation – 1x L40 GPU or similar recommended. This is needed for the Milvus vector database, as GPU acceleration is enabled by default.
LLM NIM (llama-3.3-nemotron-super-49b-v1.5) – Refer to the Support Matrix.
Embedding NIM (Llama-3.2-NV-EmbedQA-1B-v2 ) – Refer to the Support Matrix.
Reranking NIM (llama-3_2-nv-rerankqa-1b-v2 ): Refer to the Support Matrix.
NeMo Retriever OCR (Default): Refer to the Support Matrix.
NVIDIA NIM for Image OCR (baidu/paddleocr - Legacy): Refer to the Support Matrix.
NVIDIA NIMs for Object Detection:
- NeMo Retriever Page Elements v3 Support Matrix
- NeMo Retriever Graphic Elements v1 Support Matrix
- NeMo Retriever Table Structure v1 Support Matrix

Tip

NeMo Retriever OCR is now the default OCR service. To use the legacy Paddle OCR instead, see OCR Configuration Guide.

Minimum System Requirements for NVIDIA RAG Blueprint#

Disk Space Requirements#

Operating System#

Driver Versions#

Hardware Requirements (Docker)#

Hardware Requirements (Kubernetes)#

Hardware requirements for self-hosting all NVIDIA NIM microservices#

Related Topics#