Is this page helpful?

What is NeMo Retriever Library?

NVIDIA NeMo Retriever Library is a scalable, performance-oriented framework for document content and metadata extraction. It supports both NVIDIA NIM microservices and a wide range of models to find, contextualize, and extract text, tables, charts, and infographics for use in downstream generative and retrieval-augmented applications.

Note

NVIDIA Ingest (nv-ingest) has been renamed NeMo Retriever Library.

NeMo Retriever Library enables parallelization of splitting documents into pages where artifacts are classified (such as text, tables, charts, and infographics), extracted, and further contextualized through optical character recognition (OCR) into a well defined JSON schema. From there, NeMo Retriever Library can optionally manage computation of embeddings for the extracted content, and optionally manage storing into a vector database (LanceDB by default, or Milvus).

Note

Cached and Deplot are deprecated. Instead, NeMo Retriever Library now uses the yolox-graphic-elements NIM. With this change, you should now be able to run NeMo Retriever Library on a single 24GB A10G or better GPU. If you want to use the old pipeline, with Cached and Deplot, use the NeMo Retriever Library 24.12.1 release.

What NeMo Retriever Library Is ✔️

The following diagram shows the retriever pipeline.

Overview diagram

NeMo Retriever Library is a microservice service that does the following:

Accept a JSON job description, containing a document payload, and a set of ingestion tasks to perform on that payload.
Allow the results of a job to be retrieved. The result is a JSON dictionary that contains a list of metadata describing objects extracted from the base document, and processing annotations and timing/trace data.
Support multiple methods of extraction for each document type to balance trade-offs between throughput and accuracy. For example, for .pdf documents, extraction is performed by using pdfium and nemotron-parse.
Support various types of pre- and post- processing operations, including text splitting and chunking, transform and filtering, embedding generation, and image offloading to storage.

NeMo Retriever Library supports the following file types:

avi (early access)
bmp
docx
html (converted to markdown format)
jpeg
json (treated as text)
md (treated as text)
mkv (early access)
mov (early access)
mp3
mp4 (early access)
pdf
png
pptx
sh (treated as text)
svg (NeMo Retriever Library only, requires cairosvg)
tiff
txt
wav

What NeMo Retriever Library Isn't ✖️

NeMo Retriever Library does not do the following:

Run a static pipeline or fixed set of operations on every submitted document.
Act as a wrapper for any specific document parsing library.

What is NeMo Retriever Library?

What NeMo Retriever Library Is ✔️

What NeMo Retriever Library Isn't ✖️

Related Topics