Skip to content

Release Notes for NeMo Retriever Extraction

This documentation contains the release notes for NeMo Retriever extraction.

Note

NeMo Retriever extraction is also known as NVIDIA Ingest and nv-ingest.

Release 25.03

Summary

The NeMo Retriever extraction 25.03 release includes accuracy improvements, feature expansions, and throughput improvements.

New Features

  • Consolidated NeMo Retriever extraction to run on a single GPU (H100, A100, L40S, or A10G). For details, refer to Support Matrix.
  • Added Library Mode for a lightweight no-GPU deployment that uses NIM endpoints hosted on build.nvidia.com. For details, refer to Deploy Without Containers (Library Mode).
  • Added support for infographics extraction.
  • Added support for RIVA NIM for Audio extraction (Early Access). For details, refer to Audio Processing.
  • Added support for Llama-3.2 VLM for Image Captioning capability.
  • docX, pptx, jpg, png support for image detection & extraction.
  • Deprecated DePlot and CACHED NIMs.

Release 24.12.1

Bug fixes

Cases where .split() tasks fail during ingestion are now fixed.

Release 24.12

Known Issues

We currently do not support OCR-based text extraction. This was discovered in an unsupported use case and is not a product functionality issue.