Release Notes for NeMo Retriever Extraction
This documentation contains the release notes for NeMo Retriever extraction.
Note
NeMo Retriever extraction is also known as NVIDIA Ingest and nv-ingest.
Release 25.03
Summary
The NeMo Retriever extraction 25.03 release includes accuracy improvements, feature expansions, and throughput improvements.
New Features
- Consolidated NeMo Retriever extraction to run on a single GPU (H100, A100, L40S, or A10G). For details, refer to Support Matrix.
- Added Library Mode for a lightweight no-GPU deployment that uses NIM endpoints hosted on build.nvidia.com. For details, refer to Deploy Without Containers (Library Mode).
- Added support for infographics extraction.
- Added support for RIVA NIM for Audio extraction (Early Access). For details, refer to Audio Processing.
- Added support for Llama-3.2 VLM for Image Captioning capability.
- docX, pptx, jpg, png support for image detection & extraction.
- Deprecated DePlot and CACHED NIMs.
Release 24.12.1
Bug fixes
Cases where .split() tasks fail during ingestion are now fixed.
Release 24.12
Known Issues
We currently do not support OCR-based text extraction. This was discovered in an unsupported use case and is not a product functionality issue.