Notebooks for NeMo Retriever Extraction

To get started with NeMo Retriever extraction, try one of the ready-made notebooks.

Note

NeMo Retriever extraction is also known as NVIDIA Ingest and nv-ingest.

Dataset Downloads for Benchmarking

Before you run benchmarking or evaluation tests, download the benchmark datasets (Bo20, Bo767, Bo10k) from Digital Corpora. They are a prerequisite for all benchmarking operations.

Getting Started

To get started with the basics, try one of the following notebooks:

For more advanced scenarios, try one of the following notebooks: