This guide helps you set up and get started with NeMo Curator’s audio curation capabilities. Follow these steps to prepare your environment and run your first audio curation pipeline using the FLEURS dataset.
To use NeMo Curator’s audio curation modules, ensure you meet the following requirements:
If you don’t have uv installed, refer to the Installation Guide for setup instructions, or install it quickly with:
You can install NeMo Curator with audio support in four ways:
The simplest way to install NeMo Curator with audio support:
The audio extras include NeMo Toolkit with ASR models. Additional audio processing libraries (soundfile, editdistance) are installed automatically as NeMo Toolkit dependencies.
NeMo Curator provides a sample FLEURS configuration for audio curation. You can download and customize it:
This configuration file contains a complete audio curation pipeline for the FLEURS dataset, including ASR inference, quality assessment, and filtering.
Create a directory to store your audio datasets:
Here’s a simple example to get started with audio curation using the FLEURS dataset:
You can also run the pipeline using the downloaded configuration:
After running the pipeline, you’ll have:
Each output entry contains:
Explore the Audio Curation documentation for more advanced processing techniques and customization options.
Key areas to explore next: