Create and Load Custom Audio Manifests
Create and load custom audio manifests in JSONL format for your speech datasets. This guide covers the required manifest format and how to load manifests into NeMo Curator pipelines.
Manifest Format
NeMo Curator uses JSONL (JSON Lines) format for audio manifests, with one JSON object per line:
NeMo Curator does not provide a generic TSV reader stage. You must convert your data to JSONL format before loading, or use dataset-specific importers like the FLEURS manifest creator.
Required Fields
Every audio manifest entry must include:
Optional Fields
Additional fields that can enhance processing:
Creating Custom Manifests
You’ll need to create your own manifest files using your preferred tools. Here’s a simple Python example:
Loading Manifests in Pipelines
Using JsonlReader
Load your custom manifest using the built-in JsonlReader:
Validation
Audio file validation happens automatically during pipeline processing:
Example: Complete Workflow
Related Topics
- Audio Processing Overview - Complete audio processing workflow
- FLEURS Dataset - Example of automated dataset loading
- Local Files - Loading audio files from local directories