nemo_curator.stages.audio.datasets.readspeech.create_initial_manifest
nemo_curator.stages.audio.datasets.readspeech.create_initial_manifest
nemo_curator.stages.audio.datasets.readspeech.create_initial_manifest
Bases: ProcessingStage[_EmptyTask, AudioTask]
Stage to create initial manifest for the DNS Challenge Read Speech dataset.
Dataset: Microsoft DNS Challenge 5 - Read Speech (Track 1 Headset) Source: https://github.com/microsoft/DNS-Challenge
Downloads a single archive (4.88 GB) containing 14,279 WAV files at 48kHz (19.3 hours).
When auto_download=True, the archive is downloaded and extracted automatically.
Parameters:
Directory where data will be downloaded/extracted to.
Maximum number of samples to include (-1 for all).
If True, automatically download and extract dataset.
Download and extract DNS Challenge Read Speech dataset (~4.88 GB).
Main processing method. Returns list[AudioTask] with one AudioTask per file.