nemo_curator.stages.audio.preprocessing.mono_conversion
nemo_curator.stages.audio.preprocessing.mono_conversion
nemo_curator.stages.audio.preprocessing.mono_conversion
Audio mono conversion stage.
Converts multi-channel audio to mono and verifies sample rate. Typically the first stage in an audio processing pipeline.
Bases: ProcessingStage[AudioTask, AudioTask]
Audio mono conversion and sample rate verification stage.
Converts multi-channel audio to mono by averaging channels. Optionally verifies that audio matches expected sample rate.
Parameters:
Expected sample rate in Hz (default: 48000)
Key in data dict for audio file path
If True, reject audio with wrong sample rate
Convert audio to mono and verify sample rate.
Mutates task.data in-place with waveform data. Returns task if successful, [] if doesn’t meet requirements.