nemo_curator.stages.audio.filtering.band
nemo_curator.stages.audio.filtering.band
nemo_curator.stages.audio.filtering.band
Band filter stage for audio bandwidth classification.
Classifies audio as “full_band” or “narrow_band” based on spectral characteristics. Useful for filtering low-quality telephone or compressed audio.
Bases: ProcessingStage[AudioTask, AudioTask]
Band filter stage for bandwidth classification.
Classifies audio as “full_band” or “narrow_band” and filters based on the specified band_value to pass.
Parameters:
Local path to band classifier model (.joblib). If not provided, the model is downloaded from HuggingFace (nvidia/nemocurator-speech-bandwidth-filter).
Directory to cache downloaded models.
Which band type to pass (“full_band” or “narrow_band”)
Run band classification on a single (non-nested) task.
Filter audio based on bandwidth classification.
When task.data contains a "segments" key (nested mode from VAD),
each segment is evaluated individually and only survivors are kept.
Returns: AudioTask | list[AudioTask]
AudioTask if passes the band filter, [] if filtered out.