nemo_curator.stages.audio.alm.alm_data_builder
nemo_curator.stages.audio.alm.alm_data_builder
ALM Data Builder Stage - Native NeMo Curator Implementation.
Creates training windows from audio segments. Follows the exact pattern from NeMo Curator: https://github.com/NVIDIA-NeMo/Curator/blob/main/nemo_curator/stages/audio/common.py
Produces identical output to SDP implementation.
Module Contents
Classes
Functions
Data
API
Bases: ProcessingStage[AudioTask, AudioTask]
Build ALM training windows from audio segments.
Filters segments by sample rate, bandwidth, speaker count, and duration
to create valid training windows. Mutates the entry dict in-place,
adding windows, stats, and truncation_events keys.
Compute derived parameters - EXACT match to SDP.
Process a single entry and extract valid training windows.
Tracks segment loss reasons and counts during window building.
Record statistics for a rejected window.
Truncate a segment’s words at the given end time, returning a modified copy.