nemo_curator.stages.audio.filtering.band_filter_module.predict
nemo_curator.stages.audio.filtering.band_filter_module.predict
Module Contents
Classes
API
Class to predict band label (full_band/narrow_band) for audio waveforms.
feature_cache
Load the model from disk.
Extract band energy features directly from waveform tensor.
Parameters:
waveform
Audio waveform tensor [channels, samples]
sample_rate
Sample rate of the audio
Returns: np.ndarray
Array of extracted features
Predict whether an audio waveform is full band or narrow band.
Parameters:
waveform
Audio waveform tensor [channels, samples]
sample_rate
Sample rate of the audio
Returns: str
Prediction result as a string (‘full_band’ or ‘narrow_band’)