nemo_curator.stages.audio.common

Module Contents

Classes

| Name | Description |
| --- | --- |
| GetAudioDurationStage | Compute audio duration from the file at audio_filepath_key and store the result under duration_key. |
| PreserveByValueStage | Filter entries by comparing input_value_key against target_value. |

API

class nemo_curator.stages.audio.common.GetAudioDurationStage(
name: str = 'GetAudioDurationStage',
audio_filepath_key: str = 'audio_filepath',
duration_key: str = 'duration'
)
Dataclass

Bases: ProcessingStage[AudioTask, AudioTask]

Compute audio duration from the file at audio_filepath_key and store the result under duration_key.

Parameters:

audio_filepath_key
str. Defaults to 'audio_filepath'.

Key under which each entry stores the path to the WAV file.

duration_key
str. Defaults to 'duration'.

Key under which the computed audio duration is written.

audio_filepath_key: str = 'audio_filepath'
duration_key: str = 'duration'
name: str = 'GetAudioDurationStage'
nemo_curator.stages.audio.common.GetAudioDurationStage.inputs() -> tuple[list[str], list[str]]
nemo_curator.stages.audio.common.GetAudioDurationStage.outputs() -> tuple[list[str], list[str]]
nemo_curator.stages.audio.common.GetAudioDurationStage.process(
task: nemo_curator.tasks.AudioTask
) -> nemo_curator.tasks.AudioTask
nemo_curator.stages.audio.common.GetAudioDurationStage.setup(
worker_metadata: typing.Any = None
) -> None
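
The behavior described for GetAudioDurationStage (read the path stored under audio_filepath_key, measure the file, write the result under duration_key) can be illustrated without the library. The following is a minimal stand-alone sketch, not the stage's actual implementation: it assumes entries are plain dicts, uses the standard-library wave module (so it only handles uncompressed WAV files), and reports the duration in seconds. The names add_duration and write_silence are hypothetical helpers introduced for this example.

```python
import os
import tempfile
import wave


def add_duration(
    entry: dict,
    audio_filepath_key: str = "audio_filepath",
    duration_key: str = "duration",
) -> dict:
    """Return a copy of `entry` with the WAV duration (seconds) under `duration_key`.

    Hypothetical helper: mirrors the documented key names, not the real stage code.
    """
    with wave.open(entry[audio_filepath_key], "rb") as wav:
        # Duration = number of frames / frames per second.
        seconds = wav.getnframes() / wav.getframerate()
    return {**entry, duration_key: seconds}


def write_silence(path: str, seconds: float, sample_rate: int = 16000) -> None:
    """Write a mono 16-bit WAV file of silence (demo helper only)."""
    with wave.open(path, "wb") as wav:
        wav.setnchannels(1)
        wav.setsampwidth(2)  # 2 bytes per sample = 16-bit audio
        wav.setframerate(sample_rate)
        wav.writeframes(b"\x00\x00" * int(seconds * sample_rate))


# Demo: create a half-second file, then annotate an entry that points at it.
with tempfile.TemporaryDirectory() as tmp:
    path = os.path.join(tmp, "silence.wav")
    write_silence(path, seconds=0.5)
    result = add_duration({"audio_filepath": path})

print(result["duration"])
```

Returning a new dict rather than mutating the input is a choice made for this sketch only; the reference above does not say whether the stage modifies the task in place.
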
class nemo_curator.stages.audio.common.PreserveByValueStage(
input_value_key: str,
target_value: int | str,
operator: str = 'eq'
)

Bases: ProcessingStage[AudioTask, AudioTask]

Filter entries by comparing input_value_key against target_value.

Returns None from process() to drop entries that fail the comparison, matching the text-modality filter convention.

Parameters:

input_value_key
str

The field of each dataset entry to evaluate.

target_value
int | str

The value to compare the field against.

operator
str. Defaults to 'eq'.

Name of the comparison operator: one of lt, le, eq, ne, ge, gt.

name: str = 'PreserveByValueStage'
operator = ops[operator]
nemo_curator.stages.audio.common.PreserveByValueStage.inputs() -> tuple[list[str], list[str]]
nemo_curator.stages.audio.common.PreserveByValueStage.outputs() -> tuple[list[str], list[str]]
nemo_curator.stages.audio.common.PreserveByValueStage.process(
task: nemo_curator.tasks.AudioTask
) -> nemo_curator.tasks.AudioTask | None
nemo_curator.stages.audio.common.PreserveByValueStage.process_batch(
tasks: list[nemo_curator.tasks.AudioTask]
) -> list[nemo_curator.tasks.AudioTask]
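
The filtering behavior documented for PreserveByValueStage can be sketched in plain Python. This is an illustration under stated assumptions, not the library's code: entries are modeled as dicts, the OPS table is an assumed equivalent of the ops mapping referenced above (built here from the standard-library operator module), the entry's value is assumed to be the left operand of the comparison, and preserve_by_value and preserve_batch are hypothetical names standing in for process() and process_batch().

```python
import operator
from typing import Optional, Union

# Assumed mapping from the documented operator names to comparison functions.
OPS = {
    "lt": operator.lt,
    "le": operator.le,
    "eq": operator.eq,
    "ne": operator.ne,
    "ge": operator.ge,
    "gt": operator.gt,
}


def preserve_by_value(
    entry: dict,
    input_value_key: str,
    target_value: Union[int, str],
    op: str = "eq",
) -> Optional[dict]:
    """Return `entry` if entry[input_value_key] <op> target_value holds, else None."""
    compare = OPS[op]
    return entry if compare(entry[input_value_key], target_value) else None


def preserve_batch(
    entries: list,
    input_value_key: str,
    target_value: Union[int, str],
    op: str = "eq",
) -> list:
    """Apply the filter to every entry and keep only those that were not dropped."""
    results = (preserve_by_value(e, input_value_key, target_value, op) for e in entries)
    return [e for e in results if e is not None]


# Demo: keep entries whose duration is at least 5.
entries = [{"duration": 3}, {"duration": 12}, {"duration": 7}]
kept = preserve_batch(entries, "duration", 5, "ge")
print(kept)  # [{'duration': 12}, {'duration': 7}]
```

Looking the operator up by name keeps the stage configurable from a string parameter, and an unknown name fails immediately with a KeyError in this sketch.
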