stages.audio.common
#
Module Contents#
Classes#
Stage that computes the duration of the file in |
|
LegacySpeechStage for SDP processors inherited from BaseParallelProcessor |
|
Processor for preserving dataset entries based on a specified condition involving a target value and an input field. |
API#
- class stages.audio.common.GetAudioDurationStage#
Bases:
stages.audio.common.LegacySpeechStage
Stage that computes the duration of the file in
audio_filepath_key
(using soundfile) and saves the duration induration_key
. If there is an error computing the duration, the value atduration_key
will be updated with the value -1.0.Args: audio_filepath_key (str): Key to get path to wav file. duration_key (str): Key to put to audio duration. Returns: All the same fields as in the input manifest plus duration_key
- audio_filepath_key: str#
None
- duration_key: str#
None
- process_dataset_entry(
- data_entry: dict,
- class stages.audio.common.LegacySpeechStage#
Bases:
nemo_curator.stages.base.ProcessingStage
[nemo_curator.tasks.Task
,nemo_curator.tasks.Task
]LegacySpeechStage for SDP processors inherited from BaseParallelProcessor
- process(
- task: nemo_curator.tasks.AudioBatch,
Process a task and return the result. Args: task (X): Input task to process Returns (Y | list[Y]): - Single task: For 1-to-1 transformations - List of tasks: For 1-to-many transformations (e.g., readers) - None: If the task should be filtered out
- abstractmethod process_dataset_entry(
- data_entry: nemo_curator.tasks.AudioBatch,
- class stages.audio.common.PreserveByValueStage(
- input_value_key: str,
- target_value: int | str,
- operator: str = 'eq',
- **kwargs,
Bases:
stages.audio.common.LegacySpeechStage
Processor for preserving dataset entries based on a specified condition involving a target value and an input field.
Args: input_value_key (str): The field in the dataset entries to be evaluated. target_value (Union[int, str]): The value to compare with the input field. operator (str): (Optional) The operator to apply for comparison. Options: “lt” (less than), “le” (less than or equal to), “eq” (equal to), “ne” (not equal to), “ge” (greater than or equal to), “gt” (greater than). Defaults to “eq”. **kwargs: Additional keyword arguments to be passed to the base class
BaseParallelProcessor
.Initialization
- process_dataset_entry(
- data_entry: nemo_curator.tasks.AudioBatch,