nemo_curator.tasks.audio_task
Module Contents
Classes
API
Dataclass
Bases: Task[dict]
A single audio manifest entry.
Represents one line from a JSONL manifest file (e.g. one audio file
with its metadata). data is always a single dict, never a list.
Matches the VideoTask naming convention used by the video modality.
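As a rough illustration of the one-entry-per-line model described above, the sketch below parses a small JSONL manifest using only the standard library. The helper name read_manifest_entries and the sample paths are hypothetical and are not part of nemo_curator; each yielded dict corresponds to what a single task would carry as its data.

```python
import io
import json


def read_manifest_entries(stream):
    """Yield one dict per non-empty JSONL line (hypothetical helper)."""
    for line in stream:
        line = line.strip()
        if not line:
            continue  # skip blank lines between entries
        entry = json.loads(line)
        if not isinstance(entry, dict):
            # A manifest entry is a single JSON object, never a list.
            raise TypeError(
                f"expected a JSON object per line, got {type(entry).__name__}"
            )
        yield entry


# Two made-up manifest lines; each would back exactly one task.
manifest = io.StringIO(
    '{"audio_filepath": "a.wav", "text": "hello"}\n'
    '{"audio_filepath": "b.wav", "text": "world"}\n'
)
entries = list(read_manifest_entries(manifest))
```

Keeping one dict per task, rather than a batch of dicts, means each manifest line can be processed independently of the others.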
Parameters:
data
Manifest entry dict (e.g. {"audio_filepath": "...", "text": "..."}).
filepath_key
Optional key in data whose value is validated as an existing path.
data
dataset_name
filepath_key
num_items
task_id
Validate the task data.
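The page does not spell out the validation rules, so the following is only a sketch of what such a check might look like, assuming that validation returns a bool, that data must be a dict, and that the value under filepath_key (when set) must point to an existing path. ManifestEntrySketch is a hypothetical stand-in, not the nemo_curator class.

```python
import os
import tempfile
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class ManifestEntrySketch:
    """Stand-in for illustration only; not the nemo_curator class."""

    data: dict = field(default_factory=dict)
    filepath_key: Optional[str] = None

    def validate(self) -> bool:
        # Assumption: data must be a single dict, never a list.
        if not isinstance(self.data, dict):
            return False
        # Assumption: without a filepath_key there is nothing more to check.
        if self.filepath_key is None:
            return True
        path = self.data.get(self.filepath_key)
        return isinstance(path, str) and os.path.exists(path)


# A temporary file stands in for a real audio file on disk.
with tempfile.NamedTemporaryFile(suffix=".wav") as tmp:
    ok = ManifestEntrySketch(
        {"audio_filepath": tmp.name}, "audio_filepath"
    ).validate()

missing = ManifestEntrySketch(
    {"audio_filepath": "/no/such/file.wav"}, "audio_filepath"
).validate()
unchecked = ManifestEntrySketch({"text": "hi"}).validate()
```

In this sketch the path check is opt-in: entries that do not set filepath_key pass as long as data is a dict.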
Bases: dict
Dict subclass that exposes its keys as attributes, so that hasattr works for keys present in the dict.
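One common way to get this behaviour is to define __getattr__ and translate a missing key into AttributeError, which is what hasattr relies on. The class below is a minimal sketch of that pattern under the hypothetical name AttrDictSketch; the actual implementation in nemo_curator may differ.

```python
class AttrDictSketch(dict):
    """Hypothetical dict subclass whose keys are readable as attributes."""

    def __getattr__(self, name):
        # __getattr__ runs only after normal attribute lookup fails,
        # so built-in dict methods such as keys() still take precedence.
        try:
            return self[name]
        except KeyError:
            # hasattr() returns False only when AttributeError is raised.
            raise AttributeError(name) from None


entry = AttrDictSketch({"audio_filepath": "a.wav"})
has_path = hasattr(entry, "audio_filepath")
has_text = hasattr(entry, "text")
```

Raising AttributeError rather than letting KeyError escape is the important detail: hasattr and getattr with a default only swallow AttributeError.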