For AI agents: a documentation index is available at the root level at /llms.txt and /llms-full.txt. Append /llms.txt to any URL for a page-level index, or .md for the markdown version of any page.
DocumentationAPI Reference
DocumentationAPI Reference
  • API Reference
    • Overview
        • Nemo Curator
          • Backends
          • Config
          • Core
          • Metrics
          • Models
          • Package Info
          • Pipeline
          • Stages
          • Tasks
            • Audio Task
            • Document
            • File Group
            • Image
            • Interleaved
            • Tasks
            • Utils
            • Video
          • Utils
    • Pipeline
    • ProcessingStage
    • CompositeStage
    • Resources
NVIDIANVIDIA
Developer-friendly docs for your API
Privacy Policy | Your Privacy Choices | Terms of Service | Accessibility | Corporate Policies | Product Security | Contact

Copyright © 2026, NVIDIA Corporation.

LogoLogoNeMo Curator
On this page
  • Module Contents
  • Classes
  • API
API ReferenceFull Library ReferenceNemo CuratorNemo CuratorTasks

nemo_curator.tasks.audio_task

||View as Markdown|
Previous

nemo_curator.tasks

Next

nemo_curator.tasks.document

Module Contents

Classes

NameDescription
AudioTaskA single audio manifest entry.
_AttrDictDict subclass exposing keys as attributes so hasattr works.

API

class nemo_curator.tasks.audio_task.AudioTask(
task_id: str = '',
dataset_name: str = '',
data: dict = _AttrDict(),
_stage_perf: list[nemo_curator.utils.performance_utils.StagePerfStats] = list(),
_metadata: dict[str, typing.Any] = dict(),
filepath_key: str | None = None
)
Dataclass

Bases: Task[dict]

A single audio manifest entry.

Represents one line from a JSONL manifest file (e.g. one audio file with its metadata). data is always a single dict, never a list.

Matches the VideoTask naming convention used by the video modality.

Parameters:

data
dictDefaults to _AttrDict()

Manifest entry dict (e.g. {"audio_filepath": "...", "text": "..."}).

filepath_key
str | NoneDefaults to None

Optional key whose value is validated as an existing path.

data
dict = field(default_factory=_AttrDict)
dataset_name
str = ''
filepath_key
str | None = None
num_items
int
task_id
str = ''
nemo_curator.tasks.audio_task.AudioTask.__post_init__()
nemo_curator.tasks.audio_task.AudioTask.validate() -> bool

Validate the task data.

class nemo_curator.tasks.audio_task._AttrDict()

Bases: dict

Dict subclass exposing keys as attributes so hasattr works.

nemo_curator.tasks.audio_task._AttrDict.__delattr__(
key: str
)
nemo_curator.tasks.audio_task._AttrDict.__getattr__(
key: str
)
nemo_curator.tasks.audio_task._AttrDict.__setattr__(
key: str,
value: object
) -> None