AudioTask is the task type for audio processing in NeMo Curator. Each AudioTask holds a single manifest entry as a dict, matching the convention used by VideoTask and FileGroupTask.
Audio data follows the NeMo manifest format:
num_itemsAlways returns 1 — each AudioTask holds exactly one manifest entry.
All audio stages subclass ProcessingStage[AudioTask, AudioTask] directly — there is no intermediate base class.