nemo_automodel.components.datasets.diffusion.text_to_video_dataset
Module Contents
Classes
Functions
Data
API
Bases: BaseMultiresolutionDataset
Text-to-Video dataset with multiresolution bucket organization.
Loads preprocessed .meta files organized by resolution bucket. Compatible with SequentialBucketSampler for multiresolution training.
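As a rough illustration of what "organized by resolution bucket" means, the sketch below groups sample indices by resolution and then yields batches that never mix buckets. It is not the library's implementation: `group_by_bucket`, `iter_bucket_batches`, and the `(height, width)` bucket key are hypothetical names chosen for this example, and the real SequentialBucketSampler may order or shuffle buckets differently.

```python
from collections import defaultdict
from typing import Dict, Iterator, List, Tuple

# Hypothetical bucket key: (height, width) of the preprocessed sample.
Resolution = Tuple[int, int]


def group_by_bucket(resolutions: List[Resolution]) -> Dict[Resolution, List[int]]:
    """Map each resolution bucket to the indices of the samples it contains."""
    buckets: Dict[Resolution, List[int]] = defaultdict(list)
    for index, resolution in enumerate(resolutions):
        buckets[resolution].append(index)
    return dict(buckets)


def iter_bucket_batches(
    buckets: Dict[Resolution, List[int]], batch_size: int
) -> Iterator[List[int]]:
    """Yield index batches, exhausting one bucket before moving to the next."""
    for resolution in sorted(buckets):
        indices = buckets[resolution]
        for start in range(0, len(indices), batch_size):
            # Every batch comes from a single bucket, so all samples in it
            # share one resolution and can be stacked without padding.
            yield indices[start:start + batch_size]


# Five samples spread over two buckets.
resolutions = [(480, 832), (720, 1280), (480, 832), (480, 832), (720, 1280)]
buckets = group_by_bucket(resolutions)
batches = list(iter_bucket_batches(buckets, batch_size=2))
```

Keeping each batch inside one bucket is the property that makes a bucketed dataset usable for multiresolution training; the last batch of a bucket may simply be smaller than `batch_size`.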
Load a single video sample from its .meta file.
Concatenate the optional video fields present in the batch into the result dict.
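The idea of collating optional fields can be sketched as follows. This is a dependency-free approximation, not the module's code: the field names in `OPTIONAL_FIELDS` and the function `collate_optional_fields` are hypothetical, plain lists stand in for tensors, and the rule that a field is collated only when every sample carries it is an assumption. In real training code the concatenation would typically be done with `torch.cat` along the batch dimension.

```python
from typing import Any, Dict, List, Sequence

# Hypothetical optional field names; the real dataset may use others.
OPTIONAL_FIELDS = ("image_embeds", "first_frame_latents")


def collate_optional_fields(
    batch: Sequence[Dict[str, Any]],
    result: Dict[str, Any],
    fields: Sequence[str] = OPTIONAL_FIELDS,
) -> Dict[str, Any]:
    """Concatenate each optional field found in the batch into ``result``."""
    for name in fields:
        values = [sample[name] for sample in batch if name in sample]
        # Assumption: skip the field unless every sample provides it, so the
        # collated field always lines up with the batch dimension.
        if not values or len(values) != len(batch):
            continue
        merged: List[Any] = []
        for value in values:
            merged.extend(value)  # list stand-in for concatenation along dim 0
        result[name] = merged
    return result


batch = [
    {"latents": [[0.1]], "image_embeds": [[1, 2]]},
    {"latents": [[0.2]], "image_embeds": [[3, 4]]},
]
result = collate_optional_fields(batch, {"latents": [[0.1], [0.2]]})
```

Fields that no sample provides are left out of the result entirely, so downstream code can test for their presence with a simple `in` check.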
Extract the optional model-specific fields, moving them to the target device.