nemo_automodel.components.datasets.diffusion.base_dataset#
Module Contents#
Classes#
Abstract base class for multiresolution datasets with bucket-based sampling. |
Data#
API#
- nemo_automodel.components.datasets.diffusion.base_dataset.logger#
‘getLogger(…)’
- class nemo_automodel.components.datasets.diffusion.base_dataset.BaseMultiresolutionDataset(cache_dir: str, quantization: int = 64)#
Bases:
torch.utils.data.Dataset,abc.ABCAbstract base class for multiresolution datasets with bucket-based sampling.
Initialization
- Parameters:
cache_dir – Directory containing preprocessed cache (metadata.json + shards)
quantization – Resolution quantization factor (64 for images, 8 for video)
- _load_metadata() List[Dict]#
Load metadata from cache directory.
Expects metadata.json with “shards” key referencing shard files.
- _aspect_ratio_to_name(aspect_ratio: float) str#
Convert aspect ratio to a descriptive name.
- _group_by_bucket()#
Group samples by bucket (aspect_ratio + resolution).
- get_bucket_info() Dict#
Get bucket organization information.
- __len__() int#
- abstractmethod __getitem__(idx: int) Dict#
Load a single sample. Subclasses must implement.