aistore.pytorch.parallel_map_dataset
aistore.pytorch.parallel_map_dataset
PyTorch Map-style Dataset with parallel download acceleration.
Copyright (c) 2026, NVIDIA CORPORATION. All rights reserved.
Module Contents
Classes
API
Bases: AISBaseMapDataset
Map-style dataset that uses parallel download to fetch objects.
Parallel download splits each object into byte ranges and fetches them
concurrently using num_workers workers.
__getitem__ returns (object_name, ParallelBuffer). The caller (or
PyTorch DataLoader collate function) is responsible for consuming and
closing the ParallelBuffer.
Parameters:
ais_source_list
Single or list of AISSource objects to load data.
prefix_map
Map of AISSource to prefix(es) for filtering objects.
num_workers
Number of concurrent range-read workers per object.
num_workers
Number of concurrent range-read workers per object.