aistore.pytorch.shard_reader
AIS Shard Reader for PyTorch
PyTorch Dataset and DataLoader for AIS.
Copyright (c) 2024-2025, NVIDIA CORPORATION. All rights reserved.
Module Contents
Classes
API
Bases: AISBaseIterDataset
An iterable-style dataset that iterates over objects stored as WebDataset shards and yields each sample as a tuple of its basename (str) and its contents (dictionary).
Parameters:
A single Bucket object or a list of Bucket objects to load data from
Map of Bucket objects to list of prefixes that only allows
Optional ETL on the AIS cluster to apply to each object
Enables console shard reading progress indicator
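To make the yielded sample format concrete, here is a minimal standard-library sketch of how files in a tar shard can be grouped into (basename, contents) tuples. It is not the library's implementation: build_shard and iter_samples are hypothetical helpers, and the sketch assumes the WebDataset convention that files sharing a basename are adjacent in the archive and that the contents dictionary maps file extension to raw bytes.

```python
import io
import tarfile
from typing import Dict, Iterator, Tuple


def build_shard(files: Dict[str, bytes]) -> bytes:
    """Hypothetical helper: pack name -> bytes pairs into an in-memory tar shard."""
    buffer = io.BytesIO()
    with tarfile.open(fileobj=buffer, mode="w") as tar:
        for name, payload in files.items():
            info = tarfile.TarInfo(name=name)
            info.size = len(payload)
            tar.addfile(info, io.BytesIO(payload))
    return buffer.getvalue()


def iter_samples(shard: bytes) -> Iterator[Tuple[str, Dict[str, bytes]]]:
    """Hypothetical helper: yield (basename, {extension: bytes}) per sample.

    Assumes files of one sample are adjacent in the archive and that the
    basename is everything before the first dot in the member name.
    """
    current_name = None
    current_files: Dict[str, bytes] = {}
    with tarfile.open(fileobj=io.BytesIO(shard), mode="r") as tar:
        for member in tar:
            if not member.isfile():
                continue  # skip directories and other non-file entries
            basename, _, extension = member.name.partition(".")
            if current_name is not None and basename != current_name:
                # Basename changed: the previous sample is complete.
                yield current_name, current_files
                current_files = {}
            current_name = basename
            current_files[extension] = tar.extractfile(member).read()
    if current_name is not None:
        yield current_name, current_files  # emit the final sample


shard = build_shard(
    {
        "sample_000.jpg": b"jpeg-bytes",
        "sample_000.cls": b"3",
        "sample_001.jpg": b"more-bytes",
        "sample_001.cls": b"7",
    }
)
samples = list(iter_samples(shard))
for basename, contents in samples:
    print(basename, sorted(contents))
```

Each iteration produces one basename together with every file that shares it, which is the shape a training loop would unpack when consuming this kind of dataset.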
Returns the length of the dataset. Note that calling this will iterate through the dataset, taking O(N) time.
NOTE: If you only need the length after iterating through the dataset, count the samples during that pass with for i, data in enumerate(dataset) instead.
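The cost difference can be sketched with a toy stand-in for an iterable-style dataset. CountingStream below is a hypothetical class, not part of aistore; it only assumes, as the note above states, that computing the length requires a full pass over the data. Counting with enumerate during the pass you already need avoids the second traversal.

```python
class CountingStream:
    """Hypothetical iterable whose length can only be found by walking every item."""

    def __init__(self, num_items: int):
        self.num_items = num_items
        self.passes = 0  # number of full traversals started so far

    def __iter__(self):
        self.passes += 1
        for i in range(self.num_items):
            # Same (basename, contents) shape as the samples described above.
            yield f"sample_{i:03d}", {"cls": str(i).encode()}

    def __len__(self):
        # O(N): consumes a complete pass just to count the items.
        return sum(1 for _ in self)


dataset = CountingStream(3)

# Count while processing: one traversal does both jobs.
count = 0
for i, (basename, contents) in enumerate(dataset):
    count = i + 1  # enumerate is zero-based
passes_after_loop = dataset.passes

# Calling len() afterward triggers another full traversal.
length = len(dataset)
passes_after_len = dataset.passes

print(count, passes_after_loop, length, passes_after_len)
```

With a dataset backed by remote shards, that extra traversal means reading every shard again, so the enumerate pattern is usually the cheaper choice.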