nemo_automodel.components.datasets.multimodal
BAGEL-style multimodal data pipeline for packed three-group training.
Ports the subset of upstream BAGEL data needed to feed BAGEL training from
fully AM-native code: VLM SFT, T2I pretrain, and unified image editing. The
packed batch schema is shared by Stage 1 and Stage 2; whether VAE /
flow-matching tensors are consumed is controlled by the model stage.
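As a rough illustration of the last sentence, the sketch below shows how a single packed batch could serve both stages, with the model side deciding whether the generation tensors are read. The key names (`packed_text_ids`, `packed_position_ids`, `packed_vae_images`, `packed_timesteps`) and the `select_inputs` helper are hypothetical and are not taken from this package. Which stage actually consumes the VAE / flow-matching tensors is not stated here, so the choice is left as an explicit flag.

```python
from typing import Any, Dict

# Hypothetical key names, chosen only for this illustration.
UNDERSTANDING_KEYS = ("packed_text_ids", "packed_position_ids")
GENERATION_KEYS = ("packed_vae_images", "packed_timesteps")


def select_inputs(batch: Dict[str, Any], consume_generation: bool) -> Dict[str, Any]:
    """Return the subset of a packed batch that a model stage would read.

    The batch schema is the same in both cases; only the set of keys
    that are consumed changes.
    """
    keys = UNDERSTANDING_KEYS + (GENERATION_KEYS if consume_generation else ())
    return {k: batch[k] for k in keys if k in batch}


# One batch with the shared schema (placeholder values instead of tensors).
batch = {
    "packed_text_ids": [1, 2, 3],
    "packed_position_ids": [0, 1, 2],
    "packed_vae_images": ["<image tensor>"],
    "packed_timesteps": [0.25],
}

text_only = select_inputs(batch, consume_generation=False)
with_generation = select_inputs(batch, consume_generation=True)
```

Keeping the schema fixed and filtering on the consumer side means the collate and packing code does not need a per-stage branch.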
Submodules
nemo_automodel.components.datasets.multimodal.collate_fns
nemo_automodel.components.datasets.multimodal.datasets
nemo_automodel.components.datasets.multimodal.distributed_iterable
nemo_automodel.components.datasets.multimodal.interleave
nemo_automodel.components.datasets.multimodal.packing
nemo_automodel.components.datasets.multimodal.parquet_utils
nemo_automodel.components.datasets.multimodal.transforms
nemo_automodel.components.datasets.multimodal.utils
nemo_automodel.components.datasets.multimodal.video