bridge.training.vlm_step#

Module Contents#

Functions#

• get_batch_from_iterator – Get a batch of data from the iterator.

• get_batch – Generate a batch.

• forward_step – Forward training step.

Data#

API#

bridge.training.vlm_step.logger#

'getLogger(…)'

bridge.training.vlm_step.get_batch_from_iterator(
data_iterator: Iterable,
use_mtp: bool = False,
skip_getting_attention_mask_from_dataset: bool = True,
) → dict[str, Any]#

Get a batch of data from the iterator.

Parameters:
  • data_iterator – The data iterator to get the batch from.

  • use_mtp – Whether Multi-Token Prediction layers are enabled.

  • skip_getting_attention_mask_from_dataset – If set, the attention mask is not read from the dataset and None is passed in its place.

Returns:

A dictionary containing the batch data.

Return type:

dict[str, torch.Tensor]
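
A minimal usage sketch (not part of this module's documentation): it assumes a dataloader that yields the dict-shaped samples this function expects, and the batch key shown is illustrative only.

```python
from megatron.bridge.training.vlm_step import get_batch_from_iterator

# Assumption: `train_dataloader` is an existing iterable of VLM training samples.
data_iterator = iter(train_dataloader)

batch = get_batch_from_iterator(
    data_iterator,
    use_mtp=False,
    skip_getting_attention_mask_from_dataset=True,  # dataset supplies no attention mask
)

# The concrete keys depend on the dataset; "tokens" is shown only for illustration.
tokens = batch.get("tokens")
```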

bridge.training.vlm_step.get_batch(
data_iterator: Iterable,
cfg: megatron.bridge.training.config.ConfigContainer,
use_mtp: bool = False,
) → tuple[torch.Tensor, torch.Tensor, torch.Tensor, torch.Tensor, torch.Tensor, torch.Tensor, torch.Tensor, torch.Tensor, Any]#

Generate a batch.

Parameters:
  • data_iterator – Input data iterator

  • cfg – Configuration container

  • use_mtp – Whether Multi-Token Prediction layers are enabled

Returns:

Tuple containing tokens, labels, loss_mask, attention_mask, position_ids, cu_seqlens, cu_seqlens_argmin, max_seqlen, and visual_inputs (a container of optional modality inputs).
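
A hedged sketch of unpacking the returned tuple, assuming `cfg` is an already-built ConfigContainer and `train_dataloader` yields the expected samples; the names mirror the return description above.

```python
from megatron.bridge.training.vlm_step import get_batch

# Assumptions: `cfg` (ConfigContainer) and `train_dataloader` are created elsewhere.
(
    tokens,
    labels,
    loss_mask,
    attention_mask,
    position_ids,
    cu_seqlens,
    cu_seqlens_argmin,
    max_seqlen,
    visual_inputs,  # container of optional modality inputs (e.g. images)
) = get_batch(iter(train_dataloader), cfg, use_mtp=False)
```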

bridge.training.vlm_step.forward_step(
state: megatron.bridge.training.state.GlobalState,
data_iterator: Iterable,
model: megatron.core.models.gpt.GPTModel,
return_schedule_plan: bool = False,
) → tuple[torch.Tensor, functools.partial]#

Forward training step.

Parameters:
  • state – Global state for the run

  • data_iterator – Input data iterator

  • model – The GPT Model

  • return_schedule_plan (bool) – Whether to return the schedule plan instead of the output tensor

Returns:

Tuple containing the output tensor and the loss function (a functools.partial).
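
A rough sketch of how a training loop might invoke forward_step, assuming `state`, `data_iterator`, and `model` are constructed elsewhere; applying the returned partial loss function directly is shown only as an assumption, since Megatron-style schedules normally apply it for you.

```python
from megatron.bridge.training.vlm_step import forward_step

# Assumptions: `state` (GlobalState), `data_iterator`, and `model` (GPTModel)
# are set up by the surrounding training framework.
output_tensor, loss_func = forward_step(state, data_iterator, model)

# In Megatron-style pipelines the schedule usually applies `loss_func` to the
# model output; calling it directly here is illustrative only.
loss = loss_func(output_tensor)
```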