nemo_automodel.components.loss.chunked_ce
nemo_automodel.components.loss.chunked_ce
Module Contents
Classes
Functions
Data
_compiled_compute_cross_entropy
API
Bases: Module
Cross-entropy loss computed over sequence chunks.
Computes cross-entropy loss in chunks to handle long sequences more efficiently.
Parameters:
Model output logits of shape [batch_size, seq_len, vocab_size].
Ground-truth labels of shape [batch_size, seq_len].
Boolean mask indicating valid positions (1) and positions to ignore (0). Defaults to None.
Returns: torch.Tensor
torch.Tensor: The sum of cross-entropy losses over the sequence.
Computes the cross-entropy loss between logits and targets.
Parameters:
Model predictions of shape (sequence_length, num_classes).
Ground-truth labels of shape (sequence_length,).
Target value that is ignored when computing the loss. Defaults to -100.
Returns:
torch.Tensor: The sum of cross-entropy losses over the sequence.