nemo_automodel.components.models.bagel.connector#

Projection layers that connect BAGEL vision features to text hidden states.

Module Contents#

Classes#

_Activation

Wrap a Transformers activation callable in an nn.Module.

BagelMultiModalProjector

Project vision-tower features into the language-model hidden size.

API#

class nemo_automodel.components.models.bagel.connector._Activation(name: str)#

Bases: torch.nn.Module

Wrap a Transformers activation callable in an nn.Module.

Initialization

forward(hidden_states)#
class nemo_automodel.components.models.bagel.connector.BagelMultiModalProjector(in_dim: int, out_dim: int, hidden_act: str)#

Bases: torch.nn.Sequential

Project vision-tower features into the language-model hidden size.

Initialization