nemo_automodel.components.models.bagel.connector
Projection layers that connect BAGEL vision features to text hidden states.
Module Contents
Classes
_Activation | Wrap a Transformers activation callable in an nn.Module.
BagelMultiModalProjector | Project vision-tower features into the language-model hidden size.
API
- class nemo_automodel.components.models.bagel.connector._Activation(name: str)
Bases: torch.nn.Module
Wrap a Transformers activation callable in an nn.Module.
Initialization
- forward(hidden_states)
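The wrapper pattern described above can be sketched as follows. This is an illustration only, not the library's implementation: the class name `ActivationSketch` is hypothetical, and the lookup through `transformers.activations.ACT2FN` (with a small PyTorch fallback table when Transformers is not installed) is an assumption about how the activation name might be resolved.

```python
import torch
import torch.nn.functional as F
from torch import nn

try:
    # Transformers' name-to-activation table (assumed lookup mechanism).
    from transformers.activations import ACT2FN
except ImportError:
    # Fallback stand-in so the sketch runs without Transformers installed.
    ACT2FN = {"gelu": F.gelu, "relu": F.relu, "silu": F.silu}


class ActivationSketch(nn.Module):
    """Hypothetical stand-in for _Activation: wraps a named activation callable."""

    def __init__(self, name: str):
        super().__init__()
        self.fn = ACT2FN[name]  # resolve the callable by its string name

    def forward(self, hidden_states):
        # Apply the activation elementwise; the shape is unchanged.
        return self.fn(hidden_states)


act = ActivationSketch("relu")
activated = act(torch.tensor([-1.0, 0.0, 2.0]))
```

Wrapping the callable in an `nn.Module` matters because containers such as `torch.nn.Sequential` accept only modules, not bare functions.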
- class nemo_automodel.components.models.bagel.connector.BagelMultiModalProjector(in_dim: int, out_dim: int, hidden_act: str)
Bases: torch.nn.Sequential
Project vision-tower features into the language-model hidden size.
Initialization
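As a rough illustration of how a projector with this signature might be used, the sketch below builds a `torch.nn.Sequential` that maps features of width `in_dim` to width `out_dim`. The layer layout (linear, activation, linear) is an assumption, since this page does not document the internal layers; the class name `ProjectorSketch` and the `_ACTIVATIONS` table are hypothetical stand-ins rather than part of the library.

```python
import torch
from torch import nn

# Hypothetical name-to-module table standing in for the real activation lookup.
_ACTIVATIONS = {"gelu": nn.GELU, "relu": nn.ReLU, "silu": nn.SiLU}


class ProjectorSketch(nn.Sequential):
    """Hypothetical stand-in for BagelMultiModalProjector (layer layout assumed)."""

    def __init__(self, in_dim: int, out_dim: int, hidden_act: str):
        super().__init__(
            nn.Linear(in_dim, out_dim),   # vision width -> language-model width
            _ACTIVATIONS[hidden_act](),   # nonlinearity chosen by name
            nn.Linear(out_dim, out_dim),  # refine within the language-model width
        )


# Batch of 2 images, 16 patch tokens each, vision feature width 32.
vision_features = torch.randn(2, 16, 32)
projector = ProjectorSketch(in_dim=32, out_dim=64, hidden_act="gelu")
projected = projector(vision_features)
print(projected.shape)  # torch.Size([2, 16, 64])
```

Only the last dimension changes, so the projected tokens can sit alongside text hidden states of width `out_dim`.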