nemo_automodel.components.models.bagel.hf_backbone_loader
Helpers for initializing BAGEL pretraining runs from HF backbones.
Module Contents
Functions
Data
API
Copy the understanding-branch (UND) Qwen weights into their *_moe_gen siblings after sharding/wrapping.
Load a HF safetensors/bin checkpoint as a full CPU state dict.
Load vanilla Qwen weights into BAGEL’s language model after Automodel (AM) sharding.
Load SigLIP weights into BAGEL’s packed-NaViT vision model after Automodel (AM) sharding.
Load a SigLIP vision config from a vision-only or full SigLIP HF folder.
Remove wrapper path fragments that are not part of logical parameter FQNs.
Reset BAGEL-added Q/K norm weights missing from vanilla Qwen checkpoints.
Resolve a local path or download a HF snapshot containing model weights.
Build BAGEL from upstream Qwen/SigLIP backbone configs.
Initialize BAGEL-owned modules not loaded from Qwen/SigLIP checkpoints.
Load Qwen/SigLIP HF backbone weights into an already-built BAGEL model.
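
The helper summarized above as "Remove wrapper path fragments that are not part of logical parameter FQNs" can be illustrated with a minimal sketch. The function names and the list of fragments below are assumptions for illustration only; the fragments shown are the attribute names PyTorch inserts for FSDP wrapping, activation-checkpoint wrapping, and `torch.compile`, and the set this module actually strips is not stated here.

```python
from typing import Any, Dict

# Assumed wrapper fragments (hypothetical list, not taken from this module):
# attribute names that PyTorch wrappers add to parameter paths.
WRAPPER_FRAGMENTS = (
    "_fsdp_wrapped_module",        # FullyShardedDataParallel wrapper
    "_checkpoint_wrapped_module",  # activation-checkpoint wrapper
    "_orig_mod",                   # torch.compile wrapper
)


def strip_wrapper_fragments(fqn: str) -> str:
    """Return the logical parameter name with wrapper path parts removed."""
    parts = [part for part in fqn.split(".") if part not in WRAPPER_FRAGMENTS]
    return ".".join(parts)


def normalize_state_dict_keys(state_dict: Dict[str, Any]) -> Dict[str, Any]:
    """Re-key a state dict by logical FQN, refusing silent key collisions."""
    normalized: Dict[str, Any] = {}
    for key, value in state_dict.items():
        clean = strip_wrapper_fragments(key)
        if clean in normalized:
            raise KeyError(f"duplicate logical FQN after normalization: {clean}")
        normalized[clean] = value
    return normalized
```

Normalizing keys this way lets names from a wrapped, sharded model be matched against the plain names found in a vanilla HF checkpoint.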