nemo_automodel.components.models.llava_onevision.state_dict_adapter
nemo_automodel.components.models.llava_onevision.state_dict_adapter
State dict adapter for LLaVA-OneVision-1.5.
HF on-disk safetensors layout (from lmms-lab/LLaVA-OneVision-1.5-): visual.{patch_embed,class_embedding,class_pos_emb,pre_layernorm,blocks.,merger.} model.{embed_tokens,layers.,norm} lm_head.weight
Applies the same regex rename HF does via _checkpoint_conversion_mapping:
^visual -> model.visual
^model(?!.(language_model|visual)) -> model.language_model