`nemo_automodel.components.models.biencoder.state_dict_adapter`

Module Contents

Classes

BiencoderStateDictAdapter

Adapter for converting BiencoderModel state dict to single encoder format.

API

class nemo_automodel.components.models.biencoder.state_dict_adapter.BiencoderStateDictAdapter

Bases: nemo_automodel.components.checkpoint.state_dict_adapter.StateDictAdapter

Adapter for converting BiencoderModel state dict to single encoder format.

This adapter extracts only the query encoder (lm_q) state dict and rewrites the “lm_q.” prefix as “model.”, making the result compatible with the standard HuggingFace model format.

Initialization

Initialize the adapter.

_PEFT_PREFIX: 'base_model.model.'

to_hf(state_dict: dict[str, Any], **kwargs) → dict[str, Any]

Convert from biencoder state dict to HuggingFace format.

Filters to only lm_q keys and converts “lm_q.” prefix to “model.” prefix. Also handles the base_model.model. prefix that PEFT checkpointing adds so that adapter weights are correctly converted (e.g. base_model.model.lm_q.X → base_model.model.model.X).

Parameters:

state_dict – The biencoder model state dict

Returns:

The converted HuggingFace format state dict containing only the query encoder
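The key mapping described for to_hf can be illustrated with a small standalone sketch. It does not import or call the real adapter: `sketch_to_hf` and the `lm_p.` key are hypothetical names used only for illustration, and plain integers stand in for tensors.

```python
from typing import Any

# Mirrors the documented _PEFT_PREFIX value.
PEFT_PREFIX = "base_model.model."


def sketch_to_hf(state_dict: dict[str, Any]) -> dict[str, Any]:
    """Hypothetical stand-in for the documented to_hf key mapping."""
    converted: dict[str, Any] = {}
    peft_query_prefix = PEFT_PREFIX + "lm_q."
    for key, value in state_dict.items():
        if key.startswith(peft_query_prefix):
            # PEFT adapter weight: keep the PEFT prefix, swap lm_q. for model.
            suffix = key[len(peft_query_prefix):]
            converted[PEFT_PREFIX + "model." + suffix] = value
        elif key.startswith("lm_q."):
            # Plain query encoder weight: lm_q.X becomes model.X
            converted["model." + key[len("lm_q."):]] = value
        # Any other key is dropped, since only lm_q keys are kept.
    return converted


example = {
    "lm_q.layers.0.weight": 1,
    "lm_p.layers.0.weight": 2,  # hypothetical non-query key, filtered out
    "base_model.model.lm_q.layers.0.lora_A.weight": 3,
}
hf_keys = sketch_to_hf(example)
```

In this sketch, `hf_keys` keeps two entries: `model.layers.0.weight` and `base_model.model.model.layers.0.lora_A.weight`.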

from_hf(hf_state_dict: dict[str, Any], device_mesh: Optional[torch.distributed.device_mesh.DeviceMesh] = None, **kwargs) → dict[str, Any]

Convert HuggingFace state dict to biencoder format.

Converts “model.” prefix to “lm_q.” prefix for loading into biencoder. Also handles the base_model.model. prefix used by PEFT checkpoints (e.g. base_model.model.model.X → base_model.model.lm_q.X).

Parameters:

hf_state_dict – The HuggingFace format state dict
device_mesh – Optional device mesh (not used in this adapter)

Returns:

The converted biencoder format state dict
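The reverse mapping can be sketched the same way. This is a standalone illustration rather than the adapter's code: `sketch_from_hf` is a hypothetical name, integers stand in for tensors, and skipping keys that carry neither prefix is an assumption that the description above does not confirm.

```python
from typing import Any

# Mirrors the documented _PEFT_PREFIX value.
PEFT_PREFIX = "base_model.model."


def sketch_from_hf(hf_state_dict: dict[str, Any]) -> dict[str, Any]:
    """Hypothetical stand-in for the documented from_hf key mapping."""
    converted: dict[str, Any] = {}
    peft_model_prefix = PEFT_PREFIX + "model."
    for key, value in hf_state_dict.items():
        if key.startswith(peft_model_prefix):
            # PEFT checkpoint key: base_model.model.model.X -> base_model.model.lm_q.X
            suffix = key[len(peft_model_prefix):]
            converted[PEFT_PREFIX + "lm_q." + suffix] = value
        elif key.startswith("model."):
            # Plain HuggingFace key: model.X -> lm_q.X
            converted["lm_q." + key[len("model."):]] = value
        # Assumption: keys with neither prefix are skipped in this sketch.
    return converted


hf_example = {
    "model.layers.0.weight": 1,
    "base_model.model.model.layers.0.lora_A.weight": 3,
}
biencoder_keys = sketch_from_hf(hf_example)
```

Here `biencoder_keys` holds `lm_q.layers.0.weight` and `base_model.model.lm_q.layers.0.lora_A.weight`, the inverse of the renaming that to_hf is described as performing.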

convert_single_tensor_to_hf(fqn: str, tensor: Any, **kwargs) → list[tuple[str, Any]]

Convert a single tensor from biencoder format to HuggingFace format.
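The return type suggests that each input tensor maps to a list of (name, tensor) pairs. A minimal sketch of that shape follows, assuming the same prefix rules as to_hf apply per key. `sketch_convert_single` is a hypothetical name, and returning an empty list for non-query keys is an assumption, not documented behavior.

```python
from typing import Any

# Mirrors the documented _PEFT_PREFIX value.
PEFT_PREFIX = "base_model.model."


def sketch_convert_single(fqn: str, tensor: Any) -> list[tuple[str, Any]]:
    """Hypothetical per-tensor version of the lm_q. -> model. renaming."""
    peft_query_prefix = PEFT_PREFIX + "lm_q."
    if fqn.startswith(peft_query_prefix):
        # Keep the PEFT prefix and rename the encoder segment.
        new_name = PEFT_PREFIX + "model." + fqn[len(peft_query_prefix):]
        return [(new_name, tensor)]
    if fqn.startswith("lm_q."):
        return [("model." + fqn[len("lm_q."):], tensor)]
    # Assumption: a key outside the query encoder yields no output pairs.
    return []


pairs = sketch_convert_single("lm_q.embed_tokens.weight", 0.5)
skipped = sketch_convert_single("other.embed_tokens.weight", 0.5)
```

Returning a list rather than a single pair lets the caller handle zero or one result uniformly, which is one plausible reason for the documented signature.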