nemo_automodel.components.models.qwen2.state_dict_adapter

State dict adapter for Qwen2 model with combined projections.

Uses the generic CombinedProjectionStateDictAdapter from common/.

Module Contents

Classes

Qwen2StateDictAdapter

State dict adapter for Qwen2 models.

API

class nemo_automodel.components.models.qwen2.state_dict_adapter.Qwen2StateDictAdapter(config: transformers.Qwen2Config)

Bases: nemo_automodel.components.models.common.combined_projection.state_dict_adapter.CombinedProjectionStateDictAdapter

State dict adapter for Qwen2 models.

Inherits from the generic CombinedProjectionStateDictAdapter, providing a clean interface specific to Qwen2.
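To make the idea of "combined projections" concrete, the following is a conceptual sketch only, not the adapter's actual implementation. It assumes that the separate `q_proj`, `k_proj`, and `v_proj` weights of a Hugging Face checkpoint are fused into a single matrix by plain concatenation along the output dimension. The fused key name `qkv_proj`, the helper names `combine_qkv` and `split_qkv`, and the concatenation layout are all hypothetical; the real adapter may order or interleave rows differently. Nested lists stand in for weight tensors so that the sketch runs without any dependencies.

```python
# Conceptual sketch: nested lists stand in for weight tensors. The fused key
# name (qkv_proj) and the plain row-wise concatenation are assumptions, not
# the documented behavior of CombinedProjectionStateDictAdapter.


def combine_qkv(state_dict, prefix):
    """Fuse separate q/k/v projection weights into one row-stacked matrix."""
    out = dict(state_dict)
    q = out.pop(f"{prefix}.q_proj.weight")
    k = out.pop(f"{prefix}.k_proj.weight")
    v = out.pop(f"{prefix}.v_proj.weight")
    # Concatenating the row lists stacks the matrices along the output dim.
    out[f"{prefix}.qkv_proj.weight"] = q + k + v
    return out


def split_qkv(state_dict, prefix, q_rows, kv_rows):
    """Split a fused matrix back into separate q/k/v projection weights."""
    out = dict(state_dict)
    qkv = out.pop(f"{prefix}.qkv_proj.weight")
    out[f"{prefix}.q_proj.weight"] = qkv[:q_rows]
    out[f"{prefix}.k_proj.weight"] = qkv[q_rows:q_rows + kv_rows]
    out[f"{prefix}.v_proj.weight"] = qkv[q_rows + kv_rows:]
    return out


prefix = "model.layers.0.self_attn"
hf_like = {
    f"{prefix}.q_proj.weight": [[1, 2], [3, 4]],  # 2 query rows
    f"{prefix}.k_proj.weight": [[5, 6]],          # 1 key row
    f"{prefix}.v_proj.weight": [[7, 8]],          # 1 value row
}

combined = combine_qkv(hf_like, prefix)
restored = split_qkv(combined, prefix, q_rows=2, kv_rows=1)
```

The split step needs the query and key/value row counts, which is presumably why the adapter is constructed from a `Qwen2Config`: the config carries the head counts and hidden size from which such sizes can be derived.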

Example

    from transformers import Qwen2Config

    from nemo_automodel.components.models.qwen2.state_dict_adapter import Qwen2StateDictAdapter

    config = Qwen2Config.from_pretrained("Qwen/Qwen2.5-7B")
    adapter = Qwen2StateDictAdapter(config)

    # Convert HF checkpoint to custom format
    # (hf_state_dict is a state dict loaded from a Hugging Face Qwen2 checkpoint)
    custom_state_dict = adapter.from_hf(hf_state_dict)

    # Convert custom checkpoint back to HF format
    hf_state_dict = adapter.to_hf(custom_state_dict)

Initialization

Initialize adapter with Qwen2 config.