nemo_automodel.components.models.qwen3_5.state_dict_adapter
State-dict adapter for Qwen3.5 dense (non-MoE) models.
Qwen3.5 dense keeps its GatedDeltaNet SSM-gating parameters (A_log and
dt_bias) in an fp32 holder named _fp32_params. The model's state dict
therefore contains keys of the form ...linear_attn._fp32_params.A_log
instead of the original ...linear_attn.A_log.
This adapter renames keys at the save and load boundaries so that on-disk
checkpoints match the original HF Qwen3.5 layout (bare A_log) and can be
loaded directly with transformers.AutoModelForImageTextToText.from_pretrained.
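The renaming described above can be illustrated with a minimal sketch. It is not the adapter's actual implementation: the helper names (to_hf_key, from_hf_key, rename_state_dict) are hypothetical, and the sketch assumes that dt_bias is renamed in the same way as A_log and that the holder always sits directly under a linear_attn module.

```python
# Illustrative sketch only; all helper names here are hypothetical.
FP32_HOLDER = "_fp32_params"
FP32_PARAM_NAMES = ("A_log", "dt_bias")  # assumption: both are renamed alike


def to_hf_key(key: str) -> str:
    """Save direction: drop the holder segment from a model key."""
    for name in FP32_PARAM_NAMES:
        suffix = f".{FP32_HOLDER}.{name}"
        if key.endswith(suffix):
            return key[: -len(suffix)] + f".{name}"
    return key


def from_hf_key(key: str) -> str:
    """Load direction: re-insert the holder segment into an HF key."""
    for name in FP32_PARAM_NAMES:
        if key.endswith(f".linear_attn.{name}"):
            return key[: -len(f".{name}")] + f".{FP32_HOLDER}.{name}"
    return key


def rename_state_dict(state_dict: dict, rename) -> dict:
    """Apply a key-renaming function to every entry, leaving values untouched."""
    return {rename(key): value for key, value in state_dict.items()}
```

Keys that do not match either pattern pass through unchanged, so the two functions are inverses of each other on the affected keys and no-ops everywhere else.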
Module Contents
Classes
Functions
Data
API
Map HF Qwen3.5 MTP keys to Automodel’s Megatron-style MTP module.
Map Automodel Qwen3.5 MTP keys back to HF checkpoint keys.