`nemo_automodel.components.models.minimax_m2.state_dict_adapter`#

Module Contents#

Classes#

MiniMaxM2StateDictAdapter

Convert between MiniMax-M2.1 HF checkpoints and native grouped-expert format.

Functions#

should_quantize_key

Data#

NON_QUANTIZED_KEY_PATTERNS

API#

nemo_automodel.components.models.minimax_m2.state_dict_adapter.NON_QUANTIZED_KEY_PATTERNS#: [‘input_layernorm.weight’, ‘post_attention_layernorm.weight’, ‘norm.weight’, ‘lm_head.weight’, ‘embe…

nemo_automodel.components.models.minimax_m2.state_dict_adapter.should_quantize_key(key: str) → bool#

class nemo_automodel.components.models.minimax_m2.state_dict_adapter.MiniMaxM2StateDictAdapter( config: Any, moe_config: nemo_automodel.components.moe.layers.MoEConfig, backend: nemo_automodel.components.models.common.BackendConfig, dtype: torch.dtype = torch.float32, )#

Bases: nemo_automodel.components.moe.state_dict_mixin.MoESplitExpertsStateDictMixin, nemo_automodel.components.checkpoint.state_dict_adapter.StateDictAdapter