nemo_automodel.components.models.glm_moe_dsa.state_dict_adapter

View as Markdown

Module Contents

Classes

NameDescription
GlmMoeDsaStateDictAdapterConverts between HF GLM-MoE-DSA checkpoints and native format.

API

class nemo_automodel.components.models.glm_moe_dsa.state_dict_adapter.GlmMoeDsaStateDictAdapter()

Bases: Glm4MoeStateDictAdapter

Converts between HF GLM-MoE-DSA checkpoints and native format.

Extends Glm4MoeStateDictAdapter with handling for the DSA indexer weights that should not be quantized (k_norm, weights_proj).

_indexer_non_quantized_keys
nemo_automodel.components.models.glm_moe_dsa.state_dict_adapter.GlmMoeDsaStateDictAdapter.convert_single_tensor_to_hf(
fqn: str,
tensor: typing.Any,
kwargs = {}
) -> list[tuple[str, typing.Any]]