nemo_automodel.components.models.llama.state_dict_adapter

View as Markdown

State dict adapter for Llama model.

The model uses separate q_proj / k_proj / v_proj / gate_proj / up_proj that match HuggingFace key names exactly, so the adapter is a passthrough (only tied-weight handling is applied in from_hf).

Module Contents

Classes

NameDescription
LlamaStateDictAdapterState dict adapter for Llama models.

Data

logger

API

class nemo_automodel.components.models.llama.state_dict_adapter.LlamaStateDictAdapter(
config: transformers.LlamaConfig
)

State dict adapter for Llama models.

Uses separate projections that match HuggingFace key names exactly, so from_hf / to_hf are simple passthroughs (only tied-weight handling in from_hf).

nemo_automodel.components.models.llama.state_dict_adapter.LlamaStateDictAdapter.from_hf(
hf_state_dict: dict[str, typing.Any],
kwargs = {}
) -> dict[str, typing.Any]
nemo_automodel.components.models.llama.state_dict_adapter.LlamaStateDictAdapter.to_hf(
state_dict: dict[str, typing.Any],
exclude_key_regex: typing.Optional[str] = None,
kwargs = {}
) -> dict[str, typing.Any]
nemo_automodel.components.models.llama.state_dict_adapter.logger = logging.getLogger(__name__)