nemo_automodel.components.models.llama.state_dict_adapter

State dict adapter for Llama model.

The model uses separate q_proj / k_proj / v_proj / gate_proj / up_proj that match HuggingFace key names exactly, so the adapter is a passthrough (only tied-weight handling is applied in from_hf).

Module Contents

Classes

Name	Description
`LlamaStateDictAdapter`	State dict adapter for Llama models.

Data

logger

API

class nemo_automodel.components.models.llama.state_dict_adapter.LlamaStateDictAdapter(
    config: transformers.LlamaConfig
)

State dict adapter for Llama models.

Uses separate projections that match HuggingFace key names exactly, so from_hf / to_hf are simple passthroughs (only tied-weight handling in from_hf).

nemo_automodel.components.models.llama.state_dict_adapter.LlamaStateDictAdapter.from_hf(
    hf_state_dict: dict[str, typing.Any],
    kwargs = {}
) -> dict[str, typing.Any]

nemo_automodel.components.models.llama.state_dict_adapter.LlamaStateDictAdapter.to_hf(
    state_dict: dict[str, typing.Any],
    exclude_key_regex: typing.Optional[str] = None,
    kwargs = {}
) -> dict[str, typing.Any]

nemo_automodel.components.models.llama.state_dict_adapter.logger = logging.getLogger(__name__)