bridge.models.sarvam.sarvam_mla_bridge#
Module Contents#
Classes#
Megatron Hub Bridge for Sarvam MLA Causal LM. |
API#
- class bridge.models.sarvam.sarvam_mla_bridge.SarvamMLABridge#
Bases:
megatron.bridge.models.conversion.model_bridge.MegatronModelBridgeMegatron Hub Bridge for Sarvam MLA Causal LM.
This bridge handles the conversion between HuggingFace SarvamMLAForCausalLM and Megatron-Core GPTModel formats. Sarvam MLA models use multi-latent attention architecture.
- provider_bridge(
- hf_pretrained: megatron.bridge.models.hf_pretrained.causal_lm.PreTrainedCausalLM,
- mapping_registry() megatron.bridge.models.conversion.mapping_registry.MegatronMappingRegistry#