bridge.models.qwen.qwen3_bridge
Module Contents
Classes
Qwen3Bridge | Megatron Bridge for Qwen3 Causal LM.
API
- class bridge.models.qwen.qwen3_bridge.Qwen3Bridge
Bases: megatron.bridge.models.conversion.model_bridge.MegatronModelBridge
Megatron Bridge for Qwen3 Causal LM.
This bridge converts between the HuggingFace Qwen3ForCausalLM and Megatron-Core GPTModel formats. Qwen3 differs from Qwen2 in that it uses QK layernorm and has no QKV bias.
Example:
    from megatron.bridge import AutoBridge
    bridge = AutoBridge.from_hf_pretrained("Qwen/Qwen3-1.7B")
    provider = bridge.to_megatron_provider()
- provider_bridge(hf_pretrained)
Convert HuggingFace Qwen3 config to GPTModelProvider.
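The kind of translation this method performs can be sketched with plain dictionaries. This is not the bridge's actual implementation: the helper name `sketch_provider_kwargs`, the field pairs, and the toy config values are assumptions for illustration, based on commonly used Hugging Face and Megatron-Core config names. The two fixed flags reflect the Qwen3 traits described above (QK layernorm, no QKV bias).

```python
from typing import Any, Dict

# Assumed pairing of Hugging Face config keys with Megatron-Core style
# field names. Illustrative subset only; the real bridge may set more
# fields or use different ones.
HF_TO_MEGATRON_FIELDS: Dict[str, str] = {
    "num_hidden_layers": "num_layers",
    "hidden_size": "hidden_size",
    "intermediate_size": "ffn_hidden_size",
    "num_attention_heads": "num_attention_heads",
    "num_key_value_heads": "num_query_groups",
    "head_dim": "kv_channels",
    "rms_norm_eps": "layernorm_epsilon",
    "rope_theta": "rotary_base",
}


def sketch_provider_kwargs(hf_config: Dict[str, Any]) -> Dict[str, Any]:
    """Hypothetical helper: build provider keyword arguments from an HF config dict."""
    # Copy over every field that is present, under its Megatron-style name.
    kwargs = {
        megatron_key: hf_config[hf_key]
        for hf_key, megatron_key in HF_TO_MEGATRON_FIELDS.items()
        if hf_key in hf_config
    }
    # Qwen3-specific traits stated in the class description.
    kwargs["qk_layernorm"] = True
    kwargs["add_qkv_bias"] = False
    return kwargs


# Toy values, not those of any released checkpoint.
toy_hf_config = {
    "num_hidden_layers": 2,
    "hidden_size": 64,
    "intermediate_size": 128,
    "num_attention_heads": 4,
    "num_key_value_heads": 2,
    "head_dim": 16,
    "rms_norm_eps": 1e-6,
    "rope_theta": 1000000.0,
}
provider_kwargs = sketch_provider_kwargs(toy_hf_config)
```

Keeping the field pairs in a single table makes it easy to see which settings are copied from the checkpoint and which are fixed by the architecture.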
- mapping_registry() -> megatron.bridge.models.conversion.mapping_registry.MegatronMappingRegistry
Return the MegatronMappingRegistry for Qwen3 parameter conversion.
Covers all Megatron-Core parameter names for both the standard decoder layers and the MTP (Multi-Token Prediction) transformer layers that are present when mtp_num_layers >= 1.
Simple 1:1 renames are expressed as AutoMapping entries. The fused QKV matrix is handled by QKVMapping and the gated MLP gate+up projection by GatedMLPMapping.
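The three kinds of mapping can be illustrated with a small pure-Python sketch. It does not use the bridge's classes: `rename`, `fuse_qkv`, and `fuse_gate_up` are hypothetical helpers, weight matrices are modelled as lists of rows, and the per-group interleaved QKV layout and gate-before-up ordering are assumptions about the Megatron-Core side that should be checked against the actual mapping classes.

```python
from fnmatch import fnmatchcase
from typing import Dict, List, Optional

Row = List[float]  # one output row of a weight matrix

# Illustrative 1:1 renames (Megatron-Core pattern -> Hugging Face pattern).
RENAMES: Dict[str, str] = {
    "decoder.layers.*.self_attention.linear_proj.weight": "model.layers.*.self_attn.o_proj.weight",
    "decoder.layers.*.mlp.linear_fc2.weight": "model.layers.*.mlp.down_proj.weight",
}


def rename(megatron_name: str) -> Optional[str]:
    """Resolve a concrete Megatron name to its HF name by carrying over the layer index."""
    for src, dst in RENAMES.items():
        if fnmatchcase(megatron_name, src):
            prefix, suffix = src.split("*")
            layer = megatron_name[len(prefix):len(megatron_name) - len(suffix)]
            return dst.replace("*", layer)
    return None  # no simple rename applies


def fuse_qkv(q: List[Row], k: List[Row], v: List[Row],
             num_heads: int, num_query_groups: int, head_dim: int) -> List[Row]:
    """Interleave rows per query group: [q heads of group g, k of g, v of g] (assumed layout)."""
    assert len(q) == num_heads * head_dim
    assert len(k) == len(v) == num_query_groups * head_dim
    q_rows_per_group = (num_heads // num_query_groups) * head_dim
    fused: List[Row] = []
    for g in range(num_query_groups):
        fused.extend(q[g * q_rows_per_group:(g + 1) * q_rows_per_group])
        fused.extend(k[g * head_dim:(g + 1) * head_dim])
        fused.extend(v[g * head_dim:(g + 1) * head_dim])
    return fused


def fuse_gate_up(gate: List[Row], up: List[Row]) -> List[Row]:
    """Stack gate rows on top of up rows to form one fused projection (assumed ordering)."""
    return gate + up


# Toy example: 4 query heads, 2 query groups, head_dim 1, hidden size 1.
q = [[0.0], [1.0], [2.0], [3.0]]
k = [[10.0], [11.0]]
v = [[20.0], [21.0]]
fused_qkv = fuse_qkv(q, k, v, num_heads=4, num_query_groups=2, head_dim=1)
```

Renames only change the parameter name, while the two fused cases also change the tensor contents, which is why they need dedicated mapping types rather than plain rename entries.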