GLM-4.7 and GLM-4.7-Flash#

GLM-4.7 and GLM-4.7-Flash are GLM MoE language models from Zhipu AI. Megatron Bridge supports both checkpoints, with the full GLM-4.7 model handled by the GLM-4.5 bridge registration and the Flash variant handled by a dedicated GLM-4.7-Flash bridge.

Supported Variants#

Variant	Hugging Face ID	Bridge	Notes
GLM-4.7	`zai-org/GLM-4.7`	`GLM45Bridge`	Full MoE model, about 358B total parameters and 32B active
GLM-4.7-Flash	`zai-org/GLM-4.7-Flash`	`GLM47FlashBridge`	MLA + MoE model, about 30B total parameters and 3B active

Architecture Notes#

GLM-4.7 uses the glm4_moe architecture already covered by GLM45Bridge.
GLM-4.7-Flash uses glm4_moe_lite, Multi-Latent Attention, GLM-style MoE routing, and expert-bias routing.
GLM-4.7-Flash stores per-expert gate/up/down tensors in Hugging Face checkpoint format; the bridge maps them into Megatron MoE tensors.
transformers >= 5.0.0rc0 is required for the GLM-4.7 Hugging Face model classes used by these bridges.

Examples#

For checkpoint conversion, inference, hardware notes, and validated parallelism settings, see the GLM-4.7 examples README.