bridge.perf_recipes.qwen.b200.qwen3_moe#
B200 performance recipes for Qwen3 MoE.
Module Contents#
Functions#
Qwen3 235B-A22B pretrain: 64× B200, BF16, PP=8 VP=4 EP=8. |
|
Qwen3 235B-A22B pretrain: 64× B200, FP8 current-scaling, PP=8 EP=8. |
|
Qwen3 235B-A22B pretrain: 64× B200, MXFP8, PP=8 EP=8. |
|
Qwen3 30B-A3B pretrain: 8× B200, BF16, EP=8. |
|
Qwen3 30B-A3B pretrain: 8× B200, FP8 current-scaling, EP=8. |
|
Qwen3 30B-A3B pretrain: 8× B200, MXFP8, EP=8. |
|
Qwen3 235B-A22B pretrain: 256× B200, BF16, PP=8 VP=4 EP=8. |
|
Qwen3 235B-A22B pretrain: 256× B200, FP8 current-scaling, PP=8 EP=8. |
|
Qwen3 235B-A22B pretrain: 256× B200, MXFP8, PP=8 EP=8. |
|
|
Qwen3 235B A22B pretrain: 256× B200, FP8-MX, large-scale proxy (GBS=512). |
Qwen3 235B A22B pretrain: 64× B200, NVFP4 (same layout as FP8-CS). |
|
Qwen3 235B A22B pretrain: 256× B200, NVFP4 (same layout as FP8-CS). |
|
Qwen3 30B-A3B pretrain: 64× B200, BF16, legacy-scaled GBS. |
|
Qwen3 30B-A3B pretrain: 64× B200, FP8 current-scaling, legacy-scaled GBS. |
|
Qwen3 Next 80B-A3B pretrain: 64× B200, BF16 (same layout as B300 BF16). |
|
Qwen3 Next 80B-A3B pretrain: 64× B200, MXFP8, EP=64, deepep, MBS=1. |
API#
- bridge.perf_recipes.qwen.b200.qwen3_moe.qwen3_235b_a22b_pretrain_64gpu_b200_bf16_config() megatron.bridge.perf_recipes.qwen.common.ConfigContainer#
Qwen3 235B-A22B pretrain: 64× B200, BF16, PP=8 VP=4 EP=8.
- bridge.perf_recipes.qwen.b200.qwen3_moe.qwen3_235b_a22b_pretrain_64gpu_b200_fp8cs_config() megatron.bridge.perf_recipes.qwen.common.ConfigContainer#
Qwen3 235B-A22B pretrain: 64× B200, FP8 current-scaling, PP=8 EP=8.
- bridge.perf_recipes.qwen.b200.qwen3_moe.qwen3_235b_a22b_pretrain_64gpu_b200_fp8mx_config() megatron.bridge.perf_recipes.qwen.common.ConfigContainer#
Qwen3 235B-A22B pretrain: 64× B200, MXFP8, PP=8 EP=8.
- bridge.perf_recipes.qwen.b200.qwen3_moe.qwen3_30b_a3b_pretrain_8gpu_b200_bf16_config() megatron.bridge.perf_recipes.qwen.common.ConfigContainer#
Qwen3 30B-A3B pretrain: 8× B200, BF16, EP=8.
- bridge.perf_recipes.qwen.b200.qwen3_moe.qwen3_30b_a3b_pretrain_8gpu_b200_fp8cs_config() megatron.bridge.perf_recipes.qwen.common.ConfigContainer#
Qwen3 30B-A3B pretrain: 8× B200, FP8 current-scaling, EP=8.
- bridge.perf_recipes.qwen.b200.qwen3_moe.qwen3_30b_a3b_pretrain_8gpu_b200_fp8mx_config() megatron.bridge.perf_recipes.qwen.common.ConfigContainer#
Qwen3 30B-A3B pretrain: 8× B200, MXFP8, EP=8.
- bridge.perf_recipes.qwen.b200.qwen3_moe.qwen3_235b_a22b_pretrain_256gpu_b200_bf16_config() megatron.bridge.perf_recipes.qwen.common.ConfigContainer#
Qwen3 235B-A22B pretrain: 256× B200, BF16, PP=8 VP=4 EP=8.
- bridge.perf_recipes.qwen.b200.qwen3_moe.qwen3_235b_a22b_pretrain_256gpu_b200_fp8cs_config() megatron.bridge.perf_recipes.qwen.common.ConfigContainer#
Qwen3 235B-A22B pretrain: 256× B200, FP8 current-scaling, PP=8 EP=8.
- bridge.perf_recipes.qwen.b200.qwen3_moe.qwen3_235b_a22b_pretrain_256gpu_b200_fp8mx_config() megatron.bridge.perf_recipes.qwen.common.ConfigContainer#
Qwen3 235B-A22B pretrain: 256× B200, MXFP8, PP=8 EP=8.
- bridge.perf_recipes.qwen.b200.qwen3_moe.qwen3_235b_a22b_pretrain_256gpu_b200_fp8mx_large_scale_config() megatron.bridge.perf_recipes.qwen.common.ConfigContainer#
Qwen3 235B A22B pretrain: 256× B200, FP8-MX, large-scale proxy (GBS=512).
- bridge.perf_recipes.qwen.b200.qwen3_moe.qwen3_235b_a22b_pretrain_64gpu_b200_nvfp4_config() megatron.bridge.perf_recipes.qwen.common.ConfigContainer#
Qwen3 235B A22B pretrain: 64× B200, NVFP4 (same layout as FP8-CS).
- bridge.perf_recipes.qwen.b200.qwen3_moe.qwen3_235b_a22b_pretrain_256gpu_b200_nvfp4_config() megatron.bridge.perf_recipes.qwen.common.ConfigContainer#
Qwen3 235B A22B pretrain: 256× B200, NVFP4 (same layout as FP8-CS).
- bridge.perf_recipes.qwen.b200.qwen3_moe.qwen3_30b_a3b_pretrain_64gpu_b200_bf16_config() megatron.bridge.perf_recipes.qwen.common.ConfigContainer#
Qwen3 30B-A3B pretrain: 64× B200, BF16, legacy-scaled GBS.
- bridge.perf_recipes.qwen.b200.qwen3_moe.qwen3_30b_a3b_pretrain_64gpu_b200_fp8cs_config() megatron.bridge.perf_recipes.qwen.common.ConfigContainer#
Qwen3 30B-A3B pretrain: 64× B200, FP8 current-scaling, legacy-scaled GBS.
- bridge.perf_recipes.qwen.b200.qwen3_moe.qwen3_next_80b_a3b_pretrain_64gpu_b200_bf16_config() megatron.bridge.perf_recipes.qwen.common.ConfigContainer#
Qwen3 Next 80B-A3B pretrain: 64× B200, BF16 (same layout as B300 BF16).
- bridge.perf_recipes.qwen.b200.qwen3_moe.qwen3_next_80b_a3b_pretrain_64gpu_b200_fp8mx_config() megatron.bridge.perf_recipes.qwen.common.ConfigContainer#
Qwen3 Next 80B-A3B pretrain: 64× B200, MXFP8, EP=64, deepep, MBS=1.