bridge.perf_recipes.deepseek.gb300.deepseek_v3#
GB300 performance recipes for DeepSeek V3.
Module Contents#
Functions#
DeepSeek V3 pretrain: 256× GB300, BF16. |
|
DeepSeek V3 pretrain: 256× GB300, FP8 current-scaling. |
|
DeepSeek V3 pretrain: 256× GB300, MXFP8. |
|
DeepSeek V3 pretrain: 256× GB300, NVFP4. |
|
DeepSeek V3 pretrain: 64× GB300, BF16, Megatron FSDP. |
|
DeepSeek V3 pretrain: 64× GB300, MXFP8, Megatron FSDP. |
|
DeepSeek V3 pretrain: 256× GB300, MXFP8, large-scale proxy (BF16_V1 layout, GBS=256). |
API#
- bridge.perf_recipes.deepseek.gb300.deepseek_v3.deepseek_v3_pretrain_256gpu_gb300_bf16_config() megatron.bridge.perf_recipes.deepseek.common.ConfigContainer#
DeepSeek V3 pretrain: 256× GB300, BF16.
- bridge.perf_recipes.deepseek.gb300.deepseek_v3.deepseek_v3_pretrain_256gpu_gb300_fp8cs_config() megatron.bridge.perf_recipes.deepseek.common.ConfigContainer#
DeepSeek V3 pretrain: 256× GB300, FP8 current-scaling.
- bridge.perf_recipes.deepseek.gb300.deepseek_v3.deepseek_v3_pretrain_256gpu_gb300_fp8mx_config() megatron.bridge.perf_recipes.deepseek.common.ConfigContainer#
DeepSeek V3 pretrain: 256× GB300, MXFP8.
- bridge.perf_recipes.deepseek.gb300.deepseek_v3.deepseek_v3_pretrain_256gpu_gb300_nvfp4_config() megatron.bridge.perf_recipes.deepseek.common.ConfigContainer#
DeepSeek V3 pretrain: 256× GB300, NVFP4.
- bridge.perf_recipes.deepseek.gb300.deepseek_v3.deepseek_v3_pretrain_64gpu_gb300_bf16_fsdp_config() megatron.bridge.perf_recipes.deepseek.common.ConfigContainer#
DeepSeek V3 pretrain: 64× GB300, BF16, Megatron FSDP.
- bridge.perf_recipes.deepseek.gb300.deepseek_v3.deepseek_v3_pretrain_64gpu_gb300_fp8mx_fsdp_config() megatron.bridge.perf_recipes.deepseek.common.ConfigContainer#
DeepSeek V3 pretrain: 64× GB300, MXFP8, Megatron FSDP.
- bridge.perf_recipes.deepseek.gb300.deepseek_v3.deepseek_v3_pretrain_256gpu_gb300_fp8mx_large_scale_config() megatron.bridge.perf_recipes.deepseek.common.ConfigContainer#
DeepSeek V3 pretrain: 256× GB300, MXFP8, large-scale proxy (BF16_V1 layout, GBS=256).