bridge.perf_recipes.deepseek.gb200.deepseek_v3

GB200 performance recipes for DeepSeek V3.

Module Contents

Functions

deepseek_v3_pretrain_256gpu_gb200_bf16_config

DeepSeek V3 pretrain: 256× GB200, BF16.

deepseek_v3_pretrain_256gpu_gb200_fp8cs_config

DeepSeek V3 pretrain: 256× GB200, FP8 current-scaling.

deepseek_v3_pretrain_256gpu_gb200_fp8mx_config

DeepSeek V3 pretrain: 256× GB200, MXFP8.

deepseek_v3_pretrain_256gpu_gb200_nvfp4_config

DeepSeek V3 pretrain: 256× GB200, NVFP4 (same layout as BF16, MLP recompute).

deepseek_v3_pretrain_256gpu_gb200_fp8mx_large_scale_config

DeepSeek V3 pretrain: 256× GB200, MXFP8, large-scale proxy (GBS=256).

API

bridge.perf_recipes.deepseek.gb200.deepseek_v3.deepseek_v3_pretrain_256gpu_gb200_bf16_config() → megatron.bridge.perf_recipes.deepseek.common.ConfigContainer

DeepSeek V3 pretrain: 256× GB200, BF16.

bridge.perf_recipes.deepseek.gb200.deepseek_v3.deepseek_v3_pretrain_256gpu_gb200_fp8cs_config() → megatron.bridge.perf_recipes.deepseek.common.ConfigContainer

DeepSeek V3 pretrain: 256× GB200, FP8 current-scaling.

bridge.perf_recipes.deepseek.gb200.deepseek_v3.deepseek_v3_pretrain_256gpu_gb200_fp8mx_config() → megatron.bridge.perf_recipes.deepseek.common.ConfigContainer

DeepSeek V3 pretrain: 256× GB200, MXFP8.

bridge.perf_recipes.deepseek.gb200.deepseek_v3.deepseek_v3_pretrain_256gpu_gb200_nvfp4_config() → megatron.bridge.perf_recipes.deepseek.common.ConfigContainer

DeepSeek V3 pretrain: 256× GB200, NVFP4 (same layout as BF16, MLP recompute).

bridge.perf_recipes.deepseek.gb200.deepseek_v3.deepseek_v3_pretrain_256gpu_gb200_fp8mx_large_scale_config() → megatron.bridge.perf_recipes.deepseek.common.ConfigContainer

DeepSeek V3 pretrain: 256× GB200, MXFP8, large-scale proxy (GBS=256).
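
Example

The recipe functions on this page share one naming pattern that differs only in the precision tag and the optional large_scale suffix. The following is a minimal sketch of selecting a recipe by precision. The helpers recipe_name and load_recipe are hypothetical and not part of the library, and the module path is copied from the signatures above; an installed package may expose it under a different prefix, so it is left as a parameter. Each recipe is called with no arguments, as the signatures show.

```python
import importlib

# Module path as printed in the signatures on this page (assumption: the
# installed package may use a longer prefix, so callers can override it).
DEFAULT_MODULE = "bridge.perf_recipes.deepseek.gb200.deepseek_v3"

# Precision tags taken from the function names documented on this page.
PRECISIONS = ("bf16", "fp8cs", "fp8mx", "nvfp4")


def recipe_name(precision: str, large_scale: bool = False) -> str:
    """Build the name of a documented recipe function (hypothetical helper)."""
    if precision not in PRECISIONS:
        raise ValueError(f"unknown precision {precision!r}; expected one of {PRECISIONS}")
    if large_scale and precision != "fp8mx":
        # Only the MXFP8 recipe has a large-scale proxy variant on this page.
        raise ValueError("a large_scale variant is documented only for 'fp8mx'")
    suffix = "_large_scale" if large_scale else ""
    return f"deepseek_v3_pretrain_256gpu_gb200_{precision}{suffix}_config"


def load_recipe(precision: str, large_scale: bool = False,
                module_path: str = DEFAULT_MODULE):
    """Import the recipe module and call the selected zero-argument recipe.

    Returns whatever the recipe returns (a ConfigContainer per this page).
    Requires the library to be installed; raises ImportError otherwise.
    """
    module = importlib.import_module(module_path)
    return getattr(module, recipe_name(precision, large_scale))()


if __name__ == "__main__":
    # Name construction works without the library installed.
    print(recipe_name("nvfp4"))
    print(recipe_name("fp8mx", large_scale=True))
```

With the library installed, load_recipe("bf16") would import the module and return the BF16 configuration. Resolving the function by name keeps the precision choice in one string, which can be convenient when sweeping precisions from a launcher script.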