bridge.perf_recipes.llama.r100.llama3#
R100 performance recipes for Llama 3.
Module Contents#
Functions#
Llama3 8B pretrain: 8× R100, BF16. |
|
Llama3 8B pretrain: 8× R100, FP8 current-scaling. |
|
Llama3 8B pretrain: 8× R100, MXFP8. |
|
Llama3 8B pretrain: 8× R100, NVFP4. |
API#
- bridge.perf_recipes.llama.r100.llama3.llama3_8b_pretrain_8gpu_r100_bf16_config() megatron.bridge.perf_recipes.llama.common.ConfigContainer#
Llama3 8B pretrain: 8× R100, BF16.
- bridge.perf_recipes.llama.r100.llama3.llama3_8b_pretrain_8gpu_r100_fp8cs_config() megatron.bridge.perf_recipes.llama.common.ConfigContainer#
Llama3 8B pretrain: 8× R100, FP8 current-scaling.
- bridge.perf_recipes.llama.r100.llama3.llama3_8b_pretrain_8gpu_r100_fp8mx_config() megatron.bridge.perf_recipes.llama.common.ConfigContainer#
Llama3 8B pretrain: 8× R100, MXFP8.
- bridge.perf_recipes.llama.r100.llama3.llama3_8b_pretrain_8gpu_r100_nvfp4_config() megatron.bridge.perf_recipes.llama.common.ConfigContainer#
Llama3 8B pretrain: 8× R100, NVFP4.