bridge.perf_recipes.gpt_oss.common#
Common helpers for gpt_oss performance recipes.
Module Contents#
Functions#
Apply legacy GPT-OSS 120B FP8-MX full-iteration CUDA graph settings. |
|
Return legacy GPT-OSS 20B MXFP8 perf precision settings. |
|
Return legacy GPT-OSS 20B NVFP4 perf precision settings. |
|
Apply legacy GPT-OSS 20B perf defaults. |
|
Apply GPT-OSS 20B Transformer Engine graph capture defaults. |
|
Apply GPT-OSS 20B local full-iteration graph capture defaults. |
API#
- bridge.perf_recipes.gpt_oss.common._apply_gpt_oss_120b_full_iter_fp8mx_configs(
- cfg: megatron.bridge.training.config.ConfigContainer,
Apply legacy GPT-OSS 120B FP8-MX full-iteration CUDA graph settings.
- bridge.perf_recipes.gpt_oss.common._gpt_oss_20b_fp8mx_precision()#
Return legacy GPT-OSS 20B MXFP8 perf precision settings.
- bridge.perf_recipes.gpt_oss.common._gpt_oss_20b_nvfp4_precision()#
Return legacy GPT-OSS 20B NVFP4 perf precision settings.
- bridge.perf_recipes.gpt_oss.common._apply_gpt_oss_20b_common_configs(
- cfg: megatron.bridge.training.config.ConfigContainer,
Apply legacy GPT-OSS 20B perf defaults.
- bridge.perf_recipes.gpt_oss.common._apply_gpt_oss_20b_transformer_engine_graph_configs(
- cfg: megatron.bridge.training.config.ConfigContainer,
Apply GPT-OSS 20B Transformer Engine graph capture defaults.
- bridge.perf_recipes.gpt_oss.common._apply_gpt_oss_20b_local_graph_configs(
- cfg: megatron.bridge.training.config.ConfigContainer,
Apply GPT-OSS 20B local full-iteration graph capture defaults.