bridge.recipes.gpt.gpt3_175b
Module Contents
Functions
gpt3_175b_pretrain_config | Return a pre-training config for GPT3-175B.
API
- bridge.recipes.gpt.gpt3_175b.gpt3_175b_pretrain_config() → megatron.bridge.training.config.ConfigContainer
Return a pre-training config for GPT3-175B.
The default configuration is expected to run on 64 nodes with 8 GPUs each (512 GPUs in total). Default parallelism: tensor parallel TP=4, pipeline parallel PP=8, virtual pipeline VP=6, sequence parallel SP=True.
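The defaults above imply a data-parallel size that the page does not state. The following is a minimal sketch of that arithmetic, not part of the recipe itself. The helper names are hypothetical, and it assumes that context parallelism is 1, that VP denotes the number of virtual pipeline stages per pipeline rank, and that the model has 96 transformer layers (the figure usually quoted for GPT-3 175B, which this page does not confirm).

```python
def data_parallel_size(num_nodes, gpus_per_node, tp, pp, cp=1):
    """Hypothetical helper: derive the data-parallel size from the GPU count.

    Assumes world_size = TP * PP * CP * DP, with context parallelism (CP)
    defaulting to 1 because the page does not mention it.
    """
    world_size = num_nodes * gpus_per_node
    model_parallel = tp * pp * cp
    if world_size % model_parallel != 0:
        raise ValueError(
            f"world size {world_size} is not divisible by TP*PP*CP={model_parallel}"
        )
    return world_size // model_parallel


def layers_per_virtual_stage(num_layers, pp, vp):
    """Hypothetical helper: layers held by each virtual pipeline stage.

    Assumes VP is the number of virtual stages per pipeline rank, so the
    layers are split into PP * VP equal chunks.
    """
    chunks = pp * vp
    if num_layers % chunks != 0:
        raise ValueError(f"{num_layers} layers do not split evenly into {chunks} chunks")
    return num_layers // chunks


# Defaults stated on this page: 64 nodes x 8 GPUs, TP=4, PP=8, VP=6.
dp = data_parallel_size(num_nodes=64, gpus_per_node=8, tp=4, pp=8)
print(f"world size: {64 * 8}, data-parallel size: {dp}")

# ASSUMPTION: 96 layers is not stated on this page.
print(f"layers per virtual stage: {layers_per_virtual_stage(96, pp=8, vp=6)}")
```

Under these assumptions the 512 GPUs form 16 data-parallel replicas of 32 GPUs each, and each virtual pipeline stage holds 2 layers. Running on a different node count changes only the data-parallel size, provided the GPU count stays divisible by TP × PP.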