bridge.recipes.gpt_oss.gpt_oss#

Module Contents#

Functions#

gpt_oss_20b_pretrain_config

Return a pre-training config for GPT-OSS 20B variant.

gpt_oss_120b_pretrain_config

Return a pre-training config for GPT-OSS 120B variant.

gpt_oss_20b_sft_config

Return a full SFT config for GPT-OSS 20B.

gpt_oss_120b_sft_config

Return a full SFT config for GPT-OSS 120B.

gpt_oss_20b_peft_config

Return a PEFT config for GPT-OSS 20B.

gpt_oss_120b_peft_config

Return a PEFT config for GPT-OSS 120B.

API#

bridge.recipes.gpt_oss.gpt_oss.gpt_oss_20b_pretrain_config() megatron.bridge.training.config.ConfigContainer#

Return a pre-training config for GPT-OSS 20B variant.

Recommended parallelism: TP=2, PP=4, EP=4

bridge.recipes.gpt_oss.gpt_oss.gpt_oss_120b_pretrain_config() megatron.bridge.training.config.ConfigContainer#

Return a pre-training config for GPT-OSS 120B variant.

Recommended parallelism: TP=2, PP=4, EP=16

bridge.recipes.gpt_oss.gpt_oss.gpt_oss_20b_sft_config() megatron.bridge.training.config.ConfigContainer#

Return a full SFT config for GPT-OSS 20B.

Default parallelism: TP=1, PP=1, EP=8

Returns:

ConfigContainer with all settings pre-configured for GPT-OSS 20B SFT.

bridge.recipes.gpt_oss.gpt_oss.gpt_oss_120b_sft_config() megatron.bridge.training.config.ConfigContainer#

Return a full SFT config for GPT-OSS 120B.

Default parallelism: TP=1, PP=4, EP=8

Returns:

ConfigContainer with all settings pre-configured for GPT-OSS 120B SFT.

bridge.recipes.gpt_oss.gpt_oss.gpt_oss_20b_peft_config(
peft_scheme: str | megatron.bridge.peft.base.PEFT = 'lora',
) megatron.bridge.training.config.ConfigContainer#

Return a PEFT config for GPT-OSS 20B.

Default parallelism: TP=1, PP=1, EP=1

Parameters:

peft_scheme – PEFT scheme - “lora”, “dora”, or a custom PEFT instance.

Returns:

ConfigContainer with all settings pre-configured for GPT-OSS 20B PEFT.

bridge.recipes.gpt_oss.gpt_oss.gpt_oss_120b_peft_config(
peft_scheme: str | megatron.bridge.peft.base.PEFT = 'lora',
) megatron.bridge.training.config.ConfigContainer#

Return a PEFT config for GPT-OSS 120B.

Default parallelism: TP=1, PP=1, EP=8

Parameters:

peft_scheme – PEFT scheme - “lora”, “dora”, or a custom PEFT instance.

Returns:

ConfigContainer with all settings pre-configured for GPT-OSS 120B PEFT.