bridge.recipes.ministral3.ministral3#
Ministral3 finetuning recipes with parameterless API.
This module provides SFT and PEFT configurations for Ministral3 models (3B, 8B, 14B).
Module Contents#
Functions#
Return a full SFT config for Ministral3 3B. |
|
Return a full SFT config for Ministral3 8B. |
|
Return a full SFT config for Ministral3 14B. |
|
Return a PEFT config for Ministral3 3B. |
|
Return a PEFT config for Ministral3 8B. |
|
Return a PEFT config for Ministral3 14B. |
API#
- bridge.recipes.ministral3.ministral3.ministral3_3b_sft_config() megatron.bridge.training.config.ConfigContainer#
Return a full SFT config for Ministral3 3B.
Default configuration: 1 node, 8 GPUs
TP=1, PP=1
LR=5e-6 (full SFT)
Sequence length: 4096
- bridge.recipes.ministral3.ministral3.ministral3_8b_sft_config() megatron.bridge.training.config.ConfigContainer#
Return a full SFT config for Ministral3 8B.
Default configuration: 1 node, 8 GPUs
TP=2, PP=1
LR=5e-6 (full SFT)
Sequence length: 4096
- bridge.recipes.ministral3.ministral3.ministral3_14b_sft_config() megatron.bridge.training.config.ConfigContainer#
Return a full SFT config for Ministral3 14B.
Default configuration: 1 node, 8 GPUs
TP=4, PP=1
LR=5e-6 (full SFT)
Sequence length: 4096
- bridge.recipes.ministral3.ministral3.ministral3_3b_peft_config(
- peft_scheme: str | megatron.bridge.peft.base.PEFT = 'lora',
Return a PEFT config for Ministral3 3B.
Default configuration: 1 node, 8 GPUs
TP=1, PP=1
LR=1e-4 (PEFT)
Sequence length: 4096
- Parameters:
peft_scheme – PEFT scheme - “lora”, “dora”, or a custom PEFT instance.
- bridge.recipes.ministral3.ministral3.ministral3_8b_peft_config(
- peft_scheme: str | megatron.bridge.peft.base.PEFT = 'lora',
Return a PEFT config for Ministral3 8B.
Default configuration: 1 node, 8 GPUs
TP=1, PP=1
LR=1e-4 (PEFT)
Sequence length: 4096
- Parameters:
peft_scheme – PEFT scheme - “lora”, “dora”, or a custom PEFT instance.
- bridge.recipes.ministral3.ministral3.ministral3_14b_peft_config(
- peft_scheme: str | megatron.bridge.peft.base.PEFT = 'lora',
Return a PEFT config for Ministral3 14B.
Default configuration: 1 node, 8 GPUs
TP=2, PP=1
LR=1e-4 (PEFT)
Sequence length: 4096
- Parameters:
peft_scheme – PEFT scheme - “lora”, “dora”, or a custom PEFT instance.