bridge.recipes.nemotron_vl.nemotron_nano_v2_vl#

Nemotron Nano V2 VL finetuning recipes with parameterless API.

This module provides SFT and PEFT configurations for Nemotron Nano V2 VL 12B.

Module Contents#

Functions#

nemotron_nano_v2_vl_12b_sft_config

Return a full SFT config for Nemotron Nano V2 VL 12B.

nemotron_nano_v2_vl_12b_peft_config

Return a PEFT config for Nemotron Nano V2 VL 12B.

API#

bridge.recipes.nemotron_vl.nemotron_nano_v2_vl.nemotron_nano_v2_vl_12b_sft_config() megatron.bridge.training.config.ConfigContainer#

Return a full SFT config for Nemotron Nano V2 VL 12B.

Default configuration: 1 node, 8 GPUs

  • TP=4, PP=1

  • LR=1e-5 (finetune default)

  • Sequence length: 4096

bridge.recipes.nemotron_vl.nemotron_nano_v2_vl.nemotron_nano_v2_vl_12b_peft_config(
peft_scheme: str | megatron.bridge.peft.base.PEFT = 'lora',
) megatron.bridge.training.config.ConfigContainer#

Return a PEFT config for Nemotron Nano V2 VL 12B.

Default configuration: 1 node, 8 GPUs

  • TP=2, PP=1

  • LR=5e-5 (PEFT)

  • Sequence length: 4096

Parameters:

peft_scheme – PEFT scheme - “lora”, “dora”, or a custom PEFT instance. Note: Default uses VLMLoRA targeting all model components.