bridge.models.nemotron_omni.nemotron_omni_bridge#
Nemotron Omni bridge.
Standalone bridge for the Nemotron-3 Omni family (HF architecture
`NemotronH_Nano_Omni_Reasoning_V3`). Inherits the language / vision /
mamba parameter mappings from :class:`NemotronVLBridge` and adds:
- Omni-specific `CONFIG_MAPPING` entries (Mamba shape fields used by the hybrid LLM and the MoE shared-expert intermediate size).
- An overridden :meth:`provider_bridge` that produces a :class:`NemotronOmniModelProvider` (MoE language model + RADIO ViT vision + optional Parakeet sound encoder) instead of the dense VL provider.
- A :meth:`mapping_registry` override that adds the temporal `video_embedder` parameter and the sound projection / sound encoder parameters (the latter via a single `**` wildcard, since the Megatron sound encoder is HF transformers’ `ParakeetEncoder` and the parameter names line up 1:1 with `sound_encoder.encoder.*`).
- `ADDITIONAL_FILE_PATTERNS` covering the bespoke Omni HF modeling / processing / audio files that need to be copied during HF export.
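The single `**` wildcard described above works because both sides share the same parameter suffixes. The following is a minimal, library-free sketch of that idea and not the bridge's actual mapping class. The helper `translate_with_wildcard`, the Megatron-side prefix `sound_model.encoder`, and the example parameter name are all hypothetical; only the `sound_encoder.encoder.*` namespace comes from the docstring.

```python
import re

# `sound_encoder.encoder.` is taken from the docstring; the Megatron-side
# prefix below is a HYPOTHETICAL placeholder chosen for illustration only.
HF_PATTERN = "sound_encoder.encoder.**"
MEGATRON_PATTERN = "sound_model.encoder.**"


def translate_with_wildcard(name, src_pattern, dst_pattern):
    """Rename `name` from src_pattern to dst_pattern.

    Both patterns are expected to contain exactly one `**`, which stands
    for an arbitrary dotted suffix shared by the two naming schemes.
    """
    # Escape the literal part of the pattern, then turn `**` into a capture group.
    regex = re.escape(src_pattern).replace(re.escape("**"), "(.+)")
    match = re.fullmatch(regex, name)
    if match is None:
        return None  # this mapping does not apply to the given name
    # Re-attach the captured suffix unchanged: names line up 1:1.
    return dst_pattern.replace("**", match.group(1))


# Hypothetical parameter name, used only to exercise the helper.
example = translate_with_wildcard(
    "sound_encoder.encoder.layers.0.norm.weight", HF_PATTERN, MEGATRON_PATTERN
)
print(example)
```

Because the suffix is copied verbatim, one pattern pair covers every parameter of the encoder, which is why no per-layer mapping entries are needed in this scheme.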
Module Contents#
Classes#
NemotronOmniBridge | Bridge for Nemotron-3 Omni (MoE LLM + vision + optional sound) models. |
API#
- class bridge.models.nemotron_omni.nemotron_omni_bridge.NemotronOmniBridge#
Bases: megatron.bridge.models.nemotron_vl.nemotron_vl_bridge.NemotronVLBridge
Bridge for Nemotron-3 Omni (MoE LLM + vision + optional sound) models.
- CONFIG_MAPPING#
None
- ADDITIONAL_FILE_PATTERNS#
['modeling*.py', 'configuration*.py', 'processing*.py', 'processing_utils.py', 'image_processing*.py…
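To illustrate how glob-style patterns like these can select files for export, here is a small sketch using Python's standard `fnmatch` module. It assumes shell-style matching against bare file names, which this page does not confirm. The helper `files_to_copy` and the candidate file names are hypothetical, and only the patterns visible in the truncated listing are used.

```python
from fnmatch import fnmatchcase

# Only the patterns visible in the truncated listing; the real list is longer.
VISIBLE_PATTERNS = [
    "modeling*.py",
    "configuration*.py",
    "processing*.py",
    "processing_utils.py",
    "image_processing*.py",
]


def files_to_copy(filenames, patterns=VISIBLE_PATTERNS):
    """Return the file names that match at least one glob pattern."""
    return [
        name
        for name in filenames
        if any(fnmatchcase(name, pattern) for pattern in patterns)
    ]


# HYPOTHETICAL checkpoint directory contents, for illustration only.
candidates = [
    "modeling_omni.py",
    "configuration_omni.py",
    "image_processing_omni.py",
    "README.md",
    "model.safetensors",
]
selected = files_to_copy(candidates)
print(selected)
```

In this sketch the custom Python files are picked up while weights and documentation are left to other export steps.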
- provider_bridge(hf_pretrained: megatron.bridge.models.hf_pretrained.vlm.PreTrainedVLM)
Create a NemotronOmniModelProvider from the HF Omni config.
Always returns an Omni provider (MoE language model + RADIO ViT vision + optional Parakeet sound encoder). When `sound_config` is absent on the HF config, `has_sound=False` and the sound branch is skipped at construction time.
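The `has_sound` rule described above can be sketched as a plain attribute check. This is an illustration under assumptions, not the bridge's code: it treats both a missing `sound_config` attribute and a `None` value as "absent", and the helper `detect_has_sound` and the stand-in config objects are hypothetical.

```python
from types import SimpleNamespace


def detect_has_sound(hf_config):
    """Return True only when the config carries a usable `sound_config`."""
    # ASSUMPTION: a missing attribute and an explicit None both mean "absent".
    return getattr(hf_config, "sound_config", None) is not None


# HYPOTHETICAL stand-ins for an HF config object; field values are made up.
config_with_sound = SimpleNamespace(sound_config=SimpleNamespace(hidden_size=1024))
config_without_sound = SimpleNamespace()

print(detect_has_sound(config_with_sound))
print(detect_has_sound(config_without_sound))
```

Deciding this once while building the provider means a checkpoint without audio support never needs to construct the sound branch.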
- mapping_registry() → megatron.bridge.models.conversion.mapping_registry.MegatronMappingRegistry#
Inherit VL mappings and add temporal video, sound projection, and sound encoder mappings.
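The "inherit, then extend" pattern can be sketched with toy classes. Everything in this block is hypothetical: the classes `ToyVLBridge` and `ToyOmniBridge`, the list-of-pairs registry, and every parameter name are placeholders, and the real method returns a `MegatronMappingRegistry` rather than a list.

```python
class ToyVLBridge:
    """HYPOTHETICAL stand-in for the parent VL bridge."""

    def mapping_registry(self):
        # Each entry pairs a Megatron-side pattern with an HF-side pattern.
        return [
            ("language_model.**", "language_model.**"),
            ("vision_model.**", "vision_model.**"),
        ]


class ToyOmniBridge(ToyVLBridge):
    """HYPOTHETICAL stand-in showing how the Omni bridge could extend the parent."""

    def mapping_registry(self):
        # Start from the inherited mappings so VL behaviour is unchanged.
        mappings = list(super().mapping_registry())
        # Append the Omni-only groups named in the docstring; names are made up.
        mappings += [
            ("video_embedder.weight", "video_embedder.weight"),
            ("sound_projection.**", "sound_projection.**"),
            ("sound_model.encoder.**", "sound_encoder.encoder.**"),
        ]
        return mappings


omni_mappings = ToyOmniBridge().mapping_registry()
print(len(omni_mappings))
```

Copying the parent's list before appending keeps the parent's registry untouched, so a VL bridge and an Omni bridge can coexist in the same process.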