Nemotron-3-Nano-Omni
Nemotron-3-Nano-Omni-30B-A3B-Reasoning is NVIDIA’s omnimodal reasoning model. It pairs a NemotronH (hybrid Mamba-2 + Attention) MoE language backbone with a RADIO v2.5-H vision encoder and a Parakeet (FastConformer) sound encoder, supporting interleaved text, image, and audio inputs.
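To make "interleaved text, image, and audio inputs" concrete, here is a minimal sketch of how one mixed-modality turn could be represented and which component would handle each part. The message schema (`type`, `text`, `path`), the file names, and the `route_parts` helper are hypothetical and chosen for illustration only; they are not the model's or NeMo AutoModel's actual input format.

```python
# Illustrative only: a hypothetical schema for one interleaved multimodal turn.
# Field names and file paths are assumptions, not the model's real input API.
conversation_turn = [
    {"type": "text", "text": "Describe what you see and hear."},
    {"type": "image", "path": "frame_0001.png"},
    {"type": "audio", "path": "clip_0001.wav"},
    {"type": "text", "text": "Does the audio match the scene?"},
]

# Component names come from the description above; the mapping itself is a sketch.
COMPONENT_FOR = {
    "text": "language backbone (NemotronH)",
    "image": "vision encoder (RADIO v2.5-H)",
    "audio": "sound encoder (Parakeet)",
}


def route_parts(parts):
    """Return (modality, component) pairs in the original interleaved order."""
    routed = []
    for part in parts:
        kind = part["type"]
        if kind not in COMPONENT_FOR:
            raise ValueError(f"unsupported modality: {kind!r}")
        routed.append((kind, COMPONENT_FOR[kind]))
    return routed


for kind, component in route_parts(conversation_turn):
    print(f"{kind:>5} -> {component}")
```

The point of the sketch is that ordering is preserved: parts are not grouped by modality, so the image and audio stay in position between the two text segments.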
Available Models
- Nemotron-3-Nano-Omni-30B-A3B-Reasoning: 30B total parameters, 3B activated per token (MoE)
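As a quick worked example of what the "30B total, 3B activated" figures imply, the sketch below computes the share of parameters active in a forward pass. It uses only the nominal counts listed above; the `active_fraction` helper is a hypothetical name, and real per-token compute also depends on details not covered here.

```python
def active_fraction(total_params_b: float, active_params_b: float) -> float:
    """Fraction of parameters that are active, given counts in billions."""
    if total_params_b <= 0 or not (0 <= active_params_b <= total_params_b):
        raise ValueError("expected 0 <= active <= total and total > 0")
    return active_params_b / total_params_b


# Nominal figures from the model list: 30B total, 3B activated.
frac = active_fraction(30.0, 3.0)
print(f"Active share: {frac:.0%}")  # prints "Active share: 10%"
```

In other words, roughly one tenth of the weights participate in any single forward step, which is why an MoE model of this size can run with compute closer to that of a much smaller dense model while still requiring memory for all 30B parameters.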
Architecture
NemotronH_Nano_Omni_Reasoning_V3
Example Recipes
Try with NeMo AutoModel
1. Install (NeMo AutoModel):
2. Clone the repo to get the example recipes:
3. Run the recipe from inside the repo (8x H100 example):
For a full walkthrough (dataset preparation, SFT vs. LoRA configs, and post-training inference), see the Nemotron-Omni guide.