nemo_curator.models.nemotron_h_vl
nemo_curator.models.nemotron_h_vl
Module Contents
Classes
Data
API
Bases: ModelInterface
NemotronH hybrid Mamba-Attention VLM for video captioning.
Supports multiple checkpoint variants from HuggingFace:
- nemotron / nemotron-bf16: BF16 precision (default)
- nemotron-fp8: FP8 quantized
- nemotron-nvfp4: NVFP4 quantized
Models are automatically downloaded from HuggingFace on first use.
_hf_model_id
_normalized_variant
model_id_names
Return HuggingFace model ID for the selected variant.
stage2_prompt
weight_file
Create a refined prompt for stage 2 captioning.
classmethod
Download NemotronH VL weights from HuggingFace.
Models are automatically downloaded from HuggingFace Hub on first use. Supports multiple quantization variants for different performance/memory tradeoffs.
Parameters:
model_dir
Base directory for model weights. The model will be downloaded to a subdirectory named after the HuggingFace model ID.
variant
Model variant to download. Options:
- “nemotron” or “nemotron-bf16”: BF16 precision (default)
- “nemotron-fp8”: FP8 quantized
- “nemotron-nvfp4”: NVFP4 quantized