nemo_curator.models.nemotron_h_vl
Bases: ModelInterface
NemotronH hybrid Mamba-Attention VLM for video captioning.
Supports multiple checkpoint variants from HuggingFace:
Models are automatically downloaded from HuggingFace on first use.
Return HuggingFace model ID for the selected variant.
Create a refined prompt for stage 2 captioning.
Download NemotronH VL weights from HuggingFace.
Models are automatically downloaded from the HuggingFace Hub on first use. Multiple quantization variants are supported, offering different performance/memory tradeoffs.
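The parameter description for the base directory says the weights land in a subdirectory named after the HuggingFace model ID. A minimal sketch of that layout follows, assuming the subdirectory is formed by a plain path join. The helper `expected_weights_dir`, the argument name `model_dir`, and the model ID shown are hypothetical placeholders for illustration, not part of the `nemo_curator` API.

```python
from pathlib import Path

# Hypothetical placeholder ID; substitute the ID of the variant you selected.
EXAMPLE_MODEL_ID = "example-org/example-nemotron-h-vl"


def expected_weights_dir(model_dir: str, model_id: str) -> Path:
    """Return <model_dir>/<model_id>, the assumed location of downloaded weights.

    Assumption: the subdirectory is the model ID joined onto the base
    directory, so an ID of the form "org/name" yields two nested directories.
    """
    return Path(model_dir) / model_id


weights_dir = expected_weights_dir("/data/models", EXAMPLE_MODEL_ID)
print(weights_dir.as_posix())
```

A helper like this can be useful for checking whether weights are already present before triggering a download, though the actual directory naming should be confirmed against the library source.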
Parameters:
Base directory for model weights. The model will be downloaded to a subdirectory named after the HuggingFace model ID.
Model variant to download. Options: