nemo_curator.stages.interleaved.filter.clip_score_filter
nemo_curator.stages.interleaved.filter.clip_score_filter
Module Contents
Classes
Functions
Data
API
Dataclass
Bases: BaseInterleavedFilterStage
Filter interleaved image rows by CLIP image-text relevance score.
For each image row, all text rows with the same sample_id form (image, text) pairs. CLIP similarity is computed for each pair. An image is kept only if at least one pair has score >= min_score; otherwise it is dropped.
min_score
model_dir
name
resources
Download the weights for the CLIP model on the node.
Decode image bytes per row; clear keep_mask entries where decode fails.
Return list of text_content from all text rows for the given sample_id (non-empty).