nemo_curator.stages.interleaved.filter.blur_filter

Module Contents

Classes

Name | Description
InterleavedBlurFilterStage | Filter interleaved image rows by sharpness (Laplacian variance); drop blurry images.

Functions

Name | Description
_sharpness_score | Compute Laplacian variance as sharpness score; higher is sharper.

Data

DEFAULT_BLUR_SCORE_THRESHOLD

API

class nemo_curator.stages.interleaved.filter.blur_filter.InterleavedBlurFilterStage(
name: str = 'interleaved_blur_filter',
drop_invalid_rows: bool = True,
score_threshold: float = DEFAULT_BLUR_SCORE_THRESHOLD
)
Dataclass

Bases: BaseInterleavedFilterStage

Filter interleaved image rows by sharpness (Laplacian variance); drop blurry images.

name
str = 'interleaved_blur_filter'
score_threshold
float = DEFAULT_BLUR_SCORE_THRESHOLD

nemo_curator.stages.interleaved.filter.blur_filter.InterleavedBlurFilterStage.content_keep_mask(
task: nemo_curator.tasks.InterleavedBatch,
df: pandas.DataFrame
) -> pandas.Series

nemo_curator.stages.interleaved.filter.blur_filter._sharpness_score(
image: numpy.ndarray,
row_index: collections.abc.Hashable | None = None
) -> float

Compute Laplacian variance as sharpness score; higher is sharper.
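
To make "Laplacian variance" concrete, here is a minimal NumPy-only sketch of the idea. It is not the library's implementation: the helper name `laplacian_variance`, the plain channel mean used for grayscale conversion, and the choice to ignore border pixels are all assumptions made for illustration, so absolute scores may differ from what `_sharpness_score` returns.

```python
import numpy as np


def laplacian_variance(image: np.ndarray) -> float:
    """Hypothetical helper: variance of a 4-neighbour Laplacian response."""
    gray = np.asarray(image, dtype=np.float64)
    if gray.ndim == 3:
        # Assumption: collapse colour channels with an unweighted mean.
        gray = gray[..., :3].mean(axis=2)
    if gray.ndim != 2 or min(gray.shape) < 3:
        raise ValueError("expected an image of at least 3x3 pixels")

    # Discrete Laplacian on interior pixels only:
    # up + down + left + right - 4 * centre.
    lap = (
        gray[:-2, 1:-1]
        + gray[2:, 1:-1]
        + gray[1:-1, :-2]
        + gray[1:-1, 2:]
        - 4.0 * gray[1:-1, 1:-1]
    )
    # Sharp edges produce large positive and negative responses,
    # so a higher variance means a sharper image.
    return float(lap.var())


# A uniform image has no edges; a checkerboard has an edge at every pixel.
flat = np.full((8, 8), 128, dtype=np.uint8)
checker = (np.indices((8, 8)).sum(axis=0) % 2 * 255).astype(np.uint8)

flat_score = laplacian_variance(flat)
checker_score = laplacian_variance(checker)
```

In this sketch the uniform image scores exactly 0.0, while the checkerboard scores far higher, which matches the "higher is sharper" convention stated above.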

nemo_curator.stages.interleaved.filter.blur_filter.DEFAULT_BLUR_SCORE_THRESHOLD: float = 100.0
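
The following sketch shows how a threshold such as 100.0 could turn per-row sharpness scores into a boolean keep mask of the kind `content_keep_mask` returns. The helper `keep_mask_from_scores` is hypothetical, and two details are assumptions rather than documented behavior: that a score equal to the threshold is kept, and that rows with a missing score are dropped.

```python
import pandas as pd

# Value copied from the reference entry above.
DEFAULT_BLUR_SCORE_THRESHOLD = 100.0


def keep_mask_from_scores(
    scores: pd.Series,
    threshold: float = DEFAULT_BLUR_SCORE_THRESHOLD,
) -> pd.Series:
    """Hypothetical helper: True where a row is sharp enough to keep."""
    # Assumption: scores at or above the threshold are kept.
    # NaN compares as False, so rows without a score are dropped.
    return scores >= threshold


# Made-up scores for four image rows, purely for illustration.
scores = pd.Series(
    [12.5, 100.0, 480.0, float("nan")],
    index=["row_a", "row_b", "row_c", "row_d"],
)
mask = keep_mask_from_scores(scores)
kept = scores[mask]
```

Here `row_b` and `row_c` survive the filter. Raising `score_threshold` makes the filter stricter, and lowering it keeps softer images.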