Process Data for Image Curation#
Process image data you’ve loaded from tar archives using NeMo Curator’s suite of tools. These tools help you generate embeddings, filter images, and prepare your dataset to produce high-quality data for downstream AI tasks such as generative model training, dataset analysis, or quality control.
How it Works#
Image processing in NeMo Curator follows a pipeline-based approach with these stages:
Partition files using
FilePartitioningStageto distribute tar filesRead images using
ImageReaderStagewith DALI accelerationGenerate embeddings using
ImageEmbeddingStagewith CLIP modelsApply filters using
ImageAestheticFilterStageandImageNSFWFilterStageSave results using
ImageWriterStageto export curated datasets
Each stage processes ImageBatch objects containing images, metadata, and processing results. You can use built-in stages or create custom stages for advanced use cases.
Embedding Options#
Generate image embeddings using CLIP models with GPU acceleration. Supports various CLIP architectures and automatic model downloading.
Filter Options#
Assess the subjective quality of images using a model trained on human aesthetic preferences. Filters images based on aesthetic score thresholds.
Detect not-safe-for-work (NSFW) content in images using a CLIP-based filter. Filters explicit material from your datasets.