Process Data for Image Curation
Process image data you’ve loaded from tar archives using NeMo Curator’s suite of tools. These tools help you generate embeddings, filter images, and prepare your dataset to produce high-quality data for downstream AI tasks such as generative model training, dataset analysis, or quality control.
How it Works
Image processing in NeMo Curator follows a pipeline-based approach with these stages:
- Partition files using
FilePartitioningStageto distribute tar files - Read images using
ImageReaderStagewith DALI acceleration - Generate embeddings using
ImageEmbeddingStagewith CLIP models - Apply filters using
ImageAestheticFilterStageandImageNSFWFilterStage - Save results using
ImageWriterStageto export curated datasets
Each stage processes ImageBatch objects containing images, metadata, and processing results. You can use built-in stages or create custom stages for advanced use cases.
Embedding Options
Filter Options
Assess the subjective quality of images using a model trained on human aesthetic preferences. Filters images based on aesthetic score thresholds. ImageAestheticFilterStage aesthetic_score
Detect not-safe-for-work (NSFW) content in images using a CLIP-based filter. Filters explicit material from your datasets. ImageNSFWFilterStage nsfw_score