For AI agents: a documentation index is available at the root level at /llms.txt and /llms-full.txt. Append /llms.txt to any URL for a page-level index, or .md for the markdown version of any page.
DocumentationAPI Reference
DocumentationAPI Reference
  • Home
    • Welcome
  • About NeMo Curator
    • Overview
    • Key Features
  • Get Started
    • Overview
    • Install (All Modalities)
    • Text Quickstart
    • Image Quickstart
    • Video Quickstart
    • Audio Quickstart
  • Curate Text
    • Overview
    • Tutorials
    • Save and Export
  • Curate Images
    • Overview
      • Overview
    • Save and Export
  • Curate Video
    • Overview
    • Load Data
    • Save and Export
  • Curate Audio
    • Overview
    • Save and Export
  • Setup & Deployment
    • Overview
  • Reference
    • Overview
    • Related Tools
NVIDIANVIDIA
Developer-friendly docs for your API
Privacy Policy | Your Privacy Choices | Terms of Service | Accessibility | Corporate Policies | Product Security | Contact

Copyright © 2026, NVIDIA Corporation.

LogoLogoNeMo Curator
On this page
  • How it Works
  • Embedding Options
  • Filter Options
Curate ImagesProcess Data

Process Data for Image Curation

||View as Markdown|

Process image data you’ve loaded from tar archives using NeMo Curator’s suite of tools. These tools help you generate embeddings, filter images, and prepare your dataset to produce high-quality data for downstream AI tasks such as generative model training, dataset analysis, or quality control.

How it Works

Image processing in NeMo Curator follows a pipeline-based approach with these stages:

  1. Partition files using FilePartitioningStage to distribute tar files
  2. Read images using ImageReaderStage with DALI acceleration
  3. Generate embeddings using ImageEmbeddingStage with CLIP models
  4. Apply filters using ImageAestheticFilterStage and ImageNSFWFilterStage
  5. Save results using ImageWriterStage to export curated datasets

Each stage processes ImageBatch objects containing images, metadata, and processing results. You can use built-in stages or create custom stages for advanced use cases.


Embedding Options

CLIP Embedding Stage

Generate image embeddings using CLIP models with GPU acceleration. Supports various CLIP architectures and automatic model downloading. ImageEmbeddingStage CLIP GPU-accelerated

Filter Options

Aesthetic Filter Stage

Assess the subjective quality of images using a model trained on human aesthetic preferences. Filters images based on aesthetic score thresholds. ImageAestheticFilterStage aesthetic_score

NSFW Filter Stage

Detect not-safe-for-work (NSFW) content in images using a CLIP-based filter. Filters explicit material from your datasets. ImageNSFWFilterStage nsfw_score

Previous

TAR Archives

Next

Overview