For AI agents: a documentation index is available at the root level at /llms.txt and /llms-full.txt. Append /llms.txt to any URL for a page-level index, or .md for the markdown version of any page.
DocumentationAPI Reference
DocumentationAPI Reference
  • Home
    • Welcome
  • About NeMo Curator
    • Overview
    • Key Features
  • Get Started
    • Overview
    • Install (All Modalities)
    • Text Quickstart
    • Image Quickstart
    • Video Quickstart
    • Audio Quickstart
  • Curate Text
    • Overview
    • Tutorials
    • Save and Export
  • Curate Images
    • Overview
      • Overview
        • Overview
        • CLIP Embedder
    • Save and Export
  • Curate Video
    • Overview
    • Load Data
    • Save and Export
  • Curate Audio
    • Overview
    • Save and Export
  • Setup & Deployment
    • Overview
  • Reference
    • Overview
    • Related Tools
NVIDIANVIDIA
Developer-friendly docs for your API
Privacy Policy | Your Privacy Choices | Terms of Service | Accessibility | Corporate Policies | Product Security | Contact

Copyright © 2026, NVIDIA Corporation.

LogoLogoNeMo Curator
On this page
  • How It Works
  • Available Embedding Tools
Curate ImagesProcess DataEmbeddings

Image Embedding

||View as Markdown|

Generate image embeddings for large-scale datasets using NeMo Curator’s built-in embedders. Image embeddings enable downstream tasks such as classification, filtering, duplicate removal, and similarity search.

How It Works

Image embedding in NeMo Curator typically follows these steps:

  1. Load your dataset using FilePartitioningStage and ImageReaderStage
  2. Configure the ImageEmbeddingStage with CLIP model settings
  3. Apply the embedding stage to generate CLIP embeddings for each image
  4. Continue with downstream processing stages (filtering, classification, etc.)

The embedding stage integrates seamlessly into NeMo Curator’s pipeline architecture.


Available Embedding Tools

ImageEmbeddingStage

Generate CLIP embeddings using OpenAI’s ViT-L/14 model for high-quality image representations.


Previous

Overview

Next

CLIP Embedder