Image Embedding#
Generate image embeddings for large-scale datasets using NeMo Curator’s built-in and custom embedders. Image embeddings enable downstream tasks such as classification, filtering, duplicate removal, and similarity search.
How It Works#
Image embedding in NeMo Curator typically follows these steps:
Load your dataset using
ImageTextPairDataset
Select and configure an embedder (for example,
TimmImageEmbedder
)Apply the embedder to generate embeddings for each image
Save the resulting dataset with embeddings for downstream use
You can use built-in embedders or implement your own for advanced use cases.
Available Embedding Tools#
Use state-of-the-art models from the PyTorch Image Models (timm) library for embedding generation
Implement your own image embedding logic by subclassing the base class