For AI agents: a documentation index is available at the root level at /llms.txt and /llms-full.txt. Append /llms.txt to any URL for a page-level index, or .md for the markdown version of any page.
DocumentationAPI Reference
DocumentationAPI Reference
  • Home
    • Welcome
  • About NeMo Curator
    • Overview
    • Key Features
  • Get Started
    • Overview
    • Install (All Modalities)
    • Text Quickstart
    • Image Quickstart
    • Video Quickstart
    • Audio Quickstart
  • Curate Text
    • Overview
    • Tutorials
    • Save and Export
  • Curate Images
    • Overview
    • Save and Export
  • Curate Video
    • Overview
      • Overview
      • Beginner Tutorial
      • Split and Dedup
        • Overview
        • Add Custom Environment
        • Add Custom Code
        • Add Custom Model
        • Add Custom Stage
    • Load Data
    • Save and Export
  • Curate Audio
    • Overview
    • Save and Export
  • Setup & Deployment
    • Overview
  • Reference
    • Overview
    • Related Tools
NVIDIANVIDIA
Developer-friendly docs for your API
Privacy Policy | Your Privacy Choices | Terms of Service | Accessibility | Corporate Policies | Product Security | Contact

Copyright © 2026, NVIDIA Corporation.

LogoLogoNeMo Curator
On this page
  • Before You Start
  • How to Add Dependencies with a Dockerfile
  • Define Build Steps
  • Build the Container
  • Next Steps
Curate VideoTutorialsPipeline Customization

Add Custom Environment

||View as Markdown|
Previous

Overview

Next

Add Custom Code

Learn how to package dependencies for NeMo Curator using a container image.

The NeMo Curator container provides a primary curator conda environment with pre-installed dependencies. If your pipeline needs additional system or Python packages, create a custom image. Refer to the container environments reference for defaults and build arguments.

Before You Start

Before you begin, make sure that you have:

  • Reviewed the pipeline concepts and diagrams.
  • A base container image suitable for NeMo Curator.
  • Optionally created custom code that defines your new requirements.

How to Add Dependencies with a Dockerfile

Define Build Steps

  1. Create an environments directory anywhere on your system to organize your custom pipeline stage environments.

  2. Create a new folder for your environment, for example: my-env/.

  3. Create a Dockerfile that installs your environment’s dependencies on top of the base image.

    1FROM <your-base-image>
    2
    3# System deps
    4RUN apt-get update && apt-get install -y --no-install-recommends \
    5 wget git && rm -rf /var/lib/apt/lists/*
    6
    7# Python deps (prefer aligning with pyproject optional extras)
    8# Example: install video stacks used by NeMo Curator
    9RUN pip install --no-cache-dir \
    10 av==13.1.0 opencv-python einops easydict
    11
    12# Optional: CV-CUDA (choose wheel that matches your CUDA/Python)
    13# RUN pip install cvcuda_cu12
    14
    15# Optional: vLLM / FlashAttention (see platform markers in pyproject)
    16# RUN pip install vllm==0.9.2 flash-attn&lt;=2.8.3
    17
    18# Copy your code if needed
    19COPY . /workspace
    20WORKDIR /workspace
  4. Save the file.

Build the Container

Build and tag your image using Docker or your preferred tool:

$docker build -t my-ray-curator:latest .

Next Steps

Now that you have created a custom environment, you can create custom code for that environment.