Process Data

Use NeMo Curator stages to split videos into clips, encode them, generate embeddings or captions, and remove duplicates.

How it Works

Create a Pipeline and add stages for clip extraction, optional re-encoding and filtering, embeddings or captions, previews, and writing outputs. Each stage is modular and configurable to match your quality and performance needs.
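The modular pattern described above can be illustrated with a small self-contained sketch. The `ToyStage` and `ToyPipeline` classes below are hypothetical stand-ins written for this illustration, not the NeMo Curator API. They only show how stages added in order each receive the previous stage's output.

```python
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class ToyStage:
    """Hypothetical stage: a name plus a function from items to items."""
    name: str
    fn: Callable[[List[str]], List[str]]


@dataclass
class ToyPipeline:
    """Hypothetical pipeline that runs its stages in insertion order."""
    stages: List[ToyStage] = field(default_factory=list)

    def add_stage(self, stage: ToyStage) -> "ToyPipeline":
        self.stages.append(stage)
        return self

    def run(self, items: List[str]) -> List[str]:
        # Feed the output of each stage into the next one.
        for stage in self.stages:
            items = stage.fn(items)
        return items


pipeline = ToyPipeline()
# Stand-in for clip extraction: two "clips" per input video.
pipeline.add_stage(
    ToyStage("split", lambda videos: [f"{v}#clip{i}" for v in videos for i in range(2)])
)
# Stand-in for filtering: drop every clip whose name ends in "clip1".
pipeline.add_stage(
    ToyStage("filter", lambda clips: [c for c in clips if not c.endswith("clip1")])
)

result = pipeline.run(["a.mp4", "b.mp4"])
```

Because each stage depends only on the list it receives, a stage can be swapped or removed without touching the others, which is the property the real stages are described as having.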

Processing Options

Choose from the following stages to split, encode, filter, embed, caption, preview, and remove duplicates in your videos:

Write Outputs

Persist clips, embeddings, previews, and metadata at the end of the pipeline using ClipWriterStage. Refer to Save & Export for directory layout and examples.

Example (place as the final stage):

from nemo_curator.stages.video.io.clip_writer import ClipWriterStage

# `pipeline`, OUT_DIR, and VIDEO_DIR are assumed to be defined earlier.
pipeline.add_stage(
    ClipWriterStage(
        output_path=OUT_DIR,
        input_path=VIDEO_DIR,
        upload_clips=True,
        dry_run=False,
        generate_embeddings=True,
        generate_previews=False,
        generate_captions=False,
        embedding_algorithm="cosmos-embed1-224p",
        caption_models=[],
        enhanced_caption_models=[],
        verbose=True,
    )
)

Path helpers are available to resolve common locations (such as clips/, filtered_clips/, previews/, metas/v0/, and ce1_embd_parquet/).
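
As a rough illustration of what resolving these locations involves, the sketch below maps each output kind to the subdirectory names listed above. The function `resolve_output_dir` and the keys of `OUTPUT_SUBDIRS` are hypothetical names made up for this example. They are not the library's path helpers, whose names and signatures are documented in Save & Export.

```python
from pathlib import Path

# Subdirectory names come from the list above; the keys are hypothetical labels.
OUTPUT_SUBDIRS = {
    "clips": "clips",
    "filtered_clips": "filtered_clips",
    "previews": "previews",
    "metas": "metas/v0",
    "ce1_embeddings": "ce1_embd_parquet",
}


def resolve_output_dir(output_path: str, kind: str) -> Path:
    """Return the subdirectory for one kind of output under output_path."""
    if kind not in OUTPUT_SUBDIRS:
        raise ValueError(f"unknown output kind: {kind!r}")
    return Path(output_path) / OUTPUT_SUBDIRS[kind]


# Example: where per-clip metadata would live under a given output root.
metas_dir = resolve_output_dir("/data/out", "metas")
```

Keeping the layout in one mapping means downstream code asks for a kind of output rather than hard-coding directory strings.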