> For clean Markdown of any page, append .md to the page URL.
> For a complete documentation index, see https://docs.nvidia.com/nemo/curator/llms.txt.
> For full documentation content, see https://docs.nvidia.com/nemo/curator/llms-full.txt.

> Quickstart guides for getting started with NeMo Curator across text, image, and video modalities with minimal setup

# About Getting Started

## Before You Start

Welcome to NeMo Curator! This framework streamlines the curation and pre-processing of large-scale datasets for training generative AI models across text, image, audio and video modalities.

**Who are these quickstarts for?**

* AI/ML engineers and researchers who want to quickly test NeMo Curator's capabilities
* Users looking to run an initial curation pipeline with minimal setup
* Individuals exploring NeMo Curator prior to a full production deployment

**What you'll find here:**
Each quickstart enables you to get started with a specific domain in less than 30 minutes. Quickstarts provide basic installation steps, sample data, and a working example.

<Tip>
  For production deployments, cluster configurations, or detailed system requirements, refer to the [Setup & Deployment documentation](/admin).
</Tip>

***

## Install

Install NeMo Curator once with support for every modality, then jump into a quickstart below.

<Cards>
  <Card title="Install (All Modalities)" href="/get-started/installation">
    Full installation guide covering PyPI, source, and container methods, package extras for each modality, and post-install verification steps.
  </Card>
</Cards>

***

## Modality Quickstarts

The following quickstarts allow you to test NeMo Curator using a selected data modality.

<Cards>
  <Card title="Text Curation Quickstart" href="/get-started/text">
    Set up your environment and execute your first text curation pipeline with NeMo Curator. Instructions cover installation, data preparation, and use of the modular pipeline architecture for efficient large-scale text dataset curation.
  </Card>

  <Card title="Image Curation Quickstart" href="/get-started/image">
    Set up your environment and install the NeMo Curator image modules. The quickstart explains prerequisites, installation methods, and the use of the framework to curate large-scale image-text datasets for generative AI model training.
  </Card>

  <Card title="Video Curation Quickstart" href="/get-started/video">
    Set up your environment and execute your first video curation pipeline. The instructions include prerequisites, installation options, and guidance on splitting, encoding, embedding, and exporting curated video clips at scale.
  </Card>

  <Card title="Audio Curation Quickstart" href="/get-started/audio">
    Set up your environment and execute your first audio curation pipeline with NeMo Curator. Instructions cover installation, data preparation, and use of the modular pipeline architecture for efficient large-scale audio speech dataset curation.
  </Card>
</Cards>