Text Curation Concepts

This document covers the essential concepts for text data curation in NVIDIA NeMo Curator. It assumes basic familiarity with data science and machine learning principles.

Core Concept Areas

Text curation in NeMo Curator focuses on these key areas:

Infrastructure Components

The text curation concepts build on NVIDIA NeMo Curator’s core infrastructure components, which are shared across all modalities. These components include: