
> Hands-on tutorials for text curation workflows, including quality assessment, with NeMo Curator.

# Text Curation Tutorials

Hands-on tutorials for text curation workflows are available in the [`tutorials/text` directory](https://github.com/NVIDIA-NeMo/Curator/tree/main/tutorials/text) of the NeMo Curator GitHub repository.

## Key Concepts for Tutorial Success

Before diving into the tutorials, familiarize yourself with these essential NeMo Curator concepts:

- Core processing stages and pipeline concepts for text curation workflows (tags: data-structures, distributed)
- Scoring and filtering techniques used in tutorials (tags: heuristics, classifiers)
- Loading data from various sources (tags: common-crawl, custom-data)
- GPU-accelerated classification concepts (tags: gpu, scalable)