API Reference

This section provides API reference documentation for NeMo Curator’s core classes and interfaces.

Core Classes

Pipeline

The main orchestrator for executing sequences of processing stages.

ProcessingStage

Base class for all data processing stages in NeMo Curator.

CompositeStage

High-level stages that decompose into multiple execution stages.
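
The relationship between these three classes can be illustrated with a small, self-contained sketch. It is a toy model of the pattern only: `ToyPipeline`, `ToyStage`, `ToyCompositeStage`, and the methods `add_stage`, `decompose`, `flatten`, `process`, and `run` are hypothetical names chosen for this example and are not NeMo Curator's API. The sketch assumes that a composite stage is expanded into its execution stages before the pipeline runs them in order.

```python
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class ToyStage:
    """Hypothetical stand-in for a processing stage: one named transformation."""
    name: str
    fn: Callable[[List[str]], List[str]]

    def process(self, batch: List[str]) -> List[str]:
        return self.fn(batch)


@dataclass
class ToyCompositeStage:
    """Hypothetical stand-in for a composite stage: expands into simpler stages."""
    name: str
    parts: List[ToyStage]

    def decompose(self) -> List[ToyStage]:
        return list(self.parts)


@dataclass
class ToyPipeline:
    """Hypothetical stand-in for a pipeline: runs stages in order."""
    stages: list = field(default_factory=list)

    def add_stage(self, stage) -> "ToyPipeline":
        self.stages.append(stage)
        return self

    def flatten(self) -> List[ToyStage]:
        # Replace each composite stage with the execution stages it decomposes into.
        flat: List[ToyStage] = []
        for stage in self.stages:
            if isinstance(stage, ToyCompositeStage):
                flat.extend(stage.decompose())
            else:
                flat.append(stage)
        return flat

    def run(self, batch: List[str]) -> List[str]:
        # Feed the output of each execution stage into the next one.
        for stage in self.flatten():
            batch = stage.process(batch)
        return batch


strip = ToyStage("strip", lambda docs: [d.strip() for d in docs])
drop_empty = ToyStage("drop_empty", lambda docs: [d for d in docs if d])
lower = ToyStage("lower", lambda docs: [d.lower() for d in docs])

# One high-level "clean" stage that decomposes into two execution stages.
clean = ToyCompositeStage("clean", [strip, drop_empty])
pipeline = ToyPipeline().add_stage(clean).add_stage(lower)

result = pipeline.run(["  Hello ", "   ", "World"])
stage_names = [s.name for s in pipeline.flatten()]
```

In this toy version the pipeline executes stages itself. In NeMo Curator, execution is handled by an executor such as those listed under Executors; see the class pages for the actual signatures.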

Task Types

DocumentBatch

Task type for text document processing.

ImageBatch

Task type for image processing.

VideoTask

Task type for video processing.

AudioBatch

Task type for audio processing.

Executors

XennaExecutor

Production executor using Cosmos-Xenna for distributed execution.

Experimental Executors

Ray-based experimental executors.

Configuration

Resources

CPU and GPU resource configuration for stages.

Source Code

For complete implementation details, see the NeMo Curator source code on GitHub.

Copyright © 2026, NVIDIA Corporation.
