Generate Data#

Transform your data designs into synthetic datasets using NeMo Data Designer’s powerful generation engine. Follow the complete workflow from design validation to large-scale data production, or manage individual generation jobs for targeted data creation.

Note

Two Approaches Available You can interact with Data Designer using either:

  • Direct API: Full control over individual operations and job management

  • DataDesignerClient wrapper: Streamlined workflows with convenience methods like wait_until_done=True and automatic dataset loading

Both approaches are documented throughout this section.


Data Generation#

Create synthetic datasets using Data Designer’s comprehensive generation workflow and job management capabilities.

Data Generation Workflow

Follow the complete workflow from design phase through preview, iteration, and batch generation to create high-quality synthetic datasets.

Generating Data
Manage Data Generation Jobs

Create, monitor, and manage data generation jobs with direct API access for programmatic data creation.

Manage Data Generation Jobs