SDG Reference#
Complete specifications for the SDG pipeline. For pipeline overview and when to use it, refer to About Synthetic Data Generation.
Config Schema
All YAML fields: top-level settings, seed dataset, model aliases, column types, and output projections.
CLI Reference
nemotron steps run sdg/data_designer flags and hydra override syntax.
Output Projections
The three projection shapes with annotated JSONL examples.
Troubleshooting
Failure modes for local runs and cluster dispatch.