data_designer.config.data_designer_config
Bases: data_designer.config.exportable_config.ExportableConfigBase
Configuration for NeMo Data Designer.
This class defines the main configuration structure for NeMo Data Designer, which the engine consumes when generating synthetic data.
Parameters:
Required list of column configurations defining how each column should be generated. Must contain at least one column.
Optional list of model configurations for LLM-based generation. Each model config defines the model, provider, and inference parameters.
Optional list of tool configurations for MCP tool calling. Each tool config defines the provider, allowed tools, and execution limits.
Optional seed dataset settings to use for generation.
Optional list of column constraints.
Optional list of column profilers for analyzing generated data characteristics.
Optional list of processor configurations for post-generation transformations.
Attributes:
Required list of column configurations defining how each column should be generated. Must contain at least one column.
Optional list of model configurations for LLM-based generation. Each model config defines the model, provider, and inference parameters.
Optional list of tool configurations for MCP tool calling. Each tool config defines the provider, allowed tools, and execution limits.
Optional seed dataset settings to use for generation.
Optional list of column constraints.
Optional list of column profilers for analyzing generated data characteristics.
Optional list of processor configurations for post-generation transformations.
Initialization:
Create a new model by parsing and validating input data from keyword arguments.
Raises pydantic_core.ValidationError if the input data cannot be validated to form a valid model.
self is explicitly positional-only to allow self as a field name.
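The validate-on-construction behavior described above can be illustrated with a small standard-library stand-in. This is only a sketch: the real class is a Pydantic model and raises pydantic_core.ValidationError, whereas SketchConfig, ConfigValidationError, and the field names columns and constraints below are hypothetical and chosen for illustration.

```python
from dataclasses import dataclass, field
from typing import Any, Dict, List


class ConfigValidationError(ValueError):
    """Hypothetical stand-in for pydantic_core.ValidationError."""


@dataclass
class SketchConfig:
    """Hypothetical stand-in for the real config class (not its actual API)."""

    # Required: must contain at least one column configuration.
    columns: List[Dict[str, Any]]
    # Optional: defaults to an empty list when omitted.
    constraints: List[Dict[str, Any]] = field(default_factory=list)

    def __post_init__(self) -> None:
        # Validation runs at construction time, so an invalid config
        # never exists as an object.
        if not isinstance(self.columns, list) or len(self.columns) == 0:
            raise ConfigValidationError("columns must contain at least one column")


# A valid config constructs normally.
ok = SketchConfig(columns=[{"name": "id"}])

# An empty column list is rejected at construction time.
try:
    SketchConfig(columns=[])
    rejected = False
except ConfigValidationError:
    rejected = True
```

The point of the sketch is the timing of the check: errors surface when the config is built, not later when the engine consumes it.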
Compute a deterministic content-addressable fingerprint of this config.
See data_designer.config.fingerprint.fingerprint_config for the full
list of identity-relevant and excluded fields, and how custom column
generators are identified.
Returns:
dict[str, str | int]
A dict with config_hash, config_hash_algo, and config_hash_version.
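One common way to build a deterministic, content-addressable fingerprint is to serialize the identity-relevant fields in a canonical form and hash the result. The sketch below assumes SHA-256 over canonical JSON and a version number of 1; the function name fingerprint, the excluded parameter, and these choices are illustrative assumptions, not the behavior of data_designer.config.fingerprint.fingerprint_config.

```python
import hashlib
import json
from typing import Any, Dict, FrozenSet, Union

HASH_ALGO = "sha256"  # assumed algorithm, for illustration only
HASH_VERSION = 1      # assumed scheme version, for illustration only


def fingerprint(
    config: Dict[str, Any],
    excluded: FrozenSet[str] = frozenset(),
) -> Dict[str, Union[str, int]]:
    """Hash the identity-relevant part of a plain-dict config (hypothetical helper)."""
    # Drop top-level fields that should not affect identity.
    relevant = {k: v for k, v in config.items() if k not in excluded}
    # Canonical form: sorted keys and fixed separators, so key order
    # and whitespace never change the hash.
    canonical = json.dumps(relevant, sort_keys=True, separators=(",", ":"))
    digest = hashlib.sha256(canonical.encode("utf-8")).hexdigest()
    return {
        "config_hash": digest,
        "config_hash_algo": HASH_ALGO,
        "config_hash_version": HASH_VERSION,
    }


# Same content, different key order and a different excluded field value.
a = fingerprint({"columns": [{"name": "x"}], "note": "a"}, excluded=frozenset({"note"}))
b = fingerprint({"note": "b", "columns": [{"name": "x"}]}, excluded=frozenset({"note"}))
```

Here a and b are equal because only the columns content contributes to the hash. Carrying a version field alongside the hash lets the hashing scheme change later without silently colliding with older fingerprints.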