nemo_microservices.types.beta.safe_synthesizer.data_parameters
Module Contents
Classes
API
- class nemo_microservices.types.beta.safe_synthesizer.data_parameters.DataParameters(/, **data: Any)
Bases: nemo_microservices._models.BaseModel
- group_training_examples_by: str | None
None
Column(s) to group training examples by.
This is useful when you want the model to learn inter-record correlations for a given grouping of records.
- holdout: float | None
None
Number of records to hold out.
If this is a float between 0 and 1, that fraction of the records is held out. If it is an integer greater than 1, that number of records is held out. If the value is zero, no holdout is performed.
- max_holdout: int | None
None
Maximum number of records to hold out.
Overrides any behavior set by the holdout parameter.
- max_sequences_per_example: typing_extensions.Literal[auto] | int | None
None
If specified, at most this number of sequences is added per example; otherwise, each example is filled up to the context length. Supports ‘auto’, which selects a value of 1 if differential privacy (DP) is enabled and None otherwise. Required for DP, to limit the contribution of each example.
- order_training_examples_by: str | None
None
Column to order training examples by.
This is useful when you want the model to learn sequential relationships for a given ordering of records. If you provide this parameter, you must also provide group_training_examples_by.
- random_state: int | None
None
Random state used for the holdout split, to ensure reproducibility.
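The field descriptions above can be read as a small set of rules. The following is a minimal standalone sketch of those rules, not the library's implementation: the helper names `resolve_holdout_count`, `resolve_max_sequences_per_example`, and `check_ordering` are hypothetical, and several details are assumptions (flooring the ratio, treating a holdout of exactly 1 as a count, and treating max_holdout as an upper bound on the holdout size).

```python
from typing import Optional, Union


def resolve_holdout_count(
    n_records: int,
    holdout: Optional[float],
    max_holdout: Optional[int] = None,
) -> int:
    """Hypothetical helper: how many records the documented rules would hold out."""
    if holdout is None or holdout == 0:
        return 0  # zero (or unset) means no holdout
    if holdout < 0:
        raise ValueError("holdout must not be negative")
    if holdout < 1:
        # Ratio case. Flooring is an assumption; the source does not specify rounding.
        count = int(n_records * holdout)
    else:
        # Count case. Treating exactly 1 as a count is an assumption.
        count = int(holdout)
    if max_holdout is not None:
        # Assumption: "overrides" is interpreted here as an upper bound.
        count = min(count, max_holdout)
    return min(count, n_records)


def resolve_max_sequences_per_example(
    value: Union[str, int, None],
    dp_enabled: bool,
) -> Optional[int]:
    """Hypothetical helper: resolve 'auto' to 1 under DP, and to None otherwise."""
    if value == "auto":
        return 1 if dp_enabled else None
    return value


def check_ordering(group_by: Optional[str], order_by: Optional[str]) -> None:
    """Hypothetical helper: ordering requires a grouping column."""
    if order_by is not None and group_by is None:
        raise ValueError(
            "order_training_examples_by requires group_training_examples_by"
        )
```

Under these assumptions, `resolve_holdout_count(1000, 0.25)` gives 250, and adding `max_holdout=100` reduces it to 100. Because the class signature accepts `**data`, values validated this way could then be passed as keyword arguments, for example `DataParameters(holdout=0.25, max_holdout=100)`.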