Copyright © 2026, NVIDIA Corporation.

NeMo Data Designer
data_designer.config.validator_params

Module Contents

Classes

Name | Description
ValidatorType | Enumeration of the supported validator types: code, local_callable, remote.
CodeValidatorParams | Configuration for code validation. Supports Python and SQL code validation.
LocalCallableValidatorParams | Configuration for local callable validation. Expects a function to be passed that validates the data.
RemoteValidatorParams | Configuration for remote validation. Sends data to a remote endpoint for validation.

Data

SUPPORTED_CODE_LANGUAGES
ValidatorParamsT

API

SUPPORTED_CODE_LANGUAGES

class data_designer.config.validator_params.ValidatorType

Bases: str, enum.Enum

CODE = "code"
LOCAL_CALLABLE = "local_callable"
REMOTE = "remote"
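
Because ValidatorType derives from both str and enum.Enum, its members should behave like plain strings in comparisons and can be looked up by value. The sketch below uses a stand-in class that mirrors the documented members rather than importing the library, and it assumes the member values are the lowercase strings listed above.

```python
from enum import Enum


class ValidatorType(str, Enum):
    """Stand-in mirroring the documented members; not the library class itself."""

    CODE = "code"
    LOCAL_CALLABLE = "local_callable"
    REMOTE = "remote"


# A str-based enum member compares equal to its plain string value.
is_code = ValidatorType.CODE == "code"

# Members can be recovered from their string value, e.g. after loading a config file.
remote = ValidatorType("remote")

# Iteration follows definition order.
values = [member.value for member in ValidatorType]
```

This string compatibility is what typically lets such enums round-trip through JSON or YAML configuration without custom encoders.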
class data_designer.config.validator_params.CodeValidatorParams(
    /,
    **data: typing.Any
)

Bases: data_designer.config.base.ConfigBase

Configuration for code validation. Supports Python and SQL code validation.

Parameters:

code_lang

The language of the code to validate. Supported values include: python, sql:sqlite, sql:postgres, sql:mysql, sql:tsql, sql:bigquery, sql:ansi.

Attributes:

code_lang
`required`

The language of the code to validate. Supported values include: python, sql:sqlite, sql:postgres, sql:mysql, sql:tsql, sql:bigquery, sql:ansi.
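
As an illustration of the supported values, the following sketch checks a code_lang string against the documented list and splits it into a language family and an optional SQL dialect. The helper split_code_lang and the constant DOCUMENTED_CODE_LANGS are hypothetical names introduced here; the library's own validate_code_lang check may behave differently.

```python
from __future__ import annotations

# Values copied from the documentation above; not read from the library.
DOCUMENTED_CODE_LANGS = frozenset(
    {
        "python",
        "sql:sqlite",
        "sql:postgres",
        "sql:mysql",
        "sql:tsql",
        "sql:bigquery",
        "sql:ansi",
    }
)


def split_code_lang(code_lang: str) -> tuple[str, str | None]:
    """Hypothetical helper: return (family, dialect) for a documented code_lang."""
    if code_lang not in DOCUMENTED_CODE_LANGS:
        raise ValueError(f"Unsupported code_lang: {code_lang!r}")
    # "sql:postgres" -> ("sql", "postgres"); "python" -> ("python", None)
    family, _, dialect = code_lang.partition(":")
    return family, dialect or None


python_lang = split_code_lang("python")
postgres_lang = split_code_lang("sql:postgres")
```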

Initialization:

Create a new model by parsing and validating input data from keyword arguments.

Raises pydantic_core.ValidationError if the input data cannot be validated to form a valid model.

self is explicitly positional-only to allow self as a field name.

validator_type: typing.Literal[data_designer.config.validator_params.ValidatorType] = Field(...)
code_lang: data_designer.config.utils.code_lang.CodeLang = Field(...)
validate_code_lang() -> typing_extensions.Self
class data_designer.config.validator_params.LocalCallableValidatorParams(
    /,
    **data: typing.Any
)

Bases: data_designer.config.base.ConfigBase

Configuration for local callable validation. Expects a function to be passed that validates the data.

Parameters:

validation_function

Function (Callable[[pd.DataFrame], pd.DataFrame]) to validate the data. Output must contain a column is_valid of type bool.

output_schema

The JSON schema for the local callable validator’s output. If not provided, the output will not be validated.

Attributes:

validation_function
`required`

Function (Callable[[pd.DataFrame], pd.DataFrame]) to validate the data. Output must contain a column is_valid of type bool.

output_schema

The JSON schema for the local callable validator’s output. If not provided, the output will not be validated.
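
A validation function of the documented shape, Callable[[pd.DataFrame], pd.DataFrame], might look like the minimal sketch below. The column name text and the non-empty rule are assumptions chosen for illustration; the only requirement taken from the documentation is that the output contains a boolean is_valid column.

```python
import pandas as pd


def validate_non_empty_text(df: pd.DataFrame) -> pd.DataFrame:
    """Mark a row valid when its `text` column holds a non-blank string.

    `text` is a hypothetical column name; substitute the column being validated.
    """
    # Treat missing values as empty strings, then strip surrounding whitespace.
    text = df["text"].fillna("").astype(str).str.strip()
    # The documented contract: the output must contain a bool column `is_valid`.
    return pd.DataFrame({"is_valid": text.str.len() > 0}, index=df.index)


batch = pd.DataFrame({"text": ["hello", "   ", None]})
result = validate_non_empty_text(batch)
```

A function like this would be passed as validation_function. Keeping the output index aligned with the input makes it straightforward to match each verdict back to its row.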

Initialization:

Create a new model by parsing and validating input data from keyword arguments.

Raises pydantic_core.ValidationError if the input data cannot be validated to form a valid model.

self is explicitly positional-only to allow self as a field name.

validator_type: typing.Literal[data_designer.config.validator_params.ValidatorType] = Field(...)
validation_function: typing.Any = Field(...)
output_schema: dict[str, typing.Any] | None = Field(...)
serialize_validation_function(v: typing.Any) -> typing.Any
validate_validation_function() -> typing_extensions.Self
class data_designer.config.validator_params.RemoteValidatorParams(
    /,
    **data: typing.Any
)

Bases: data_designer.config.base.ConfigBase

Configuration for remote validation. Sends data to a remote endpoint for validation.

Parameters:

endpoint_url

The URL of the remote endpoint.

output_schema

The JSON schema for the remote validator’s output. If not provided, the output will not be validated.

timeout

The timeout for the HTTP request in seconds. Defaults to 30.0.

max_retries

The maximum number of retry attempts. Defaults to 3.

retry_backoff

The backoff factor for the retry delay in seconds. Defaults to 2.0.

max_parallel_requests

The maximum number of parallel requests to make. Defaults to 4.

Attributes:

endpoint_url
`required`

The URL of the remote endpoint.

output_schema

The JSON schema for the remote validator’s output. If not provided, the output will not be validated.

timeout

The timeout for the HTTP request in seconds. Defaults to 30.0.

max_retries

The maximum number of retry attempts. Defaults to 3.

retry_backoff

The backoff factor for the retry delay in seconds. Defaults to 2.0.

max_parallel_requests

The maximum number of parallel requests to make. Defaults to 4.
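
To get a feel for how timeout, max_retries, and retry_backoff interact, the sketch below estimates a retry schedule from the documented defaults. The exact delay formula is not stated on this page, so the code assumes a common exponential scheme (retry_backoff * 2**attempt) and assumes that max_retries counts retries made after the initial request. Both function names are hypothetical.

```python
def retry_delays(max_retries: int = 3, retry_backoff: float = 2.0) -> list[float]:
    """Delays in seconds before each retry.

    ASSUMPTION: exponential backoff of retry_backoff * 2**attempt; the
    library's actual formula may differ.
    """
    return [retry_backoff * 2**attempt for attempt in range(max_retries)]


def worst_case_seconds(
    timeout: float = 30.0, max_retries: int = 3, retry_backoff: float = 2.0
) -> float:
    """Upper bound on time spent on one request if every attempt times out.

    ASSUMPTION: one initial attempt plus max_retries retries.
    """
    attempts = max_retries + 1
    return timeout * attempts + sum(retry_delays(max_retries, retry_backoff))


delays = retry_delays()
budget = worst_case_seconds()
```

Under these assumptions the defaults give delays of 2, 4, and 8 seconds and a worst case of 134 seconds per request, which is worth keeping in mind when sizing max_parallel_requests for a slow endpoint.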

Initialization:

Create a new model by parsing and validating input data from keyword arguments.

Raises pydantic_core.ValidationError if the input data cannot be validated to form a valid model.

self is explicitly positional-only to allow self as a field name.

validator_type: typing.Literal[data_designer.config.validator_params.ValidatorType] = Field(...)
endpoint_url: str = Field(...)
output_schema: dict[str, typing.Any] | None = Field(...)
timeout: float = Field(...)
max_retries: int = Field(...)
retry_backoff: float = Field(...)
max_parallel_requests: int = Field(...)
ValidatorParamsT: typing_extensions.TypeAlias