Evaluation Target Schema Reference#

This page provides a complete schema reference for evaluation targets, including all fields, data types, and nested properties. Use this reference when you need detailed technical specifications for API requests or to understand the full structure of target objects.

When to use this page:

You need detailed field specifications for API integration
You want to understand all available configuration options
You are troubleshooting schema validation errors
You need to reference specific nested object structures

For task-oriented guidance, refer to Create Evaluation Target for step-by-step instructions on creating targets, or browse the target type guides for conceptual overviews and examples.

Important

Each target is uniquely identified by a combination of namespace and name. For example, my-organization/my-target.

Available Target Types#

Evaluation targets define what you’re evaluating. NeMo Evaluator supports the following target types:

Model – LLM model with inference endpoint
RAG – Retrieval-Augmented Generation pipeline
Retriever – Document retrieval pipeline
Rows – Inline evaluation data
Dataset – Reference to stored evaluation data

Primary Schemas#

These are the main schemas you’ll use when creating targets or reading target information from the API.

EvaluationTargetInput#

Use this schema when creating a new evaluation target through POST requests to the /evaluation/targets endpoint.

EvaluationTargetInput object

An entity representing the target of the evaluation.

Properties

name string

The name of the entity. Must be unique inside the namespace. If not specified, it will be the same as the automatically generated id.

Constraints: max length: 255, pattern: ^[\w\-\+.@:]*$

Default:

namespace string

The namespace of the entity. This can be missing for namespace entities or in deployments that don't use namespaces.

Default: default

description string

The description of the entity.

type * string

The type of the evaluation target, e.g., 'model', 'retriever', 'rag'.

Allowed values:

modelcached_outputsretrieverragrowsdataset

cached_outputs object

The model to be evaluated.

Properties

files_url * string

The files URL of the cached outputs.

Constraints: min length: 1

model string | object

The model to be evaluated.

Any of:

Option 1: string - A reference to ModelInputEV.

Option 2: object - Information about a machine learning model, typically an LLM.

retriever object

Retriever to be evaluated.

Properties

pipeline object

Retriever pipeline run by Evaluator.

Properties

query_embedding_model * string | object

The query embedding model.

Any of:

Option 1: string - A reference to ModelInputEV.

Option 2: object - Information about a machine learning model, typically an LLM.

index_embedding_model * string | object

The index embedding model.

Any of:

Option 1: string - A reference to ModelInputEV.

Option 2: object - Information about a machine learning model, typically an LLM.

reranker_model string | object

The reranker model.

Any of:

Option 1: string - A reference to ModelInputEV.

Option 2: object - Information about a machine learning model, typically an LLM.

top_k integer

The top k results to be used.

Default: 10

cached_outputs object

Cached pre-generated retriever outputs.

Properties

files_url * string

The files URL of the cached outputs.

Constraints: min length: 1

dataset object

Cached pre-generated retriever outputs dataset.

Properties

schema_version string

The version of the schema for the object. Internal use only.

Default: 1.0

id string

The ID of the entity. With the exception of namespaces, this is always a semantically-prefixed base58-encoded uuid4 [-base58(uuid4())].

description string

The description of the entity.

type_prefix string

The type prefix of the entity ID. If not specified, it will be inferred from the entity type name, but this will likely result in long prefixes.

namespace string

The namespace of the entity. This can be missing for namespace entities or in deployments that don't use namespaces.

Default: default

project string

The URN of the project associated with this entity.

created_at string

Timestamp for when the entity was created.

updated_at string

Timestamp for when the entity was last updated.

custom_fields object

A set of custom fields that the user can define and use for various purposes.

Allows additional properties: Yes

ownership object

Ownership information for the entity

Properties

created_by string

The ID of the user that created this entity.

Default:

updated_by string

The ID of the user that last updated this entity.

access_policies object

A general object for capturing access policies which can be used by an external service to determine ACLs

Default: {}

Additional properties schema:

[key: string] string

name string

The name of the entity. Must be unique inside the namespace. If not specified, it will be the same as the automatically generated id.

Constraints: max length: 255, pattern: ^[\w\-\+.@:]*$

Default:

version_id string

A unique, immutable id for the version. This is similar to the commit hash.

Default: main

version_tags array

The list of version tags associated with this entity.

Array items:

item object

A version tag marks a particular version for an entity. The name must be unique across all version tags for the same entity name.

Properties

name * string

The name of the version. Typically should be kept lower case, no space, no '/', no `@`.

created_by string

The ID of the user that created this entity.

Default:

created_at string

Timestamp for when the version tag was created.

format string

Specifies the dataset format, referring to the schema of the dataset rather than the file format. Examples include SQuAD, BEIR, etc.

files_url * string

The location where the artifact files are stored. This can be a URL pointing to NDS, Hugging Face, S3, or any other accessible resource location.

Constraints: min length: 1

hf_endpoint string

For HuggingFace URLs, the endpoint that should be used. By default, this is set to the Data Store URL. For HuggingFace Hub, this should be set to "https://huggingface.co".

split string

The split of the dataset. Examples include train, validation, test, etc.

limit integer

The maximum number of items to be used from the dataset.

rag object

RAG to be evaluated.

Properties

pipeline object

RAG pipeline.

Properties

retriever * object

The retriever pipeline included in the RAG pipeline.

Properties

pipeline object

Retriever pipeline run by Evaluator.

Properties

query_embedding_model * string | object

The query embedding model.

Any of:

Option 1: string - A reference to ModelInputEV.

Option 2: object - Information about a machine learning model, typically an LLM.

index_embedding_model * string | object

The index embedding model.

Any of:

Option 1: string - A reference to ModelInputEV.

Option 2: object - Information about a machine learning model, typically an LLM.

reranker_model string | object

The reranker model.

Any of:

Option 1: string - A reference to ModelInputEV.

Option 2: object - Information about a machine learning model, typically an LLM.

top_k integer

The top k results to be used.

Default: 10

cached_outputs object

Cached pre-generated retriever outputs.

Properties

files_url * string

The files URL of the cached outputs.

Constraints: min length: 1

dataset object

Cached pre-generated retriever outputs dataset.

Properties

schema_version string

The version of the schema for the object. Internal use only.

Default: 1.0

id string

The ID of the entity. With the exception of namespaces, this is always a semantically-prefixed base58-encoded uuid4 [-base58(uuid4())].

description string

The description of the entity.

type_prefix string

The type prefix of the entity ID. If not specified, it will be inferred from the entity type name, but this will likely result in long prefixes.

namespace string

The namespace of the entity. This can be missing for namespace entities or in deployments that don't use namespaces.

Default: default

project string

The URN of the project associated with this entity.

created_at string

Timestamp for when the entity was created.

updated_at string

Timestamp for when the entity was last updated.

custom_fields object

A set of custom fields that the user can define and use for various purposes.

Allows additional properties: Yes

ownership object

Ownership information for the entity

Properties

created_by string

The ID of the user that created this entity.

Default:

updated_by string

The ID of the user that last updated this entity.

access_policies object

A general object for capturing access policies which can be used by an external service to determine ACLs

Default: {}

Additional properties schema:

[key: string] string

name string

The name of the entity. Must be unique inside the namespace. If not specified, it will be the same as the automatically generated id.

Constraints: max length: 255, pattern: ^[\w\-\+.@:]*$

Default:

version_id string

A unique, immutable id for the version. This is similar to the commit hash.

Default: main

version_tags array

The list of version tags associated with this entity.

Array items:

item object

A version tag marks a particular version for an entity. The name must be unique across all version tags for the same entity name.

Properties

name * string

The name of the version. Typically should be kept lower case, no space, no '/', no `@`.

created_by string

The ID of the user that created this entity.

Default:

created_at string

Timestamp for when the version tag was created.

format string

Specifies the dataset format, referring to the schema of the dataset rather than the file format. Examples include SQuAD, BEIR, etc.

files_url * string

The location where the artifact files are stored. This can be a URL pointing to NDS, Hugging Face, S3, or any other accessible resource location.

Constraints: min length: 1

hf_endpoint string

For HuggingFace URLs, the endpoint that should be used. By default, this is set to the Data Store URL. For HuggingFace Hub, this should be set to "https://huggingface.co".

split string

The split of the dataset. Examples include train, validation, test, etc.

limit integer

The maximum number of items to be used from the dataset.

model * string | object

The generation model for the RAG pipeline.

Any of:

Option 1: string - A reference to ModelInputEV.

Option 2: object - Information about a machine learning model, typically an LLM.

context_ordering string

The context ordering for the RAG pipeline.

Default: desc

cached_outputs object

Cached pre-generated RAG outputs.

Properties

files_url * string

The files URL of the cached outputs.

Constraints: min length: 1

dataset object

Cached pre-generated RAG outputs dataset.

Properties

schema_version string

The version of the schema for the object. Internal use only.

Default: 1.0

id string

The ID of the entity. With the exception of namespaces, this is always a semantically-prefixed base58-encoded uuid4 [-base58(uuid4())].

description string

The description of the entity.

type_prefix string

The type prefix of the entity ID. If not specified, it will be inferred from the entity type name, but this will likely result in long prefixes.

namespace string

The namespace of the entity. This can be missing for namespace entities or in deployments that don't use namespaces.

Default: default

project string

The URN of the project associated with this entity.

created_at string

Timestamp for when the entity was created.

updated_at string

Timestamp for when the entity was last updated.

custom_fields object

A set of custom fields that the user can define and use for various purposes.

Allows additional properties: Yes

ownership object

Ownership information for the entity

Properties

created_by string

The ID of the user that created this entity.

Default:

updated_by string

The ID of the user that last updated this entity.

access_policies object

A general object for capturing access policies which can be used by an external service to determine ACLs

Default: {}

Additional properties schema:

[key: string] string

name string

The name of the entity. Must be unique inside the namespace. If not specified, it will be the same as the automatically generated id.

Constraints: max length: 255, pattern: ^[\w\-\+.@:]*$

Default:

version_id string

A unique, immutable id for the version. This is similar to the commit hash.

Default: main

version_tags array

The list of version tags associated with this entity.

Array items:

item object

A version tag marks a particular version for an entity. The name must be unique across all version tags for the same entity name.

Properties

name * string

The name of the version. Typically should be kept lower case, no space, no '/', no `@`.

created_by string

The ID of the user that created this entity.

Default:

created_at string

Timestamp for when the version tag was created.

format string

Specifies the dataset format, referring to the schema of the dataset rather than the file format. Examples include SQuAD, BEIR, etc.

files_url * string

The location where the artifact files are stored. This can be a URL pointing to NDS, Hugging Face, S3, or any other accessible resource location.

Constraints: min length: 1

hf_endpoint string

For HuggingFace URLs, the endpoint that should be used. By default, this is set to the Data Store URL. For HuggingFace Hub, this should be set to "https://huggingface.co".

split string

The split of the dataset. Examples include train, validation, test, etc.

limit integer

The maximum number of items to be used from the dataset.

rows array

Rows to be evaluated.

Array items:

item object

Allows additional properties: Yes

dataset string | object

Dataset to be evaluated.

Any of:

Option 1: string - A reference to DatasetInputEV.

Option 2: object - A dataset that can be used for fine-tuning or evaluation.

project string

The URN of the project associated with this entity.

custom_fields object

A set of custom fields that the user can define and use for various purposes.

Allows additional properties: Yes

ownership object

Ownership information for the entity

Properties

created_by string

The ID of the user that created this entity.

Default:

updated_by string

The ID of the user that last updated this entity.

access_policies object

A general object for capturing access policies which can be used by an external service to determine ACLs

Default: {}

Additional properties schema:

[key: string] string

EvaluationTargetOutput#

Use this schema to understand evaluation target responses from GET requests when retrieving or managing existing targets.

EvaluationTargetOutput object

An entity representing the target of the evaluation.

Properties

schema_version string

The version of the schema for the object. Internal use only.

Default: 1.0

id string

The ID of the entity. With the exception of namespaces, this is always a semantically-prefixed base58-encoded uuid4 [-base58(uuid4())].

description string

The description of the entity.

type_prefix string

Default: eval-target

namespace string

The namespace of the entity. This can be missing for namespace entities or in deployments that don't use namespaces.

Default: default

project string

The URN of the project associated with this entity.

created_at string

Timestamp for when the entity was created.

updated_at string

Timestamp for when the entity was last updated.

custom_fields object

A set of custom fields that the user can define and use for various purposes.

Allows additional properties: Yes

ownership object

Ownership information for the entity

Properties

created_by string

The ID of the user that created this entity.

Default:

updated_by string

The ID of the user that last updated this entity.

access_policies object

A general object for capturing access policies which can be used by an external service to determine ACLs

Default: {}

Additional properties schema:

[key: string] string

name string

The name of the entity. Must be unique inside the namespace. If not specified, it will be the same as the automatically generated id.

Constraints: max length: 255, pattern: ^[\w\-\+.@:]*$

Default:

type * string

The type of the evaluation target, e.g., 'model', 'retriever', 'rag'.

Allowed values:

modelcached_outputsretrieverragrowsdataset

cached_outputs object

The model to be evaluated.

Properties

files_url * string

The files URL of the cached outputs.

Constraints: min length: 1

model string | object

The model to be evaluated.

Any of:

Option 1: string - A reference to ModelOutputEV.

Option 2: object - Information about a machine learning model, typically an LLM.

retriever object

Retriever to be evaluated.

Properties

pipeline object

Retriever pipeline run by Evaluator.

Properties

query_embedding_model * string | object

The query embedding model.

Any of:

Option 1: string - A reference to ModelOutputEV.

Option 2: object - Information about a machine learning model, typically an LLM.

index_embedding_model * string | object

The index embedding model.

Any of:

Option 1: string - A reference to ModelOutputEV.

Option 2: object - Information about a machine learning model, typically an LLM.

reranker_model string | object

The reranker model.

Any of:

Option 1: string - A reference to ModelOutputEV.

Option 2: object - Information about a machine learning model, typically an LLM.

top_k integer

The top k results to be used.

Default: 10

cached_outputs object

Cached pre-generated retriever outputs.

Properties

files_url * string

The files URL of the cached outputs.

Constraints: min length: 1

dataset object

Cached pre-generated retriever outputs dataset.

Properties

schema_version string

The version of the schema for the object. Internal use only.

Default: 1.0

id string

The ID of the entity. With the exception of namespaces, this is always a semantically-prefixed base58-encoded uuid4 [-base58(uuid4())].

description string

The description of the entity.

type_prefix string

The type prefix of the entity ID. If not specified, it will be inferred from the entity type name, but this will likely result in long prefixes.

namespace string

The namespace of the entity. This can be missing for namespace entities or in deployments that don't use namespaces.

Default: default

project string

The URN of the project associated with this entity.

created_at string

Timestamp for when the entity was created.

updated_at string

Timestamp for when the entity was last updated.

custom_fields object

A set of custom fields that the user can define and use for various purposes.

Allows additional properties: Yes

ownership object

Ownership information for the entity

Properties

created_by string

The ID of the user that created this entity.

Default:

updated_by string

The ID of the user that last updated this entity.

access_policies object

A general object for capturing access policies which can be used by an external service to determine ACLs

Default: {}

Additional properties schema:

[key: string] string

name string

The name of the entity. Must be unique inside the namespace. If not specified, it will be the same as the automatically generated id.

Constraints: max length: 255, pattern: ^[\w\-\+.@:]*$

Default:

version_id string

A unique, immutable id for the version. This is similar to the commit hash.

Default: main

version_tags array

The list of version tags associated with this entity.

Array items:

item object

A version tag marks a particular version for an entity. The name must be unique across all version tags for the same entity name.

Properties

name * string

The name of the version. Typically should be kept lower case, no space, no '/', no `@`.

created_by string

The ID of the user that created this entity.

Default:

created_at string

Timestamp for when the version tag was created.

format string

Specifies the dataset format, referring to the schema of the dataset rather than the file format. Examples include SQuAD, BEIR, etc.

files_url * string

The location where the artifact files are stored. This can be a URL pointing to NDS, Hugging Face, S3, or any other accessible resource location.

Constraints: min length: 1

hf_endpoint string

For HuggingFace URLs, the endpoint that should be used. By default, this is set to the Data Store URL. For HuggingFace Hub, this should be set to "https://huggingface.co".

split string

The split of the dataset. Examples include train, validation, test, etc.

limit integer

The maximum number of items to be used from the dataset.

rag object

RAG to be evaluated.

Properties

pipeline object

RAG pipeline.

Properties

retriever * object

The retriever pipeline included in the RAG pipeline.

Properties

pipeline object

Retriever pipeline run by Evaluator.

Properties

query_embedding_model * string | object

The query embedding model.

Any of:

Option 1: string - A reference to ModelOutputEV.

Option 2: object - Information about a machine learning model, typically an LLM.

index_embedding_model * string | object

The index embedding model.

Any of:

Option 1: string - A reference to ModelOutputEV.

Option 2: object - Information about a machine learning model, typically an LLM.

reranker_model string | object

The reranker model.

Any of:

Option 1: string - A reference to ModelOutputEV.

Option 2: object - Information about a machine learning model, typically an LLM.

top_k integer

The top k results to be used.

Default: 10

cached_outputs object

Cached pre-generated retriever outputs.

Properties

files_url * string

The files URL of the cached outputs.

Constraints: min length: 1

dataset object

Cached pre-generated retriever outputs dataset.

Properties

schema_version string

The version of the schema for the object. Internal use only.

Default: 1.0

id string

The ID of the entity. With the exception of namespaces, this is always a semantically-prefixed base58-encoded uuid4 [-base58(uuid4())].

description string

The description of the entity.

type_prefix string

The type prefix of the entity ID. If not specified, it will be inferred from the entity type name, but this will likely result in long prefixes.

namespace string

The namespace of the entity. This can be missing for namespace entities or in deployments that don't use namespaces.

Default: default

project string

The URN of the project associated with this entity.

created_at string

Timestamp for when the entity was created.

updated_at string

Timestamp for when the entity was last updated.

custom_fields object

A set of custom fields that the user can define and use for various purposes.

Allows additional properties: Yes

ownership object

Ownership information for the entity

Properties

created_by string

The ID of the user that created this entity.

Default:

updated_by string

The ID of the user that last updated this entity.

access_policies object

A general object for capturing access policies which can be used by an external service to determine ACLs

Default: {}

Additional properties schema:

[key: string] string

name string

The name of the entity. Must be unique inside the namespace. If not specified, it will be the same as the automatically generated id.

Constraints: max length: 255, pattern: ^[\w\-\+.@:]*$

Default:

version_id string

A unique, immutable id for the version. This is similar to the commit hash.

Default: main

version_tags array

The list of version tags associated with this entity.

Array items:

item object

A version tag marks a particular version for an entity. The name must be unique across all version tags for the same entity name.

Properties

name * string

The name of the version. Typically should be kept lower case, no space, no '/', no `@`.

created_by string

The ID of the user that created this entity.

Default:

created_at string

Timestamp for when the version tag was created.

format string

Specifies the dataset format, referring to the schema of the dataset rather than the file format. Examples include SQuAD, BEIR, etc.

files_url * string

The location where the artifact files are stored. This can be a URL pointing to NDS, Hugging Face, S3, or any other accessible resource location.

Constraints: min length: 1

hf_endpoint string

For HuggingFace URLs, the endpoint that should be used. By default, this is set to the Data Store URL. For HuggingFace Hub, this should be set to "https://huggingface.co".

split string

The split of the dataset. Examples include train, validation, test, etc.

limit integer

The maximum number of items to be used from the dataset.

model * string | object

The generation model for the RAG pipeline.

Any of:

Option 1: string - A reference to ModelOutputEV.

Option 2: object - Information about a machine learning model, typically an LLM.

context_ordering string

The context ordering for the RAG pipeline.

Default: desc

cached_outputs object

Cached pre-generated RAG outputs.

Properties

files_url * string

The files URL of the cached outputs.

Constraints: min length: 1

dataset object

Cached pre-generated RAG outputs dataset.

Properties

schema_version string

The version of the schema for the object. Internal use only.

Default: 1.0

id string

The ID of the entity. With the exception of namespaces, this is always a semantically-prefixed base58-encoded uuid4 [-base58(uuid4())].

description string

The description of the entity.

type_prefix string

The type prefix of the entity ID. If not specified, it will be inferred from the entity type name, but this will likely result in long prefixes.

namespace string

The namespace of the entity. This can be missing for namespace entities or in deployments that don't use namespaces.

Default: default

project string

The URN of the project associated with this entity.

created_at string

Timestamp for when the entity was created.

updated_at string

Timestamp for when the entity was last updated.

custom_fields object

A set of custom fields that the user can define and use for various purposes.

Allows additional properties: Yes

ownership object

Ownership information for the entity

Properties

created_by string

The ID of the user that created this entity.

Default:

updated_by string

The ID of the user that last updated this entity.

access_policies object

A general object for capturing access policies which can be used by an external service to determine ACLs

Default: {}

Additional properties schema:

[key: string] string

name string

The name of the entity. Must be unique inside the namespace. If not specified, it will be the same as the automatically generated id.

Constraints: max length: 255, pattern: ^[\w\-\+.@:]*$

Default:

version_id string

A unique, immutable id for the version. This is similar to the commit hash.

Default: main

version_tags array

The list of version tags associated with this entity.

Array items:

item object

A version tag marks a particular version for an entity. The name must be unique across all version tags for the same entity name.

Properties

name * string

The name of the version. Typically should be kept lower case, no space, no '/', no `@`.

created_by string

The ID of the user that created this entity.

Default:

created_at string

Timestamp for when the version tag was created.

format string

Specifies the dataset format, referring to the schema of the dataset rather than the file format. Examples include SQuAD, BEIR, etc.

files_url * string

The location where the artifact files are stored. This can be a URL pointing to NDS, Hugging Face, S3, or any other accessible resource location.

Constraints: min length: 1

hf_endpoint string

For HuggingFace URLs, the endpoint that should be used. By default, this is set to the Data Store URL. For HuggingFace Hub, this should be set to "https://huggingface.co".

split string

The split of the dataset. Examples include train, validation, test, etc.

limit integer

The maximum number of items to be used from the dataset.

rows array

Rows to be evaluated.

Array items:

item object

Allows additional properties: Yes

dataset string | object

Dataset to be evaluated.

Any of:

Option 1: string - A reference to DatasetEV.

Option 2: object - A dataset that can be used for fine-tuning or evaluation.

Nested Configuration Schemas#

The schemas in this section define inline objects you can provide when creating evaluation targets. Use these schemas when you want to configure resources directly in the target creation request rather than referencing existing resources by name.

Understanding String References vs Inline Objects#

When creating an evaluation target, fields like model, rag, retriever, and dataset accept two types of values:

String reference – Reference an existing resource by namespace and name (for example, "my-org/my-model")
Inline object – Provide complete configuration using one of the schemas below

Use string references when you have pre-configured resources. Use inline objects when you want to define everything in a single request.

Example using a model target:

# Option 1: Reference an existing model by name
client.evaluation.targets.create(
    type="model",
    model="my-org/existing-model"
)

# Option 2: Provide inline model configuration (uses ModelInput schema)
client.evaluation.targets.create(
    type="model",
    model={
        "inference": {
            "url": "https://api.example.com/v1/chat/completions",
            "model_id": "meta/llama-3.1-8b-instruct",
            "format": "nim"
        }
    }
)

This pattern applies to all target types—use string references for existing resources or inline objects for complete configuration in a single request.

Target Type Configuration Schemas#

These schemas define the structure of inline objects for different target types.

ModelInput#

Define an inline model configuration in the model field when creating a target with type: "model".

ModelInput object

Information about a machine learning model, typically an LLM.

Properties

name string

The name of the entity. Must be unique inside the namespace. If not specified, it will be the same as the automatically generated id.

Constraints: max length: 255, pattern: ^[\w\-\+.@:]*$

Default:

namespace string

The namespace of the entity. This can be missing for namespace entities or in deployments that don't use namespaces.

Default: default

description string

The description of the entity.

spec object

Detailed specification for the model.

Properties

num_parameters * integer

The total number of trainable parameters in the model's neural network architecture.

context_size * integer

The maximum number of tokens to process together in a single forward pass through the model.

num_virtual_tokens * integer

The number of virtual tokens the model can support for techniques such as prompt tuning, where special trainable embeddings are prepended to inputs.

is_chat * boolean

Indicates if the model is designed for multi-turn conversation rather than single-prompt completion.

artifact object

A set of checkpoint files, configs, and other auxiliary info associated with this model.

Properties

gpu_arch string

The GPU architecture the model is optimized for.

precision string

The numerical precision format used to store model weights.

Allowed values:

int8bf16fp16fp32fp8-mixedbf16-mixed

tensor_parallelism integer

The number of GPU devices to split and process the model's neural network layers.

backend_engine string

The inference engine framework to run the model.

Allowed values:

nemotrt_llmvllmfaster_transformerhugging_face

status * string

The current state of the model artifact in the upload/deployment lifecycle.

Allowed values:

createdupload_failedupload_completed

files_url * string

The location where the artifact files are stored.

Constraints: min length: 1

base_model string | object

Link to another model which is used as a base for the current model.

Any of:

Option 1: string - A reference to Model.

Option 2: object - Information about a machine learning model, typically an LLM.

api_endpoint object

Data about the inference endpoint for this model.

Properties

url * string

The API endpoint URL.

Constraints: min length: 1

model_id * string

The id of the model. How this is used depends on the API endpoint format.

api_key string

The API key that should be used to access the endpoint.

format string

The format for the API endpoint, e.g., NVIDIA NIM, OpenAI, Llama-Stack.

Allowed values:

nimopenailama_stack

Default: nim

peft object

Configuration parameters for parameter efficient fine-tuning (PEFT) built into this model. PEFT methods like LoRA provide efficient model customization by training a small number of additional parameters while keeping the base model frozen.

Properties

finetuning_type * string

The type of finetuning.

Allowed values:

loralora_mergedall_weights

lora object

Configuration parameters for low-rank adaptation (LoRA) fine tuning, which injects trainable rank decomposition matrices into transformer layers to reduce parameters while maintaining quality. These parameters control adaptation strength using alpha scaling and layer selection (MLP/output) to optimize the balance between model size and performance.

Properties

alpha * integer

A scaling factor that controls how much influence the LoRA adaptations have on the base model's behavior. The alpha parameter should typically be set to dim or 0.5 * dim as the actual scaling applied in the training loop is alpha / dim.

apply_lora_to_mlp * boolean

Controls whether to adapt the model's feed-forward neural network layers using LoRA.

apply_lora_to_output * boolean

Controls whether to adapt the model's final output layer using LoRA.

p_tuning object

Configuration details for P-tuning optimization of the model.

Properties

token_embeddings * array

Learned continuous prompt embeddings that help optimize the model for specific tasks.

Array items:

item array

Array items:

item number

prompt object

Configuration for prompt engineering, which involves crafting input text to guide the model's behavior and responses through system prompts, few-shot examples, and other techniques.

Properties

system_prompt string

Initial instructions that define the model's role and behavior for the conversation.

inference_params object

Custom settings that control the model's text generation behavior.

Properties

top_p number

top_k integer

temperature number

max_tokens integer

stop array

Array items:

item string

reasoning_params object

Custom settings that control the model's reasoning behavior.

Properties

end_token string

Configure the end token to trim reasoning context based on the model's reasoning API. Used for omitting Nemotron reasoning steps from output denoted with tags

include_if_not_finished boolean

Configure whether to include reasoning context if the model has not finished reasoning.

effort string

Option for OpenAI models to specify low, medium, or high reasoning effort which balances between speed and reasoning accuracy.

icl_few_shot_examples string

Example input-output pairs that guide the model in understanding the desired task format and behavior.

system_prompt_template string

The template which will be used to compile the final prompt used for prompting the LLM. Currently supports only 'icl_few_shot_examples'

guardrails object

Data about the guardrail configuration built into this model. Guardrails provide safety and quality controls around model outputs by defining rules and filters that validate, modify or block responses based on criteria like toxicity, factuality, or custom business logic.

Properties

schema_version string

The version of the schema for the object. Internal use only.

Default: 1.0

id string

The ID of the entity. With the exception of namespaces, this is always a semantically-prefixed base58-encoded uuid4 [-base58(uuid4())].

description string

The description of the entity.

type_prefix string

Default: guardrail

namespace string

The namespace of the entity. This can be missing for namespace entities or in deployments that don't use namespaces.

Default: default

project string

The URN of the project associated with this entity.

created_at string

Timestamp for when the entity was created.

updated_at string

Timestamp for when the entity was last updated.

custom_fields object

A set of custom fields that the user can define and use for various purposes.

Allows additional properties: Yes

ownership object

Ownership information for the entity

Properties

created_by string

The ID of the user that created this entity.

Default:

updated_by string

The ID of the user that last updated this entity.

access_policies object

A general object for capturing access policies which can be used by an external service to determine ACLs

Default: {}

Additional properties schema:

[key: string] string

name string

The name of the entity. Must be unique inside the namespace. If not specified, it will be the same as the automatically generated id.

Constraints: max length: 255, pattern: ^[\w\-\+.@:]*$

Default:

version_id string

A unique, immutable id for the version. This is similar to the commit hash.

Default: main

version_tags array

The list of version tags associated with this entity.

Array items:

item object

A version tag marks a particular version for an entity. The name must be unique across all version tags for the same entity name.

Properties

name * string

The name of the version. Typically should be kept lower case, no space, no '/', no `@`.

created_by string

The ID of the user that created this entity.

Default:

created_at string

Timestamp for when the version tag was created.

data object

Guardrail configuration data

Allows additional properties: Yes

Properties

models * array

The list of models used by the rails configuration.

Array items:

item object

Configuration of a model used by the rails engine. Typically, the main model is configured e.g.: { "type": "main", "engine": "openai", "model": "gpt-3.5-turbo-instruct" }

Properties

type * string

engine * string

model string

The name of the model. If not specified, it should be specified through the parameters attribute.

api_key_env_var string

Optional environment variable with model's API Key. Do not include "$".

reasoning_config object

Configuration parameters for reasoning LLMs.

Properties

remove_reasoning_traces boolean

For reasoning models (e.g. DeepSeek-r1), if the output parser should remove reasoning traces.

Default: True

remove_thinking_traces boolean

start_token string

The start token used for reasoning traces.

Default:

end_token string

The end token used for reasoning traces.

Default:

parameters object

Allows additional properties: Yes

mode string

Whether the mode is 'text' completion or 'chat' completion. Allowed values are 'chat' or 'text'.

Allowed values:

chattext

Default: chat

instructions array

List of instructions in natural language that the LLM should use.

Default:

[{'type': 'general', 'content': 'Below is a conversation between a helpful AI assistant and a user. The bot is designed to generate human-like text based on the input that it receives. The bot is talkative and provides lots of specific details. If the bot does not know the answer to a question, it truthfully says it does not know.'}]

Array items:

item object

Configuration for instructions in natural language that should be passed to the LLM.

Properties

type * string

content * string

actions_server_url string

The URL of the actions server that should be used for the rails.

sample_conversation string

The sample conversation that should be used inside the prompts.

Default:

user "Hello there!"
  express greeting
bot express greeting
  "Hello! How can I assist you today?"
user "What can you do for me?"
  ask about capabilities
bot respond about capabilities
  "As an AI assistant, I can help you with a wide range of tasks. This includes question answering on various topics, generating text for various purposes and providing suggestions based on your preferences."
user "Tell me a bit about the history of NVIDIA."
  ask general question
bot response for general question
  "NVIDIA is a technology company that specializes in designing and manufacturing graphics processing units (GPUs) and other computer hardware. The company was founded in 1993 by Jen-Hsun Huang, Chris Malachowsky, and Curtis Priem."
user "tell me more"
  request more information
bot provide more information
  "Initially, the company focused on developing 3D graphics processing technology for the PC gaming market. In 1999, NVIDIA released the GeForce 256, the world's first GPU, which was a major breakthrough for the gaming industry. The company continued to innovate in the GPU space, releasing new products and expanding into other markets such as professional graphics, mobile devices, and artificial intelligence."
user "thanks"
  express appreciation
bot express appreciation and offer additional help
  "You're welcome. If you have any more questions or if there's anything else I can help you with, please don't hesitate to ask."

prompts array

The prompts that should be used for the various LLM tasks.

Array items:

item object

Configuration for prompts that will be used for a specific task.

Properties

task * string

The id of the task associated with this prompt.

content string

The content of the prompt, if it's a string.

messages array

The list of messages included in the prompt. Used for chat models.

Array items:

item object | string

Any of:

Option 1: object - Template for a message structure.

Option 2: string

models array

If specified, the prompt will be used only for the given LLM engines/models. The format is a list of strings with the format: or /.

Array items:

item string

output_parser string

The name of the output parser to use for this prompt.

max_length integer

The maximum length of the prompt in number of characters.

Default: 16000

mode string

Corresponds to the `prompting_mode` for which this prompt is fetched. Default is 'standard'.

Default: standard

stop array

If specified, will be configure stop tokens for models that support this.

Array items:

item string

max_tokens integer

The maximum number of tokens that can be generated in the chat completion.

prompting_mode string

Allows choosing between different prompting strategies.

Default: standard

lowest_temperature number

The lowest temperature that should be used for the LLM.

Default: 0.001

enable_multi_step_generation boolean

Whether to enable multi-step generation for the LLM.

Default: False

colang_version string

The Colang version to use.

Default: 1.0

custom_data object

Any custom configuration data that might be needed.

Allows additional properties: Yes

rails object

Configuration for the various rails (input, output, etc.).

Allows additional properties: No

Properties

config object

Configuration data for specific rails that are supported out-of-the-box.

Properties

fact_checking object

Configuration data for the fact-checking rail.

Properties

parameters object

Allows additional properties: Yes

fallback_to_self_check boolean

Whether to fall back to self-check if another method fail.

Default: False

autoalign object

Configuration data for the AutoAlign guardrails API.

Properties

parameters object

Allows additional properties: Yes

input object

Input configuration for AutoAlign guardrails

Properties

guardrails_config object

The guardrails configuration that is passed to the AutoAlign endpoint

Allows additional properties: Yes

output object

Output configuration for AutoAlign guardrails

Properties

guardrails_config object

The guardrails configuration that is passed to the AutoAlign endpoint

Allows additional properties: Yes

patronus object

Configuration data for the Patronus Evaluate API.

Properties

input object

Patronus Evaluate API configuration for an Input Guardrail

Properties

evaluate_config object

Configuration passed to the Patronus Evaluate API

Properties

success_strategy string

Strategy to determine whether the Patronus Evaluate API Guardrail passes or not.

Allowed values:

all_passany_pass

Default: all_pass

params object

Parameters to the Patronus Evaluate API

Allows additional properties: Yes

output object

Patronus Evaluate API configuration for an Output Guardrail

Properties

evaluate_config object

Configuration passed to the Patronus Evaluate API

Properties

success_strategy string

Strategy to determine whether the Patronus Evaluate API Guardrail passes or not.

Allowed values:

all_passany_pass

Default: all_pass

params object

Parameters to the Patronus Evaluate API

Allows additional properties: Yes

sensitive_data_detection object

Configuration for detecting sensitive data.

Properties

recognizers array

Additional custom recognizers. Check out https://microsoft.github.io/presidio/tutorial/08_no_code/ for more details.

Array items:

item object

Allows additional properties: Yes

input object

Configuration of the entities to be detected on the user input.

Properties

entities array

The list of entities that should be detected. Check out https://microsoft.github.io/presidio/supported_entities/ forthe list of supported entities.

Array items:

item string

mask_token string

The token that should be used to mask the sensitive data.

Default: *

score_threshold number

The score threshold that should be used to detect the sensitive data.

Default: 0.2

output object

Configuration of the entities to be detected on the bot output.

Properties

entities array

The list of entities that should be detected. Check out https://microsoft.github.io/presidio/supported_entities/ forthe list of supported entities.

Array items:

item string

mask_token string

The token that should be used to mask the sensitive data.

Default: *

score_threshold number

The score threshold that should be used to detect the sensitive data.

Default: 0.2

retrieval object

Configuration of the entities to be detected on retrieved relevant chunks.

Properties

entities array

The list of entities that should be detected. Check out https://microsoft.github.io/presidio/supported_entities/ forthe list of supported entities.

Array items:

item string

mask_token string

The token that should be used to mask the sensitive data.

Default: *

score_threshold number

The score threshold that should be used to detect the sensitive data.

Default: 0.2

jailbreak_detection object

Configuration for jailbreak detection.

Properties

server_endpoint string

The endpoint for the jailbreak detection heuristics/model container.

length_per_perplexity_threshold number

The length/perplexity threshold.

Default: 89.79

prefix_suffix_perplexity_threshold number

The prefix/suffix perplexity threshold.

Default: 1845.65

nim_base_url string

Base URL for jailbreak detection model. Example: http://localhost:8000/v1

nim_server_endpoint string

Classification path uri. Defaults to 'classify' for NemoGuard JailbreakDetect.

Default: classify

api_key string

Secret String with API key for use in Jailbreak requests. Takes precedence over api_key_env_var

api_key_env_var string

Environment variable containing API key for jailbreak detection model

nim_url string

DEPRECATED: Use nim_base_url instead

nim_port integer

DEPRECATED: Include port in nim_base_url instead

embedding string

injection_detection object

Configuration for injection detection.

Properties

injections array

The list of injection types to detect. Options are 'sqli', 'template', 'code', 'xss'.Currently, only SQL injection, template injection, code injection, and markdown cross-site scripting are supported. Custom rules can be added, provided they are in the `yara_path` and have a `.yara` file extension.

Array items:

item string

action string

Action to take. Options are 'reject' to offer a rejection message, 'omit' to mask the offending content, and 'sanitize' to pass the content as-is in the safest way. These options are listed in descending order of relative safety. 'sanitize' is not implemented at this time.

Constraints: pattern: ^(reject|omit)$

Default: reject

yara_rules object

Dictionary mapping rule names to YARA rule strings. If provided, these rules will be used instead of loading rules from yara_path. Each rule should be a valid YARA rule string.

Additional properties schema:

[key: string] string

privateai object

Configuration for Private AI.

Properties

server_endpoint string

The endpoint for the private AI detection server.

Default: http://localhost:8080/process/text

input object

Configuration of the entities to be detected on the user input.

Properties

entities array

The list of entities that should be detected.

Array items:

item string

output object

Configuration of the entities to be detected on the bot output.

Properties

entities array

The list of entities that should be detected.

Array items:

item string

retrieval object

Configuration of the entities to be detected on retrieved relevant chunks.

Properties

entities array

The list of entities that should be detected.

Array items:

item string

fiddler object

Configuration for Fiddler Guardrails.

Properties

fiddler_endpoint string

The global endpoint for Fiddler Guardrails requests.

Default: http://localhost:8080/process/text

safety_threshold number

Fiddler Guardrails safety detection threshold.

Default: 0.1

faithfulness_threshold number

Fiddler Guardrails faithfulness detection threshold.

Default: 0.05

clavata object

Configuration for Clavata.

Properties

server_endpoint string

The endpoint for the Clavata API

Default: https://gateway.app.clavata.ai:8443

policies object

A dictionary of policy aliases and their corresponding IDs.

Additional properties schema:

[key: string] string

label_match_logic string

The logic to use when deciding whether the evaluation matched. If ANY, only one of the configured labels needs to be found in the input or output. If ALL, all configured labels must be found in the input or output.

Allowed values:

ANYALL

Default: ANY

input object

Clavata configuration for an Input Guardrail

Properties

policy * string

The policy alias to use when evaluating inputs or outputs.

labels array

A list of labels to match against the policy. If no labels are provided, the overall policy result will be returned. If labels are provided, only hits on the provided labels will be considered a hit.

Array items:

item string

output object

Clavata configuration for an Output Guardrail

Properties

policy * string

The policy alias to use when evaluating inputs or outputs.

labels array

A list of labels to match against the policy. If no labels are provided, the overall policy result will be returned. If labels are provided, only hits on the provided labels will be considered a hit.

Array items:

item string

pangea object

Configuration for Pangea.

Properties

input object

Pangea configuration for an Input Guardrail

Properties

recipe * string

Recipe key of a configuration of data types and settings defined in the Pangea User Console. It specifies the rules that are to be applied to the text, such as defang malicious URLs.

output object

Pangea configuration for an Output Guardrail

Properties

recipe * string

Recipe key of a configuration of data types and settings defined in the Pangea User Console. It specifies the rules that are to be applied to the text, such as defang malicious URLs.

guardrails_ai object

Configuration for Guardrails AI validators.

Properties

validators array

List of Guardrails AI validators to apply. Each validator can have its own parameters and metadata.

Array items:

item object

Configuration for a single Guardrails AI validator.

Properties

name * string

Unique identifier or import path for the Guardrails AI validator (e.g., 'toxic_language', 'pii', 'regex_match', or 'guardrails/competitor_check').

parameters object

Parameters to pass to the validator during initialization (e.g., threshold, regex pattern).

Allows additional properties: Yes

metadata object

Metadata to pass to the validator during validation (e.g., valid_topics, context).

Allows additional properties: Yes

trend_micro object

Configuration for Trend Micro.

Properties

v1_url string

The endpoint for the Trend Micro AI Guard API

Default: https://api.xdr.trendmicro.com/beta/aiSecurity/guard

api_key_env_var string

Environment variable containing API key for Trend Micro AI Guard

input object

Configuration of the input rails.

Properties

parallel boolean

If True, the input rails are executed in parallel.

Default: False

flows array

The names of all the flows that implement input rails.

Array items:

item string

output object

Configuration of the output rails.

Properties

parallel boolean

If True, the output rails are executed in parallel.

Default: False

flows array

The names of all the flows that implement output rails.

Array items:

item string

streaming object

Configuration for streaming output rails.

Properties

enabled boolean

Enables streaming mode when True.

Default: False

chunk_size integer

The number of tokens in each processing chunk. This is the size of the token block on which output rails are applied.

Default: 200

context_size integer

The number of tokens carried over from the previous chunk to provide context for continuity in processing.

Default: 50

stream_first boolean

If True, token chunks are streamed immediately before output rails are applied.

Default: True

apply_to_reasoning_traces boolean

If True, output rails will apply guardrails to both reasoning traces and output response. If False, output rails will only apply guardrails to the output response excluding the reasoning traces, thus keeping reasoning traces unaltered.

Default: False

retrieval object

Configuration of the retrieval rails.

Properties

flows array

The names of all the flows that implement retrieval rails.

Array items:

item string

dialog object

Configuration of the dialog rails.

Properties

single_call object

Configuration for the single LLM call option.

Properties

enabled boolean

Default: False

fallback_to_multiple_calls boolean

Whether to fall back to multiple calls if a single call is not possible.

Default: True

user_messages object

Configuration for how the user messages are interpreted.

Properties

embeddings_only boolean

Whether to use only embeddings for computing the user canonical form messages.

Default: False

embeddings_only_similarity_threshold number

The similarity threshold to use when using only embeddings for computing the user canonical form messages.

embeddings_only_fallback_intent string

Defines the fallback intent when the similarity is below the threshold. If set to None, the user intent is computed normally using the LLM. If set to a string value, that string is used as the intent.

actions object

Configuration of action rails.

Properties

instant_actions array

The names of all actions which should finish instantly.

Array items:

item string

tool_output object

Configuration of tool output rails.

Properties

flows array

The names of all the flows that implement tool output rails.

Array items:

item string

parallel boolean

If True, the tool output rails are executed in parallel.

Default: False

tool_input object

Configuration of tool input rails.

Properties

flows array

The names of all the flows that implement tool input rails.

Array items:

item string

parallel boolean

If True, the tool input rails are executed in parallel.

Default: False

enable_rails_exceptions boolean

If set, the pre-defined guardrails raise exceptions instead of returning pre-defined messages.

Default: False

passthrough boolean

Weather the original prompt should pass through the guardrails configuration as is. This means it will not be altered in any way.

tracing object

Configuration for tracing.

Properties

enabled boolean

Default: False

adapters array

The list of tracing adapters to use. If not specified, the default adapters are used.

Array items:

item object

Allows additional properties: Yes

Properties

name string

The name of the adapter.

Default: FileSystem

span_format string

The span format to use. Options are 'legacy' (simple metrics) or 'opentelemetry' (OpenTelemetry semantic conventions).

Default: opentelemetry

enable_content_capture boolean

Capture prompts and responses (user/assistant/tool message content) in tracing/telemetry events. Disabled by default for privacy and alignment with OpenTelemetry GenAI semantic conventions. WARNING: Enabling this may include PII and sensitive data in your telemetry backend.

Default: False

files_url string

The location where the artifact files are stored.

model_providers array

List of ModelProvider namespace/name resource names that provide inference for this Model Entity

Array items:

item string

project string

The URN of the project associated with this entity.

custom_fields object

A set of custom fields that the user can define and use for various purposes.

Allows additional properties: Yes

ownership object

Ownership information for the entity

Properties

created_by string

The ID of the user that created this entity.

Default:

updated_by string

The ID of the user that last updated this entity.

access_policies object

A general object for capturing access policies which can be used by an external service to determine ACLs

Default: {}

Additional properties schema:

[key: string] string

RAGTargetInput#

Define an inline RAG pipeline configuration in the rag field when creating a target with type: "rag".

RAGTargetInput object

Properties

pipeline object

RAG pipeline.

Properties

retriever * object

The retriever pipeline included in the RAG pipeline.

Properties

pipeline object

Retriever pipeline run by Evaluator.

Properties

query_embedding_model * string | object

The query embedding model.

Any of:

Option 1: string - A reference to ModelInputEV.

Option 2: object - Information about a machine learning model, typically an LLM.

index_embedding_model * string | object

The index embedding model.

Any of:

Option 1: string - A reference to ModelInputEV.

Option 2: object - Information about a machine learning model, typically an LLM.

reranker_model string | object

The reranker model.

Any of:

Option 1: string - A reference to ModelInputEV.

Option 2: object - Information about a machine learning model, typically an LLM.

top_k integer

The top k results to be used.

Default: 10

cached_outputs object

Cached pre-generated retriever outputs.

Properties

files_url * string

The files URL of the cached outputs.

Constraints: min length: 1

dataset object

Cached pre-generated retriever outputs dataset.

Properties

schema_version string

The version of the schema for the object. Internal use only.

Default: 1.0

id string

The ID of the entity. With the exception of namespaces, this is always a semantically-prefixed base58-encoded uuid4 [-base58(uuid4())].

description string

The description of the entity.

type_prefix string

The type prefix of the entity ID. If not specified, it will be inferred from the entity type name, but this will likely result in long prefixes.

namespace string

The namespace of the entity. This can be missing for namespace entities or in deployments that don't use namespaces.

Default: default

project string

The URN of the project associated with this entity.

created_at string

Timestamp for when the entity was created.

updated_at string

Timestamp for when the entity was last updated.

custom_fields object

A set of custom fields that the user can define and use for various purposes.

Allows additional properties: Yes

ownership object

Ownership information for the entity

Properties

created_by string

The ID of the user that created this entity.

Default:

updated_by string

The ID of the user that last updated this entity.

access_policies object

A general object for capturing access policies which can be used by an external service to determine ACLs

Default: {}

Additional properties schema:

[key: string] string

name string

The name of the entity. Must be unique inside the namespace. If not specified, it will be the same as the automatically generated id.

Constraints: max length: 255, pattern: ^[\w\-\+.@:]*$

Default:

version_id string

A unique, immutable id for the version. This is similar to the commit hash.

Default: main

version_tags array

The list of version tags associated with this entity.

Array items:

item object

A version tag marks a particular version for an entity. The name must be unique across all version tags for the same entity name.

Properties

name * string

The name of the version. Typically should be kept lower case, no space, no '/', no `@`.

created_by string

The ID of the user that created this entity.

Default:

created_at string

Timestamp for when the version tag was created.

format string

Specifies the dataset format, referring to the schema of the dataset rather than the file format. Examples include SQuAD, BEIR, etc.

files_url * string

The location where the artifact files are stored. This can be a URL pointing to NDS, Hugging Face, S3, or any other accessible resource location.

Constraints: min length: 1

hf_endpoint string

For HuggingFace URLs, the endpoint that should be used. By default, this is set to the Data Store URL. For HuggingFace Hub, this should be set to "https://huggingface.co".

split string

The split of the dataset. Examples include train, validation, test, etc.

limit integer

The maximum number of items to be used from the dataset.

model * string | object

The generation model for the RAG pipeline.

Any of:

Option 1: string - A reference to ModelInputEV.

Option 2: object - Information about a machine learning model, typically an LLM.

context_ordering string

The context ordering for the RAG pipeline.

Default: desc

cached_outputs object

Cached pre-generated RAG outputs.

Properties

files_url * string

The files URL of the cached outputs.

Constraints: min length: 1

dataset object

Cached pre-generated RAG outputs dataset.

Properties

schema_version string

The version of the schema for the object. Internal use only.

Default: 1.0

id string

The ID of the entity. With the exception of namespaces, this is always a semantically-prefixed base58-encoded uuid4 [-base58(uuid4())].

description string

The description of the entity.

type_prefix string

The type prefix of the entity ID. If not specified, it will be inferred from the entity type name, but this will likely result in long prefixes.

namespace string

The namespace of the entity. This can be missing for namespace entities or in deployments that don't use namespaces.

Default: default

project string

The URN of the project associated with this entity.

created_at string

Timestamp for when the entity was created.

updated_at string

Timestamp for when the entity was last updated.

custom_fields object

A set of custom fields that the user can define and use for various purposes.

Allows additional properties: Yes

ownership object

Ownership information for the entity

Properties

created_by string

The ID of the user that created this entity.

Default:

updated_by string

The ID of the user that last updated this entity.

access_policies object

A general object for capturing access policies which can be used by an external service to determine ACLs

Default: {}

Additional properties schema:

[key: string] string

name string

The name of the entity. Must be unique inside the namespace. If not specified, it will be the same as the automatically generated id.

Constraints: max length: 255, pattern: ^[\w\-\+.@:]*$

Default:

version_id string

A unique, immutable id for the version. This is similar to the commit hash.

Default: main

version_tags array

The list of version tags associated with this entity.

Array items:

item object

A version tag marks a particular version for an entity. The name must be unique across all version tags for the same entity name.

Properties

name * string

The name of the version. Typically should be kept lower case, no space, no '/', no `@`.

created_by string

The ID of the user that created this entity.

Default:

created_at string

Timestamp for when the version tag was created.

format string

Specifies the dataset format, referring to the schema of the dataset rather than the file format. Examples include SQuAD, BEIR, etc.

files_url * string

The location where the artifact files are stored. This can be a URL pointing to NDS, Hugging Face, S3, or any other accessible resource location.

Constraints: min length: 1

hf_endpoint string

For HuggingFace URLs, the endpoint that should be used. By default, this is set to the Data Store URL. For HuggingFace Hub, this should be set to "https://huggingface.co".

split string

The split of the dataset. Examples include train, validation, test, etc.

limit integer

The maximum number of items to be used from the dataset.

RetrieverTargetInput#

Define an inline retriever configuration in the retriever field when creating a target with type: "retriever".

RetrieverTargetInput object

Properties

pipeline object

Retriever pipeline run by Evaluator.

Properties

query_embedding_model * string | object

The query embedding model.

Any of:

Option 1: string - A reference to ModelInputEV.

Option 2: object - Information about a machine learning model, typically an LLM.

index_embedding_model * string | object

The index embedding model.

Any of:

Option 1: string - A reference to ModelInputEV.

Option 2: object - Information about a machine learning model, typically an LLM.

reranker_model string | object

The reranker model.

Any of:

Option 1: string - A reference to ModelInputEV.

Option 2: object - Information about a machine learning model, typically an LLM.

top_k integer

The top k results to be used.

Default: 10

cached_outputs object

Cached pre-generated retriever outputs.

Properties

files_url * string

The files URL of the cached outputs.

Constraints: min length: 1

dataset object

Cached pre-generated retriever outputs dataset.

Properties

schema_version string

The version of the schema for the object. Internal use only.

Default: 1.0

id string

The ID of the entity. With the exception of namespaces, this is always a semantically-prefixed base58-encoded uuid4 [-base58(uuid4())].

description string

The description of the entity.

type_prefix string

The type prefix of the entity ID. If not specified, it will be inferred from the entity type name, but this will likely result in long prefixes.

namespace string

The namespace of the entity. This can be missing for namespace entities or in deployments that don't use namespaces.

Default: default

project string

The URN of the project associated with this entity.

created_at string

Timestamp for when the entity was created.

updated_at string

Timestamp for when the entity was last updated.

custom_fields object

A set of custom fields that the user can define and use for various purposes.

Allows additional properties: Yes

ownership object

Ownership information for the entity

Properties

created_by string

The ID of the user that created this entity.

Default:

updated_by string

The ID of the user that last updated this entity.

access_policies object

A general object for capturing access policies which can be used by an external service to determine ACLs

Default: {}

Additional properties schema:

[key: string] string

name string

The name of the entity. Must be unique inside the namespace. If not specified, it will be the same as the automatically generated id.

Constraints: max length: 255, pattern: ^[\w\-\+.@:]*$

Default:

version_id string

A unique, immutable id for the version. This is similar to the commit hash.

Default: main

version_tags array

The list of version tags associated with this entity.

Array items:

item object

A version tag marks a particular version for an entity. The name must be unique across all version tags for the same entity name.

Properties

name * string

The name of the version. Typically should be kept lower case, no space, no '/', no `@`.

created_by string

The ID of the user that created this entity.

Default:

created_at string

Timestamp for when the version tag was created.

format string

Specifies the dataset format, referring to the schema of the dataset rather than the file format. Examples include SQuAD, BEIR, etc.

files_url * string

The location where the artifact files are stored. This can be a URL pointing to NDS, Hugging Face, S3, or any other accessible resource location.

Constraints: min length: 1

hf_endpoint string

For HuggingFace URLs, the endpoint that should be used. By default, this is set to the Data Store URL. For HuggingFace Hub, this should be set to "https://huggingface.co".

split string

The split of the dataset. Examples include train, validation, test, etc.

limit integer

The maximum number of items to be used from the dataset.

Pipeline Configuration Schemas#

These schemas define pipeline-specific settings used within RAG and retriever target configurations.

RAGPipelineDataInput#

Nested pipeline configuration schema used within RAGTargetInput. Defines retrieval and generation settings for RAG pipelines.

RAGPipelineDataInput object

Data for evaluating a RAG pipeline.

Properties

retriever * object

The retriever pipeline included in the RAG pipeline.

Properties

pipeline object

Retriever pipeline run by Evaluator.

Properties

query_embedding_model * string | object

The query embedding model.

Any of:

Option 1: string - A reference to ModelInputEV.

Option 2: object - Information about a machine learning model, typically an LLM.

index_embedding_model * string | object

The index embedding model.

Any of:

Option 1: string - A reference to ModelInputEV.

Option 2: object - Information about a machine learning model, typically an LLM.

reranker_model string | object

The reranker model.

Any of:

Option 1: string - A reference to ModelInputEV.

Option 2: object - Information about a machine learning model, typically an LLM.

top_k integer

The top k results to be used.

Default: 10

cached_outputs object

Cached pre-generated retriever outputs.

Properties

files_url * string

The files URL of the cached outputs.

Constraints: min length: 1

dataset object

Cached pre-generated retriever outputs dataset.

Properties

schema_version string

The version of the schema for the object. Internal use only.

Default: 1.0

id string

The ID of the entity. With the exception of namespaces, this is always a semantically-prefixed base58-encoded uuid4 [-base58(uuid4())].

description string

The description of the entity.

type_prefix string

The type prefix of the entity ID. If not specified, it will be inferred from the entity type name, but this will likely result in long prefixes.

namespace string

The namespace of the entity. This can be missing for namespace entities or in deployments that don't use namespaces.

Default: default

project string

The URN of the project associated with this entity.

created_at string

Timestamp for when the entity was created.

updated_at string

Timestamp for when the entity was last updated.

custom_fields object

A set of custom fields that the user can define and use for various purposes.

Allows additional properties: Yes

ownership object

Ownership information for the entity

Properties

created_by string

The ID of the user that created this entity.

Default:

updated_by string

The ID of the user that last updated this entity.

access_policies object

A general object for capturing access policies which can be used by an external service to determine ACLs

Default: {}

Additional properties schema:

[key: string] string

name string

The name of the entity. Must be unique inside the namespace. If not specified, it will be the same as the automatically generated id.

Constraints: max length: 255, pattern: ^[\w\-\+.@:]*$

Default:

version_id string

A unique, immutable id for the version. This is similar to the commit hash.

Default: main

version_tags array

The list of version tags associated with this entity.

Array items:

item object

A version tag marks a particular version for an entity. The name must be unique across all version tags for the same entity name.

Properties

name * string

The name of the version. Typically should be kept lower case, no space, no '/', no `@`.

created_by string

The ID of the user that created this entity.

Default:

created_at string

Timestamp for when the version tag was created.

format string

Specifies the dataset format, referring to the schema of the dataset rather than the file format. Examples include SQuAD, BEIR, etc.

files_url * string

The location where the artifact files are stored. This can be a URL pointing to NDS, Hugging Face, S3, or any other accessible resource location.

Constraints: min length: 1

hf_endpoint string

For HuggingFace URLs, the endpoint that should be used. By default, this is set to the Data Store URL. For HuggingFace Hub, this should be set to "https://huggingface.co".

split string

The split of the dataset. Examples include train, validation, test, etc.

limit integer

The maximum number of items to be used from the dataset.

model * string | object

The generation model for the RAG pipeline.

Any of:

Option 1: string - A reference to ModelInputEV.

Option 2: object - Information about a machine learning model, typically an LLM.

context_ordering string

The context ordering for the RAG pipeline.

Default: desc

RetrieverPipelineDataInput#

Nested pipeline configuration schema used within RetrieverTargetInput. Defines embedding and reranking settings for retriever pipelines.

RetrieverPipelineDataInput object

Data for evaluating a retriever pipeline.

Properties

query_embedding_model * string | object

The query embedding model.

Any of:

Option 1: string - A reference to ModelInputEV.

Option 2: object - Information about a machine learning model, typically an LLM.

index_embedding_model * string | object

The index embedding model.

Any of:

Option 1: string - A reference to ModelInputEV.

Option 2: object - Information about a machine learning model, typically an LLM.

reranker_model string | object

The reranker model.

Any of:

Option 1: string - A reference to ModelInputEV.

Option 2: object - Information about a machine learning model, typically an LLM.

top_k integer

The top k results to be used.

Default: 10

Data Source Schemas#

Dataset#

Define an inline dataset configuration in the dataset field when creating a target with type: "dataset".

Dataset object

A dataset that can be used for fine-tuning or evaluation.

Properties

created_at string

Timestamp for when the entity was created.

updated_at string

Timestamp for when the entity was last updated.

name string

The name of the entity. Must be unique inside the namespace. If not specified, it will be the same as the automatically generated id.

Constraints: max length: 255, pattern: ^[\w\-\+.@:]*$

Default:

namespace string

The namespace of the entity. This can be missing for namespace entities or in deployments that don't use namespaces.

Default: default

description string

The description of the entity.

format string

Specifies the dataset format, referring to the schema of the dataset rather than the file format. Examples include SQuAD, BEIR, etc.

files_url * string

The location where the artifact files are stored. This can be a URL pointing to NDS, Hugging Face, S3, or any other accessible resource location.

Constraints: min length: 1

hf_endpoint string

For HuggingFace URLs, the endpoint that should be used. By default, this is set to the Data Store URL. For HuggingFace Hub, this should be set to "https://huggingface.co".

split string

The split of the dataset. Examples include train, validation, test, etc.

limit integer

The maximum number of items to be used from the dataset.

id string

The ID of the entity. With the exception of namespaces, this is always a semantically-prefixed base58-encoded uuid4 [-base58(uuid4())].

project string

The URN of the project associated with this entity.

custom_fields object

A set of custom fields that the user can define and use for various purposes.

Allows additional properties: Yes