mteb#

This page lists all evaluation tasks for the mteb harness, with the container, command template, and default configuration for each.

Task	Description
MMTEB	MMTEB
MTEB	MTEB
MTEB_NL_RETRIEVAL	MTEB_NL_RETRIEVAL
MTEB_VDR	MTEB Visual Document Retrieval benchmark
RTEB	RTEB
ViDoReV1	ViDoReV1
ViDoReV2	ViDoReV2
ViDoReV3	ViDoReV3
ViDoReV3_Text	ViDoReV3 Text (text_image markdown only)
ViDoReV3_Text_Image	ViDoReV3 Text+Image (text_image markdown + images)
custom_beir_task	Custom BEIR-formatted text retrieval benchmark
fiqa	Financial Opinion Mining and Question Answering
hotpotqa	HotpotQA is a question answering dataset featuring natural, multi-hop questions, with strong supervision for supporting facts to enable more explainable question answering systems.
miracl	MIRACL (Multilingual Information Retrieval Across a Continuum of Languages) is a multilingual retrieval dataset that focuses on search across 18 different languages.
miracl_lite	MIRACL (Multilingual Information Retrieval Across a Continuum of Languages) is a multilingual retrieval dataset that focuses on search across 18 different languages.
mldr	MLDR
mlqa	MLQA (MultiLingual Question Answering) is a benchmark dataset for evaluating cross-lingual question answering performance.
nano_fiqa	NanoFiQA2018 is a smaller subset of the Financial Opinion Mining and Question Answering dataset.
nq	Natural Questions (NQ) contains real user questions issued to Google search, and answers found from Wikipedia by annotators. NQ is designed for the training and evaluation of automatic question answering systems.
nvidia_digital_corpora_10k	NVIDIA Internal - Digital Corpora10k Retrieval
nvidia_digital_corpora_10k_text	NVIDIA Internal - Digital Corpora10k Text Retrieval
nvidia_earnings_v2	NVIDIA Internal - Earnings V2 Multimodal Retrieval
nvidia_earnings_v2_text	NVIDIA Internal - Earnings V2 Text Retrieval
nvidia_vidore_v1	NVIDIA ViDoReV1
nvidia_vidore_v1_text	NVIDIA Internal - ViDoReV1 Text Retrieval
nvidia_vidore_v2	NVIDIA ViDoReV2
nvidia_vidore_v2_text	NVIDIA Internal - ViDoReV2 Text Retrieval
nvidia_vidore_v3	NVIDIA ViDoReV3
nvidia_vidore_v3_text	NVIDIA Internal - ViDoReV3 Text Retrieval
techqa	NVIDIA TechQA
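
Each task type in the table resolves to a default `config.params.task` string that is passed to `--task`. As a quick reference, the sketch below collects some of those defaults in one lookup. The values are copied from the Defaults blocks on this page; the dictionary is deliberately partial, and the helper name `benchmark_for` is hypothetical.

```python
# Default `config.params.task` per task type, copied from the Defaults
# blocks on this page. Illustrative and partial: extend it as needed.
DEFAULT_BENCHMARK = {
    "MMTEB": "MTEB(Multilingual, v2)",
    "MTEB": "MTEB(eng, v2)",
    "MTEB_NL_RETRIEVAL": "MTEB(nld, v1, retrieval)",
    "MTEB_VDR": "VisualDocumentRetrieval",
    "RTEB": "RTEB(beta)",
    "ViDoReV1": "ViDoRe(v1)",
    "ViDoReV2": "ViDoRe(v2)",
    "ViDoReV3": "ViDoRe(v3)",
    "ViDoReV3_Text": "ViDoRe(v3, Text)",
    "ViDoReV3_Text_Image": "ViDoRe(v3, Text+Image)",
    "custom_beir_task": "custom_beir_task",
}


def benchmark_for(task_type: str) -> str:
    """Return the default --task value for a task type listed above."""
    try:
        return DEFAULT_BENCHMARK[task_type]
    except KeyError:
        raise ValueError(f"no default recorded here for {task_type!r}") from None


print(benchmark_for("MTEB_NL_RETRIEVAL"))
```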

MMTEB#

MMTEB

Container

Harness: mteb

Container:

nvcr.io/nvidia/eval-factory/mteb:26.01

Container Digest:

sha256:fb0ea5360bec880d4ecbfc63015d775dc3d22601e5ab17d760a992402646cbbb

Container Arch: multiarch

Task Type: MMTEB

Command

{% if target.api_endpoint.api_key_name is not none %}export API_TOKEN=${{target.api_endpoint.api_key_name}} &&{% endif %} {% if config.params.extra.dataset_path is not none %} export MTEB_INTERNAL_DATASET_PATH={{config.params.extra.dataset_path}} &&{% endif %} {% if config.params.extra.ranker.api_key is not none %}export RANKER_API_TOKEN=${{config.params.extra.ranker.api_key}} &&{% endif %} mteb  --encoder_name {{target.api_endpoint.model_id}} --encoder_url {{target.api_endpoint.url}} --task "{{config.params.task}}" --workdir {{config.output_dir}} --batch_size {{config.params.extra.batch_size}} --async_limit {{config.params.parallelism}} --max_retries {{config.params.max_retries}} --request_timeout {{config.params.request_timeout}} {% if config.params.extra.cache_path is not none %} --cache_path {{config.params.extra.cache_path}}{% endif %} {% if config.params.extra.args is not none %} {{config.params.extra.args}} {% endif %} {% if config.params.extra.language is not none %} --langs {{config.params.extra.language}} {% endif %} {% if config.params.extra.query_prompt_template is not none %} --query_prompt_template "{{config.params.extra.query_prompt_template}}"{% endif %} {% if config.params.extra.document_prompt_template is not none %} --document_prompt_template "{{config.params.extra.document_prompt_template}}"{% endif %} {% if config.params.extra.ranker.model_id is not none %} --ranker_name {{config.params.extra.ranker.model_id}} --ranker_url {{config.params.extra.ranker.url}} --ranker_endpoint_type {{config.params.extra.ranker.endpoint_type}}{% endif %} --truncate {{config.params.extra.truncate}} --top_k {{config.params.extra.top_k}} {% if config.params.extra.version_lite is not none%} --version_lite {{config.params.extra.version_lite}} {% endif %} {% if config.params.extra.eval_split is not none %} --eval_split {{config.params.extra.eval_split}} {% endif %}
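
The template above is dense, so it may help to see its flag logic spelled out. The following is a hand-written approximation in Python, not the harness's own renderer: it emits the always-present flags, then the flags guarded by `is not none`, in template order. The function name `build_mteb_args`, the model name, the URL, and the output directory are placeholders chosen for illustration, and the environment exports at the start of the template are left out.

```python
import shlex
from typing import Any


def build_mteb_args(params: dict[str, Any], endpoint: dict[str, Any], output_dir: str) -> list[str]:
    """Approximate the rendered command as an argv list (env exports omitted)."""
    extra = params["extra"]
    ranker = extra.get("ranker") or {}

    # Flags the template always emits.
    args = [
        "mteb",
        "--encoder_name", str(endpoint["model_id"]),
        "--encoder_url", str(endpoint["url"]),
        "--task", str(params["task"]),
        "--workdir", output_dir,
        "--batch_size", str(extra["batch_size"]),
        "--async_limit", str(params["parallelism"]),
        "--max_retries", str(params["max_retries"]),
        "--request_timeout", str(params["request_timeout"]),
    ]

    # Flags guarded by `is not none`, in the order the template lists them.
    if extra.get("cache_path") is not None:
        args += ["--cache_path", str(extra["cache_path"])]
    if extra.get("args") is not None:
        args += shlex.split(str(extra["args"]))  # raw extra CLI text
    if extra.get("language") is not None:
        # Unquoted in the template, so a shell would word-split the value.
        args += ["--langs", *str(extra["language"]).split()]
    for key in ("query_prompt_template", "document_prompt_template"):
        if extra.get(key) is not None:
            args += [f"--{key}", str(extra[key])]
    if ranker.get("model_id") is not None:
        args += [
            "--ranker_name", str(ranker["model_id"]),
            "--ranker_url", str(ranker["url"]),
            "--ranker_endpoint_type", str(ranker["endpoint_type"]),
        ]

    # Emitted unconditionally, after the ranker flags.
    args += ["--truncate", str(extra["truncate"]), "--top_k", str(extra["top_k"])]

    for key in ("version_lite", "eval_split"):
        if extra.get(key) is not None:
            args += [f"--{key}", str(extra[key])]
    return args


# Default values for this task; endpoint and output directory are hypothetical.
params = {
    "max_retries": 10,
    "parallelism": 20,
    "task": "MTEB(Multilingual, v2)",
    "request_timeout": 300,
    "extra": {
        "query_prompt_template": None,
        "document_prompt_template": None,
        "ranker": {"model_id": None, "url": None, "api_key": None, "endpoint_type": "nim"},
        "top_k": 40,
        "truncate": "END",
        "batch_size": 128,
        "eval_split": "test",
        "dataset_path": None,
        "cache_path": None,
        "args": None,
        "version_lite": None,
        "language": None,
    },
}
endpoint = {"model_id": "my-embedder", "url": "http://localhost:8000/v1/embeddings"}
argv = build_mteb_args(params, endpoint, "/tmp/results")
print(shlex.join(argv))
```

With the defaults, none of the optional flags except `--eval_split` appear, because every other guarded value is `null`.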

Defaults

framework_name: mteb
pkg_name: mteb
config:
  params:
    max_retries: 10
    parallelism: 20
    task: MTEB(Multilingual, v2)
    request_timeout: 300
    extra:
      query_prompt_template: null
      document_prompt_template: null
      ranker:
        model_id: null
        url: null
        api_key: null
        endpoint_type: nim
      top_k: 40
      truncate: END
      batch_size: 128
      eval_split: test
      dataset_path: null
      cache_path: null
      args: null
      version_lite: null
      language: null
  supported_endpoint_types:
  - embedding
  type: MMTEB
target:
  api_endpoint: {}

MTEB#

MTEB

Container

Harness: mteb

Container:

nvcr.io/nvidia/eval-factory/mteb:26.01

Container Digest:

sha256:fb0ea5360bec880d4ecbfc63015d775dc3d22601e5ab17d760a992402646cbbb

Container Arch: multiarch

Task Type: MTEB

Command

{% if target.api_endpoint.api_key_name is not none %}export API_TOKEN=${{target.api_endpoint.api_key_name}} &&{% endif %} {% if config.params.extra.dataset_path is not none %} export MTEB_INTERNAL_DATASET_PATH={{config.params.extra.dataset_path}} &&{% endif %} {% if config.params.extra.ranker.api_key is not none %}export RANKER_API_TOKEN=${{config.params.extra.ranker.api_key}} &&{% endif %} mteb  --encoder_name {{target.api_endpoint.model_id}} --encoder_url {{target.api_endpoint.url}} --task "{{config.params.task}}" --workdir {{config.output_dir}} --batch_size {{config.params.extra.batch_size}} --async_limit {{config.params.parallelism}} --max_retries {{config.params.max_retries}} --request_timeout {{config.params.request_timeout}} {% if config.params.extra.cache_path is not none %} --cache_path {{config.params.extra.cache_path}}{% endif %} {% if config.params.extra.args is not none %} {{config.params.extra.args}} {% endif %} {% if config.params.extra.language is not none %} --langs {{config.params.extra.language}} {% endif %} {% if config.params.extra.query_prompt_template is not none %} --query_prompt_template "{{config.params.extra.query_prompt_template}}"{% endif %} {% if config.params.extra.document_prompt_template is not none %} --document_prompt_template "{{config.params.extra.document_prompt_template}}"{% endif %} {% if config.params.extra.ranker.model_id is not none %} --ranker_name {{config.params.extra.ranker.model_id}} --ranker_url {{config.params.extra.ranker.url}} --ranker_endpoint_type {{config.params.extra.ranker.endpoint_type}}{% endif %} --truncate {{config.params.extra.truncate}} --top_k {{config.params.extra.top_k}} {% if config.params.extra.version_lite is not none%} --version_lite {{config.params.extra.version_lite}} {% endif %} {% if config.params.extra.eval_split is not none %} --eval_split {{config.params.extra.eval_split}} {% endif %}
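
One detail of the template that is easy to miss: in `export API_TOKEN=${{target.api_endpoint.api_key_name}}`, the `{{...}}` part is replaced by the configured value and the leading `$` is left as is, so the rendered shell command reads the environment variable with that name. Read literally, the same holds for `config.params.extra.ranker.api_key`, while `dataset_path` is exported as a plain value. The sketch below reproduces only that prefix; `env_prefix` and the variable name `MY_EMBEDDING_KEY` are hypothetical.

```python
from typing import Optional


def env_prefix(
    api_key_name: Optional[str],
    dataset_path: Optional[str],
    ranker_api_key: Optional[str],
) -> str:
    """Mimic the export statements at the start of the command template."""
    parts = []
    if api_key_name is not None:
        # `${{...}}` renders to `$NAME`: the shell copies that variable's value.
        parts.append(f"export API_TOKEN=${api_key_name} &&")
    if dataset_path is not None:
        # No `$` here: the path itself is exported.
        parts.append(f"export MTEB_INTERNAL_DATASET_PATH={dataset_path} &&")
    if ranker_api_key is not None:
        parts.append(f"export RANKER_API_TOKEN=${ranker_api_key} &&")
    return " ".join(parts)


print(env_prefix("MY_EMBEDDING_KEY", "/data/beir", None))
# prints: export API_TOKEN=$MY_EMBEDDING_KEY && export MTEB_INTERNAL_DATASET_PATH=/data/beir &&
```

The practical consequence is that the secret itself never needs to appear in the configuration, only the name of the variable that holds it.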

Defaults

framework_name: mteb
pkg_name: mteb
config:
  params:
    max_retries: 10
    parallelism: 20
    task: MTEB(eng, v2)
    request_timeout: 300
    extra:
      query_prompt_template: null
      document_prompt_template: null
      ranker:
        model_id: null
        url: null
        api_key: null
        endpoint_type: nim
      top_k: 40
      truncate: END
      batch_size: 128
      eval_split: test
      dataset_path: null
      cache_path: null
      args: null
      version_lite: null
      language: null
  supported_endpoint_types:
  - embedding
  type: MTEB
target:
  api_endpoint: {}

MTEB_NL_RETRIEVAL#

MTEB_NL_RETRIEVAL

Container

Harness: mteb

Container:

nvcr.io/nvidia/eval-factory/mteb:26.01

Container Digest:

sha256:fb0ea5360bec880d4ecbfc63015d775dc3d22601e5ab17d760a992402646cbbb

Container Arch: multiarch

Task Type: MTEB_NL_RETRIEVAL

Command

{% if target.api_endpoint.api_key_name is not none %}export API_TOKEN=${{target.api_endpoint.api_key_name}} &&{% endif %} {% if config.params.extra.dataset_path is not none %} export MTEB_INTERNAL_DATASET_PATH={{config.params.extra.dataset_path}} &&{% endif %} {% if config.params.extra.ranker.api_key is not none %}export RANKER_API_TOKEN=${{config.params.extra.ranker.api_key}} &&{% endif %} mteb  --encoder_name {{target.api_endpoint.model_id}} --encoder_url {{target.api_endpoint.url}} --task "{{config.params.task}}" --workdir {{config.output_dir}} --batch_size {{config.params.extra.batch_size}} --async_limit {{config.params.parallelism}} --max_retries {{config.params.max_retries}} --request_timeout {{config.params.request_timeout}} {% if config.params.extra.cache_path is not none %} --cache_path {{config.params.extra.cache_path}}{% endif %} {% if config.params.extra.args is not none %} {{config.params.extra.args}} {% endif %} {% if config.params.extra.language is not none %} --langs {{config.params.extra.language}} {% endif %} {% if config.params.extra.query_prompt_template is not none %} --query_prompt_template "{{config.params.extra.query_prompt_template}}"{% endif %} {% if config.params.extra.document_prompt_template is not none %} --document_prompt_template "{{config.params.extra.document_prompt_template}}"{% endif %} {% if config.params.extra.ranker.model_id is not none %} --ranker_name {{config.params.extra.ranker.model_id}} --ranker_url {{config.params.extra.ranker.url}} --ranker_endpoint_type {{config.params.extra.ranker.endpoint_type}}{% endif %} --truncate {{config.params.extra.truncate}} --top_k {{config.params.extra.top_k}} {% if config.params.extra.version_lite is not none%} --version_lite {{config.params.extra.version_lite}} {% endif %} {% if config.params.extra.eval_split is not none %} --eval_split {{config.params.extra.eval_split}} {% endif %}
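
In the template, the three ranker flags are emitted together and only when `config.params.extra.ranker.model_id` is set; `ranker.url` is not checked separately. A small pre-flight check can catch a half-filled ranker block before a run starts. This is a sketch under that reading of the template; the function name and the example model name are hypothetical, and the harness may perform its own validation.

```python
from typing import Any


def ranker_config_problems(ranker: dict[str, Any]) -> list[str]:
    """Return human-readable problems with a `config.params.extra.ranker` block."""
    problems: list[str] = []
    model_id = ranker.get("model_id")
    url = ranker.get("url")

    if model_id is None:
        # The template skips every ranker flag in this case.
        if url is not None:
            problems.append("ranker.url is set but ranker.model_id is null, so no ranker flags are emitted")
        return problems

    if url is None:
        problems.append("ranker.model_id is set but ranker.url is null")
    if ranker.get("endpoint_type") is None:
        problems.append("ranker.model_id is set but ranker.endpoint_type is null")
    return problems


defaults = {"model_id": None, "url": None, "api_key": None, "endpoint_type": "nim"}
print(ranker_config_problems(defaults))
print(ranker_config_problems({**defaults, "model_id": "my-reranker"}))
```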

Defaults

framework_name: mteb
pkg_name: mteb
config:
  params:
    max_retries: 10
    parallelism: 20
    task: MTEB(nld, v1, retrieval)
    request_timeout: 300
    extra:
      query_prompt_template: null
      document_prompt_template: null
      ranker:
        model_id: null
        url: null
        api_key: null
        endpoint_type: nim
      top_k: 40
      truncate: END
      batch_size: 128
      eval_split: test
      dataset_path: null
      cache_path: null
      args: null
      version_lite: null
      language: null
  supported_endpoint_types:
  - embedding
  type: MTEB_NL_RETRIEVAL
target:
  api_endpoint: {}

MTEB_VDR#

MTEB Visual Document Retrieval benchmark

Container

Harness: mteb

Container:

nvcr.io/nvidia/eval-factory/mteb:26.01

Container Digest:

sha256:fb0ea5360bec880d4ecbfc63015d775dc3d22601e5ab17d760a992402646cbbb

Container Arch: multiarch

Task Type: MTEB_VDR
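
The tag `26.01` and the digest above identify the same image, but only the digest is immutable. For reproducible runs it can be useful to reference the image by digest, using the standard `name@sha256:...` form of a container image reference. The helper below is a sketch of that string manipulation only; `pinned_image` is a hypothetical name, and no registry is contacted.

```python
def pinned_image(image: str, digest: str) -> str:
    """Replace an optional tag with a digest: repo:tag -> repo@sha256:..."""
    if not digest.startswith("sha256:"):
        raise ValueError("expected a digest of the form sha256:<hex>")
    name = image
    # A tag separator is a colon that appears after the last slash;
    # a colon before it would belong to a registry port.
    if image.rfind(":") > image.rfind("/"):
        name = image[: image.rfind(":")]
    return f"{name}@{digest}"


ref = pinned_image(
    "nvcr.io/nvidia/eval-factory/mteb:26.01",
    "sha256:fb0ea5360bec880d4ecbfc63015d775dc3d22601e5ab17d760a992402646cbbb",
)
print(ref)
```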

Command

{% if target.api_endpoint.api_key_name is not none %}export API_TOKEN=${{target.api_endpoint.api_key_name}} &&{% endif %} {% if config.params.extra.dataset_path is not none %} export MTEB_INTERNAL_DATASET_PATH={{config.params.extra.dataset_path}} &&{% endif %} {% if config.params.extra.ranker.api_key is not none %}export RANKER_API_TOKEN=${{config.params.extra.ranker.api_key}} &&{% endif %} mteb  --encoder_name {{target.api_endpoint.model_id}} --encoder_url {{target.api_endpoint.url}} --task "{{config.params.task}}" --workdir {{config.output_dir}} --batch_size {{config.params.extra.batch_size}} --async_limit {{config.params.parallelism}} --max_retries {{config.params.max_retries}} --request_timeout {{config.params.request_timeout}} {% if config.params.extra.cache_path is not none %} --cache_path {{config.params.extra.cache_path}}{% endif %} {% if config.params.extra.args is not none %} {{config.params.extra.args}} {% endif %} {% if config.params.extra.language is not none %} --langs {{config.params.extra.language}} {% endif %} {% if config.params.extra.query_prompt_template is not none %} --query_prompt_template "{{config.params.extra.query_prompt_template}}"{% endif %} {% if config.params.extra.document_prompt_template is not none %} --document_prompt_template "{{config.params.extra.document_prompt_template}}"{% endif %} {% if config.params.extra.ranker.model_id is not none %} --ranker_name {{config.params.extra.ranker.model_id}} --ranker_url {{config.params.extra.ranker.url}} --ranker_endpoint_type {{config.params.extra.ranker.endpoint_type}}{% endif %} --truncate {{config.params.extra.truncate}} --top_k {{config.params.extra.top_k}} {% if config.params.extra.version_lite is not none%} --version_lite {{config.params.extra.version_lite}} {% endif %} {% if config.params.extra.eval_split is not none %} --eval_split {{config.params.extra.eval_split}} {% endif %}

Defaults

framework_name: mteb
pkg_name: mteb
config:
  params:
    max_retries: 10
    parallelism: 20
    task: VisualDocumentRetrieval
    request_timeout: 300
    extra:
      query_prompt_template: null
      document_prompt_template: null
      ranker:
        model_id: null
        url: null
        api_key: null
        endpoint_type: nim
      top_k: 40
      truncate: END
      batch_size: 128
      eval_split: test
      dataset_path: null
      cache_path: null
      args: null
      version_lite: null
      language: null
  supported_endpoint_types:
  - embedding
  type: MTEB_VDR
target:
  api_endpoint: {}

RTEB#

RTEB

Container

Harness: mteb

Container:

nvcr.io/nvidia/eval-factory/mteb:26.01

Container Digest:

sha256:fb0ea5360bec880d4ecbfc63015d775dc3d22601e5ab17d760a992402646cbbb

Container Arch: multiarch

Task Type: RTEB

Command

{% if target.api_endpoint.api_key_name is not none %}export API_TOKEN=${{target.api_endpoint.api_key_name}} &&{% endif %} {% if config.params.extra.dataset_path is not none %} export MTEB_INTERNAL_DATASET_PATH={{config.params.extra.dataset_path}} &&{% endif %} {% if config.params.extra.ranker.api_key is not none %}export RANKER_API_TOKEN=${{config.params.extra.ranker.api_key}} &&{% endif %} mteb  --encoder_name {{target.api_endpoint.model_id}} --encoder_url {{target.api_endpoint.url}} --task "{{config.params.task}}" --workdir {{config.output_dir}} --batch_size {{config.params.extra.batch_size}} --async_limit {{config.params.parallelism}} --max_retries {{config.params.max_retries}} --request_timeout {{config.params.request_timeout}} {% if config.params.extra.cache_path is not none %} --cache_path {{config.params.extra.cache_path}}{% endif %} {% if config.params.extra.args is not none %} {{config.params.extra.args}} {% endif %} {% if config.params.extra.language is not none %} --langs {{config.params.extra.language}} {% endif %} {% if config.params.extra.query_prompt_template is not none %} --query_prompt_template "{{config.params.extra.query_prompt_template}}"{% endif %} {% if config.params.extra.document_prompt_template is not none %} --document_prompt_template "{{config.params.extra.document_prompt_template}}"{% endif %} {% if config.params.extra.ranker.model_id is not none %} --ranker_name {{config.params.extra.ranker.model_id}} --ranker_url {{config.params.extra.ranker.url}} --ranker_endpoint_type {{config.params.extra.ranker.endpoint_type}}{% endif %} --truncate {{config.params.extra.truncate}} --top_k {{config.params.extra.top_k}} {% if config.params.extra.version_lite is not none%} --version_lite {{config.params.extra.version_lite}} {% endif %} {% if config.params.extra.eval_split is not none %} --eval_split {{config.params.extra.eval_split}} {% endif %}

Defaults

framework_name: mteb
pkg_name: mteb
config:
  params:
    max_retries: 10
    parallelism: 20
    task: RTEB(beta)
    request_timeout: 300
    extra:
      query_prompt_template: null
      document_prompt_template: null
      ranker:
        model_id: null
        url: null
        api_key: null
        endpoint_type: nim
      top_k: 40
      truncate: END
      batch_size: 128
      eval_split: test
      dataset_path: null
      cache_path: null
      args: null
      version_lite: null
      language: null
  supported_endpoint_types:
  - embedding
  type: RTEB
target:
  api_endpoint: {}
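
The `parallelism`, `max_retries`, and `request_timeout` defaults are passed to the CLI as `--async_limit`, `--max_retries`, and `--request_timeout`. This page does not document how the harness applies them internally. The sketch below shows one common pattern such settings suggest: a semaphore caps in-flight requests, each request gets a timeout, and failures are retried. The function names, the backoff schedule, and the choice to count `max_retries` as attempts after the first are all assumptions.

```python
import asyncio
from typing import Any, Awaitable, Callable


async def call_with_limits(
    make_request: Callable[[Any], Awaitable[Any]],
    payloads: list[Any],
    *,
    async_limit: int = 20,
    max_retries: int = 10,
    request_timeout: float = 300.0,
) -> list[Any]:
    """Run requests concurrently with a concurrency cap, timeout, and retries."""
    semaphore = asyncio.Semaphore(async_limit)

    async def one(payload: Any) -> Any:
        async with semaphore:  # at most `async_limit` requests in flight
            for attempt in range(max_retries + 1):
                try:
                    return await asyncio.wait_for(make_request(payload), timeout=request_timeout)
                except (asyncio.TimeoutError, ConnectionError):
                    if attempt == max_retries:
                        raise
                    # Short capped backoff; the schedule is an assumption.
                    await asyncio.sleep(min(0.01 * 2**attempt, 1.0))

    # gather() returns results in the same order as `payloads`.
    return await asyncio.gather(*(one(p) for p in payloads))


# Stand-in for an embedding request: fails once, then succeeds.
calls = {"n": 0}


async def flaky_double(x: int) -> int:
    calls["n"] += 1
    if calls["n"] == 1:
        raise ConnectionError("simulated transient failure")
    return x * 2


results = asyncio.run(
    call_with_limits(flaky_double, [1, 2, 3], async_limit=2, max_retries=3, request_timeout=1.0)
)
print(results)
```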

ViDoReV1#

ViDoReV1

Container

Harness: mteb

Container:

nvcr.io/nvidia/eval-factory/mteb:26.01

Container Digest:

sha256:fb0ea5360bec880d4ecbfc63015d775dc3d22601e5ab17d760a992402646cbbb

Container Arch: multiarch

Task Type: ViDoReV1

Command

{% if target.api_endpoint.api_key_name is not none %}export API_TOKEN=${{target.api_endpoint.api_key_name}} &&{% endif %} {% if config.params.extra.dataset_path is not none %} export MTEB_INTERNAL_DATASET_PATH={{config.params.extra.dataset_path}} &&{% endif %} {% if config.params.extra.ranker.api_key is not none %}export RANKER_API_TOKEN=${{config.params.extra.ranker.api_key}} &&{% endif %} mteb  --encoder_name {{target.api_endpoint.model_id}} --encoder_url {{target.api_endpoint.url}} --task "{{config.params.task}}" --workdir {{config.output_dir}} --batch_size {{config.params.extra.batch_size}} --async_limit {{config.params.parallelism}} --max_retries {{config.params.max_retries}} --request_timeout {{config.params.request_timeout}} {% if config.params.extra.cache_path is not none %} --cache_path {{config.params.extra.cache_path}}{% endif %} {% if config.params.extra.args is not none %} {{config.params.extra.args}} {% endif %} {% if config.params.extra.language is not none %} --langs {{config.params.extra.language}} {% endif %} {% if config.params.extra.query_prompt_template is not none %} --query_prompt_template "{{config.params.extra.query_prompt_template}}"{% endif %} {% if config.params.extra.document_prompt_template is not none %} --document_prompt_template "{{config.params.extra.document_prompt_template}}"{% endif %} {% if config.params.extra.ranker.model_id is not none %} --ranker_name {{config.params.extra.ranker.model_id}} --ranker_url {{config.params.extra.ranker.url}} --ranker_endpoint_type {{config.params.extra.ranker.endpoint_type}}{% endif %} --truncate {{config.params.extra.truncate}} --top_k {{config.params.extra.top_k}} {% if config.params.extra.version_lite is not none%} --version_lite {{config.params.extra.version_lite}} {% endif %} {% if config.params.extra.eval_split is not none %} --eval_split {{config.params.extra.eval_split}} {% endif %}

Defaults

framework_name: mteb
pkg_name: mteb
config:
  params:
    max_retries: 10
    parallelism: 20
    task: ViDoRe(v1)
    request_timeout: 300
    extra:
      query_prompt_template: null
      document_prompt_template: null
      ranker:
        model_id: null
        url: null
        api_key: null
        endpoint_type: nim
      top_k: 40
      truncate: END
      batch_size: 128
      eval_split: test
      dataset_path: null
      cache_path: null
      args: null
      version_lite: null
      language: null
  supported_endpoint_types:
  - embedding
  type: ViDoReV1
target:
  api_endpoint: {}
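
The default `truncate: END` is passed through as `--truncate END`. This page does not define the value. Assuming it follows the usual convention of embedding endpoints, where `END` drops tokens from the end of an over-long input, `START` drops them from the start, and `NONE` rejects the input, the behaviour can be sketched on a plain token list. Only `END` appears on this page; `START`, `NONE`, and the function itself are assumptions made for illustration.

```python
from typing import Sequence


def truncate_tokens(tokens: Sequence[int], max_len: int, mode: str = "END") -> list[int]:
    """Shorten `tokens` to at most `max_len` items according to `mode`."""
    if max_len < 0:
        raise ValueError("max_len must be non-negative")
    if len(tokens) <= max_len:
        return list(tokens)
    if mode == "END":
        return list(tokens[:max_len])  # keep the beginning, drop the end
    if mode == "START":
        return list(tokens[len(tokens) - max_len:])  # keep the end, drop the start
    if mode == "NONE":
        raise ValueError(f"input has {len(tokens)} tokens, limit is {max_len}")
    raise ValueError(f"unknown truncate mode: {mode!r}")


print(truncate_tokens([1, 2, 3, 4, 5], 3, "END"))    # prints [1, 2, 3]
print(truncate_tokens([1, 2, 3, 4, 5], 3, "START"))  # prints [3, 4, 5]
```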

ViDoReV2#

ViDoReV2

Container

Harness: mteb

Container:

nvcr.io/nvidia/eval-factory/mteb:26.01

Container Digest:

sha256:fb0ea5360bec880d4ecbfc63015d775dc3d22601e5ab17d760a992402646cbbb

Container Arch: multiarch

Task Type: ViDoReV2

Command

{% if target.api_endpoint.api_key_name is not none %}export API_TOKEN=${{target.api_endpoint.api_key_name}} &&{% endif %} {% if config.params.extra.dataset_path is not none %} export MTEB_INTERNAL_DATASET_PATH={{config.params.extra.dataset_path}} &&{% endif %} {% if config.params.extra.ranker.api_key is not none %}export RANKER_API_TOKEN=${{config.params.extra.ranker.api_key}} &&{% endif %} mteb  --encoder_name {{target.api_endpoint.model_id}} --encoder_url {{target.api_endpoint.url}} --task "{{config.params.task}}" --workdir {{config.output_dir}} --batch_size {{config.params.extra.batch_size}} --async_limit {{config.params.parallelism}} --max_retries {{config.params.max_retries}} --request_timeout {{config.params.request_timeout}} {% if config.params.extra.cache_path is not none %} --cache_path {{config.params.extra.cache_path}}{% endif %} {% if config.params.extra.args is not none %} {{config.params.extra.args}} {% endif %} {% if config.params.extra.language is not none %} --langs {{config.params.extra.language}} {% endif %} {% if config.params.extra.query_prompt_template is not none %} --query_prompt_template "{{config.params.extra.query_prompt_template}}"{% endif %} {% if config.params.extra.document_prompt_template is not none %} --document_prompt_template "{{config.params.extra.document_prompt_template}}"{% endif %} {% if config.params.extra.ranker.model_id is not none %} --ranker_name {{config.params.extra.ranker.model_id}} --ranker_url {{config.params.extra.ranker.url}} --ranker_endpoint_type {{config.params.extra.ranker.endpoint_type}}{% endif %} --truncate {{config.params.extra.truncate}} --top_k {{config.params.extra.top_k}} {% if config.params.extra.version_lite is not none%} --version_lite {{config.params.extra.version_lite}} {% endif %} {% if config.params.extra.eval_split is not none %} --eval_split {{config.params.extra.eval_split}} {% endif %}

Defaults

framework_name: mteb
pkg_name: mteb
config:
  params:
    max_retries: 10
    parallelism: 20
    task: ViDoRe(v2)
    request_timeout: 300
    extra:
      query_prompt_template: null
      document_prompt_template: null
      ranker:
        model_id: null
        url: null
        api_key: null
        endpoint_type: nim
      top_k: 40
      truncate: END
      batch_size: 128
      eval_split: test
      dataset_path: null
      cache_path: null
      args: null
      version_lite: null
      language: null
  supported_endpoint_types:
  - embedding
  type: ViDoReV2
target:
  api_endpoint: {}
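
The default `top_k: 40` is passed as `--top_k 40`, and the command can optionally name a ranker. A plausible reading, not confirmed on this page, is that `top_k` is the number of first-stage candidates kept per query, which a ranker could then reorder. The sketch below shows only that first stage on toy vectors; dot-product scoring, the function name, and the data are assumptions.

```python
import heapq


def top_k_documents(
    query_vec: list[float],
    doc_vecs: dict[str, list[float]],
    k: int = 40,
) -> list[tuple[str, float]]:
    """Return the k highest-scoring (doc_id, score) pairs by dot product."""
    scored = (
        (doc_id, sum(q * d for q, d in zip(query_vec, vec)))
        for doc_id, vec in doc_vecs.items()
    )
    # nlargest returns results sorted from highest to lowest score.
    return heapq.nlargest(k, scored, key=lambda pair: pair[1])


# Toy embeddings, made up for illustration.
docs = {"a": [1.0, 0.0], "b": [0.0, 1.0], "c": [0.7, 0.7]}
candidates = top_k_documents([1.0, 0.0], docs, k=2)
print([doc_id for doc_id, _ in candidates])  # prints ['a', 'c']
```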

ViDoReV3#

ViDoReV3

Container

Harness: mteb

Container:

nvcr.io/nvidia/eval-factory/mteb:26.01

Container Digest:

sha256:fb0ea5360bec880d4ecbfc63015d775dc3d22601e5ab17d760a992402646cbbb

Container Arch: multiarch

Task Type: ViDoReV3

Command

{% if target.api_endpoint.api_key_name is not none %}export API_TOKEN=${{target.api_endpoint.api_key_name}} &&{% endif %} {% if config.params.extra.dataset_path is not none %} export MTEB_INTERNAL_DATASET_PATH={{config.params.extra.dataset_path}} &&{% endif %} {% if config.params.extra.ranker.api_key is not none %}export RANKER_API_TOKEN=${{config.params.extra.ranker.api_key}} &&{% endif %} mteb  --encoder_name {{target.api_endpoint.model_id}} --encoder_url {{target.api_endpoint.url}} --task "{{config.params.task}}" --workdir {{config.output_dir}} --batch_size {{config.params.extra.batch_size}} --async_limit {{config.params.parallelism}} --max_retries {{config.params.max_retries}} --request_timeout {{config.params.request_timeout}} {% if config.params.extra.cache_path is not none %} --cache_path {{config.params.extra.cache_path}}{% endif %} {% if config.params.extra.args is not none %} {{config.params.extra.args}} {% endif %} {% if config.params.extra.language is not none %} --langs {{config.params.extra.language}} {% endif %} {% if config.params.extra.query_prompt_template is not none %} --query_prompt_template "{{config.params.extra.query_prompt_template}}"{% endif %} {% if config.params.extra.document_prompt_template is not none %} --document_prompt_template "{{config.params.extra.document_prompt_template}}"{% endif %} {% if config.params.extra.ranker.model_id is not none %} --ranker_name {{config.params.extra.ranker.model_id}} --ranker_url {{config.params.extra.ranker.url}} --ranker_endpoint_type {{config.params.extra.ranker.endpoint_type}}{% endif %} --truncate {{config.params.extra.truncate}} --top_k {{config.params.extra.top_k}} {% if config.params.extra.version_lite is not none%} --version_lite {{config.params.extra.version_lite}} {% endif %} {% if config.params.extra.eval_split is not none %} --eval_split {{config.params.extra.eval_split}} {% endif %}

Defaults

framework_name: mteb
pkg_name: mteb
config:
  params:
    max_retries: 10
    parallelism: 20
    task: ViDoRe(v3)
    request_timeout: 300
    extra:
      query_prompt_template: null
      document_prompt_template: null
      ranker:
        model_id: null
        url: null
        api_key: null
        endpoint_type: nim
      top_k: 40
      truncate: END
      batch_size: 128
      eval_split: test
      dataset_path: null
      cache_path: null
      args: null
      version_lite: null
      language: null
  supported_endpoint_types:
  - embedding
  type: ViDoReV3
target:
  api_endpoint: {}

ViDoReV3_Text#

ViDoReV3 Text (text_image markdown only)

Container

Harness: mteb

Container:

nvcr.io/nvidia/eval-factory/mteb:26.01

Container Digest:

sha256:fb0ea5360bec880d4ecbfc63015d775dc3d22601e5ab17d760a992402646cbbb

Container Arch: multiarch

Task Type: ViDoReV3_Text

Command

{% if target.api_endpoint.api_key_name is not none %}export API_TOKEN=${{target.api_endpoint.api_key_name}} &&{% endif %} {% if config.params.extra.dataset_path is not none %} export MTEB_INTERNAL_DATASET_PATH={{config.params.extra.dataset_path}} &&{% endif %} {% if config.params.extra.ranker.api_key is not none %}export RANKER_API_TOKEN=${{config.params.extra.ranker.api_key}} &&{% endif %} mteb  --encoder_name {{target.api_endpoint.model_id}} --encoder_url {{target.api_endpoint.url}} --task "{{config.params.task}}" --workdir {{config.output_dir}} --batch_size {{config.params.extra.batch_size}} --async_limit {{config.params.parallelism}} --max_retries {{config.params.max_retries}} --request_timeout {{config.params.request_timeout}} {% if config.params.extra.cache_path is not none %} --cache_path {{config.params.extra.cache_path}}{% endif %} {% if config.params.extra.args is not none %} {{config.params.extra.args}} {% endif %} {% if config.params.extra.language is not none %} --langs {{config.params.extra.language}} {% endif %} {% if config.params.extra.query_prompt_template is not none %} --query_prompt_template "{{config.params.extra.query_prompt_template}}"{% endif %} {% if config.params.extra.document_prompt_template is not none %} --document_prompt_template "{{config.params.extra.document_prompt_template}}"{% endif %} {% if config.params.extra.ranker.model_id is not none %} --ranker_name {{config.params.extra.ranker.model_id}} --ranker_url {{config.params.extra.ranker.url}} --ranker_endpoint_type {{config.params.extra.ranker.endpoint_type}}{% endif %} --truncate {{config.params.extra.truncate}} --top_k {{config.params.extra.top_k}} {% if config.params.extra.version_lite is not none%} --version_lite {{config.params.extra.version_lite}} {% endif %} {% if config.params.extra.eval_split is not none %} --eval_split {{config.params.extra.eval_split}} {% endif %}

Defaults

framework_name: mteb
pkg_name: mteb
config:
  params:
    max_retries: 10
    parallelism: 20
    task: ViDoRe(v3, Text)
    request_timeout: 300
    extra:
      query_prompt_template: null
      document_prompt_template: null
      ranker:
        model_id: null
        url: null
        api_key: null
        endpoint_type: nim
      top_k: 40
      truncate: END
      batch_size: 128
      eval_split: test
      dataset_path: null
      cache_path: null
      args: null
      version_lite: null
      language: null
  supported_endpoint_types:
  - embedding
  type: ViDoReV3_Text
target:
  api_endpoint: {}

ViDoReV3_Text_Image#

ViDoReV3 Text+Image (text_image markdown + images)

Container

Harness: mteb

Container:

nvcr.io/nvidia/eval-factory/mteb:26.01

Container Digest:

sha256:fb0ea5360bec880d4ecbfc63015d775dc3d22601e5ab17d760a992402646cbbb

Container Arch: multiarch

Task Type: ViDoReV3_Text_Image

Command

{% if target.api_endpoint.api_key_name is not none %}export API_TOKEN=${{target.api_endpoint.api_key_name}} &&{% endif %} {% if config.params.extra.dataset_path is not none %} export MTEB_INTERNAL_DATASET_PATH={{config.params.extra.dataset_path}} &&{% endif %} {% if config.params.extra.ranker.api_key is not none %}export RANKER_API_TOKEN=${{config.params.extra.ranker.api_key}} &&{% endif %} mteb  --encoder_name {{target.api_endpoint.model_id}} --encoder_url {{target.api_endpoint.url}} --task "{{config.params.task}}" --workdir {{config.output_dir}} --batch_size {{config.params.extra.batch_size}} --async_limit {{config.params.parallelism}} --max_retries {{config.params.max_retries}} --request_timeout {{config.params.request_timeout}} {% if config.params.extra.cache_path is not none %} --cache_path {{config.params.extra.cache_path}}{% endif %} {% if config.params.extra.args is not none %} {{config.params.extra.args}} {% endif %} {% if config.params.extra.language is not none %} --langs {{config.params.extra.language}} {% endif %} {% if config.params.extra.query_prompt_template is not none %} --query_prompt_template "{{config.params.extra.query_prompt_template}}"{% endif %} {% if config.params.extra.document_prompt_template is not none %} --document_prompt_template "{{config.params.extra.document_prompt_template}}"{% endif %} {% if config.params.extra.ranker.model_id is not none %} --ranker_name {{config.params.extra.ranker.model_id}} --ranker_url {{config.params.extra.ranker.url}} --ranker_endpoint_type {{config.params.extra.ranker.endpoint_type}}{% endif %} --truncate {{config.params.extra.truncate}} --top_k {{config.params.extra.top_k}} {% if config.params.extra.version_lite is not none%} --version_lite {{config.params.extra.version_lite}} {% endif %} {% if config.params.extra.eval_split is not none %} --eval_split {{config.params.extra.eval_split}} {% endif %}

Defaults

framework_name: mteb
pkg_name: mteb
config:
  params:
    max_retries: 10
    parallelism: 20
    task: ViDoRe(v3, Text+Image)
    request_timeout: 300
    extra:
      query_prompt_template: null
      document_prompt_template: null
      ranker:
        model_id: null
        url: null
        api_key: null
        endpoint_type: nim
      top_k: 40
      truncate: END
      batch_size: 128
      eval_split: test
      dataset_path: null
      cache_path: null
      args: null
      version_lite: null
      language: null
  supported_endpoint_types:
  - embedding
  type: ViDoReV3_Text_Image
target:
  api_endpoint: {}

custom_beir_task#

Custom BEIR-formatted text retrieval benchmark

Container

Harness: mteb

Container:

nvcr.io/nvidia/eval-factory/mteb:26.01

Container Digest:

sha256:fb0ea5360bec880d4ecbfc63015d775dc3d22601e5ab17d760a992402646cbbb

Container Arch: multiarch

Task Type: custom_beir_task

Command

{% if target.api_endpoint.api_key_name is not none %}export API_TOKEN=${{target.api_endpoint.api_key_name}} &&{% endif %} {% if config.params.extra.dataset_path is not none %} export MTEB_INTERNAL_DATASET_PATH={{config.params.extra.dataset_path}} &&{% endif %} {% if config.params.extra.ranker.api_key is not none %}export RANKER_API_TOKEN=${{config.params.extra.ranker.api_key}} &&{% endif %} mteb  --encoder_name {{target.api_endpoint.model_id}} --encoder_url {{target.api_endpoint.url}} --task "{{config.params.task}}" --workdir {{config.output_dir}} --batch_size {{config.params.extra.batch_size}} --async_limit {{config.params.parallelism}} --max_retries {{config.params.max_retries}} --request_timeout {{config.params.request_timeout}} {% if config.params.extra.cache_path is not none %} --cache_path {{config.params.extra.cache_path}}{% endif %} {% if config.params.extra.args is not none %} {{config.params.extra.args}} {% endif %} {% if config.params.extra.language is not none %} --langs {{config.params.extra.language}} {% endif %} {% if config.params.extra.query_prompt_template is not none %} --query_prompt_template "{{config.params.extra.query_prompt_template}}"{% endif %} {% if config.params.extra.document_prompt_template is not none %} --document_prompt_template "{{config.params.extra.document_prompt_template}}"{% endif %} {% if config.params.extra.ranker.model_id is not none %} --ranker_name {{config.params.extra.ranker.model_id}} --ranker_url {{config.params.extra.ranker.url}} --ranker_endpoint_type {{config.params.extra.ranker.endpoint_type}}{% endif %} --truncate {{config.params.extra.truncate}} --top_k {{config.params.extra.top_k}} {% if config.params.extra.version_lite is not none%} --version_lite {{config.params.extra.version_lite}} {% endif %} {% if config.params.extra.eval_split is not none %} --eval_split {{config.params.extra.eval_split}} {% endif %}

Defaults

framework_name: mteb
pkg_name: mteb
config:
  params:
    max_retries: 10
    parallelism: 20
    task: custom_beir_task
    request_timeout: 300
    extra:
      query_prompt_template: null
      document_prompt_template: null
      ranker:
        model_id: null
        url: null
        api_key: null
        endpoint_type: nim
      top_k: 40
      truncate: END
      batch_size: 128
      eval_split: test
      dataset_path: null
      cache_path: null
      args: null
      version_lite: null
      language: null
  supported_endpoint_types:
  - embedding
  type: custom_beir_task
target:
  api_endpoint: {}
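
The Command template above starts with up to three conditional `export ... &&` clauses. The following is a minimal Python sketch of that logic, not the harness's actual rendering code. The function name `env_prefix` and the sample values are hypothetical, and whitespace is normalised compared with the raw template.

```python
def env_prefix(api_key_name=None, dataset_path=None, ranker_api_key=None):
    """Rebuild the optional `export ... &&` clauses that precede `mteb`."""
    parts = []
    if api_key_name is not None:
        # `${{ ... }}` in the template is a literal `$` plus the rendered value,
        # so the shell reads the token from the environment variable of that name.
        parts.append(f"export API_TOKEN=${api_key_name} &&")
    if dataset_path is not None:
        # Rendered without `$`: the path is used literally.
        parts.append(f"export MTEB_INTERNAL_DATASET_PATH={dataset_path} &&")
    if ranker_api_key is not None:
        # Same `$` convention as API_TOKEN.
        parts.append(f"export RANKER_API_TOKEN=${ranker_api_key} &&")
    return " ".join(parts)


# Hypothetical values for illustration only.
print(env_prefix(api_key_name="MY_API_KEY", dataset_path="/data/my_beir_dataset"))
```

With all three settings left at `null`, as in the defaults, no `export` clause is emitted and the command begins directly with `mteb`.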

fiqa#

Financial Opinion Mining and Question Answering (FiQA). The task defaults to the name FiQA2018.

Container

Harness: mteb

Container:

nvcr.io/nvidia/eval-factory/mteb:26.01

Container Digest:

sha256:fb0ea5360bec880d4ecbfc63015d775dc3d22601e5ab17d760a992402646cbbb

Container Arch: multiarch

Task Type: fiqa

Command

{% if target.api_endpoint.api_key_name is not none %}export API_TOKEN=${{target.api_endpoint.api_key_name}} &&{% endif %} {% if config.params.extra.dataset_path is not none %} export MTEB_INTERNAL_DATASET_PATH={{config.params.extra.dataset_path}} &&{% endif %} {% if config.params.extra.ranker.api_key is not none %}export RANKER_API_TOKEN=${{config.params.extra.ranker.api_key}} &&{% endif %} mteb  --encoder_name {{target.api_endpoint.model_id}} --encoder_url {{target.api_endpoint.url}} --task "{{config.params.task}}" --workdir {{config.output_dir}} --batch_size {{config.params.extra.batch_size}} --async_limit {{config.params.parallelism}} --max_retries {{config.params.max_retries}} --request_timeout {{config.params.request_timeout}} {% if config.params.extra.cache_path is not none %} --cache_path {{config.params.extra.cache_path}}{% endif %} {% if config.params.extra.args is not none %} {{config.params.extra.args}} {% endif %} {% if config.params.extra.language is not none %} --langs {{config.params.extra.language}} {% endif %} {% if config.params.extra.query_prompt_template is not none %} --query_prompt_template "{{config.params.extra.query_prompt_template}}"{% endif %} {% if config.params.extra.document_prompt_template is not none %} --document_prompt_template "{{config.params.extra.document_prompt_template}}"{% endif %} {% if config.params.extra.ranker.model_id is not none %} --ranker_name {{config.params.extra.ranker.model_id}} --ranker_url {{config.params.extra.ranker.url}} --ranker_endpoint_type {{config.params.extra.ranker.endpoint_type}}{% endif %} --truncate {{config.params.extra.truncate}} --top_k {{config.params.extra.top_k}} {% if config.params.extra.version_lite is not none%} --version_lite {{config.params.extra.version_lite}} {% endif %} {% if config.params.extra.eval_split is not none %} --eval_split {{config.params.extra.eval_split}} {% endif %}

Defaults

framework_name: mteb
pkg_name: mteb
config:
  params:
    max_retries: 10
    parallelism: 20
    task: FiQA2018
    request_timeout: 300
    extra:
      query_prompt_template: null
      document_prompt_template: null
      ranker:
        model_id: null
        url: null
        api_key: null
        endpoint_type: nim
      top_k: 40
      truncate: END
      batch_size: 128
      eval_split: test
      dataset_path: null
      cache_path: null
      args: null
      version_lite: null
      language: null
  supported_endpoint_types:
  - embedding
  type: fiqa
target:
  api_endpoint: {}
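
To see which flags the fiqa defaults actually produce, the sketch below mirrors part of the Command template in plain Python. It is an illustration under assumptions, not the harness's implementation: `build_args`, the model name, the URL, and the output directory are hypothetical, and only a subset of the optional flags is covered.

```python
import shlex

# Subset of the fiqa defaults listed above.
FIQA_PARAMS = {
    "max_retries": 10,
    "parallelism": 20,
    "task": "FiQA2018",
    "request_timeout": 300,
    "extra": {
        "top_k": 40,
        "truncate": "END",
        "batch_size": 128,
        "eval_split": "test",
        "cache_path": None,
        "language": None,
        "version_lite": None,
    },
}


def build_args(params, model_id, url, output_dir):
    """Return the argument list in the same order as the template."""
    extra = params["extra"]
    args = [
        "mteb",
        "--encoder_name", model_id,
        "--encoder_url", url,
        "--task", params["task"],
        "--workdir", output_dir,
        "--batch_size", str(extra["batch_size"]),
        "--async_limit", str(params["parallelism"]),
        "--max_retries", str(params["max_retries"]),
        "--request_timeout", str(params["request_timeout"]),
    ]
    # Optional flags appear only when the value is not null.
    if extra["cache_path"] is not None:
        args += ["--cache_path", str(extra["cache_path"])]
    if extra["language"] is not None:
        args += ["--langs", str(extra["language"])]
    # These two flags are unconditional in the template.
    args += ["--truncate", str(extra["truncate"]), "--top_k", str(extra["top_k"])]
    if extra["version_lite"] is not None:
        args += ["--version_lite", str(extra["version_lite"])]
    if extra["eval_split"] is not None:
        args += ["--eval_split", str(extra["eval_split"])]
    return args


# Hypothetical endpoint values for illustration only.
args = build_args(
    FIQA_PARAMS,
    model_id="my-embedding-model",
    url="http://localhost:8000/v1/embeddings",
    output_dir="/tmp/results",
)
print(shlex.join(args))
```

Note that `parallelism` maps to `--async_limit`, and `language` maps to `--langs`; the other flag names match their config keys.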

hotpotqa#

HotpotQA is a question answering dataset featuring natural, multi-hop questions, with strong supervision for supporting facts to enable more explainable question answering systems.

Container

Harness: mteb

Container:

nvcr.io/nvidia/eval-factory/mteb:26.01

Container Digest:

sha256:fb0ea5360bec880d4ecbfc63015d775dc3d22601e5ab17d760a992402646cbbb

Container Arch: multiarch

Task Type: hotpotqa

Command

{% if target.api_endpoint.api_key_name is not none %}export API_TOKEN=${{target.api_endpoint.api_key_name}} &&{% endif %} {% if config.params.extra.dataset_path is not none %} export MTEB_INTERNAL_DATASET_PATH={{config.params.extra.dataset_path}} &&{% endif %} {% if config.params.extra.ranker.api_key is not none %}export RANKER_API_TOKEN=${{config.params.extra.ranker.api_key}} &&{% endif %} mteb  --encoder_name {{target.api_endpoint.model_id}} --encoder_url {{target.api_endpoint.url}} --task "{{config.params.task}}" --workdir {{config.output_dir}} --batch_size {{config.params.extra.batch_size}} --async_limit {{config.params.parallelism}} --max_retries {{config.params.max_retries}} --request_timeout {{config.params.request_timeout}} {% if config.params.extra.cache_path is not none %} --cache_path {{config.params.extra.cache_path}}{% endif %} {% if config.params.extra.args is not none %} {{config.params.extra.args}} {% endif %} {% if config.params.extra.language is not none %} --langs {{config.params.extra.language}} {% endif %} {% if config.params.extra.query_prompt_template is not none %} --query_prompt_template "{{config.params.extra.query_prompt_template}}"{% endif %} {% if config.params.extra.document_prompt_template is not none %} --document_prompt_template "{{config.params.extra.document_prompt_template}}"{% endif %} {% if config.params.extra.ranker.model_id is not none %} --ranker_name {{config.params.extra.ranker.model_id}} --ranker_url {{config.params.extra.ranker.url}} --ranker_endpoint_type {{config.params.extra.ranker.endpoint_type}}{% endif %} --truncate {{config.params.extra.truncate}} --top_k {{config.params.extra.top_k}} {% if config.params.extra.version_lite is not none%} --version_lite {{config.params.extra.version_lite}} {% endif %} {% if config.params.extra.eval_split is not none %} --eval_split {{config.params.extra.eval_split}} {% endif %}

Defaults

framework_name: mteb
pkg_name: mteb
config:
  params:
    max_retries: 10
    parallelism: 20
    task: HotpotQA
    request_timeout: 300
    extra:
      query_prompt_template: null
      document_prompt_template: null
      ranker:
        model_id: null
        url: null
        api_key: null
        endpoint_type: nim
      top_k: 40
      truncate: END
      batch_size: 128
      eval_split: test
      dataset_path: null
      cache_path: null
      args: null
      version_lite: null
      language: null
  supported_endpoint_types:
  - embedding
  type: hotpotqa
target:
  api_endpoint: {}
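
The template emits the three ranker flags together and checks only `ranker.model_id` before doing so. A small pre-flight check can catch a half-configured ranker before the command runs. The sketch below is one way to do that; `ranker_args` and the sample values are hypothetical, and the validation is an addition that the template itself does not perform.

```python
def ranker_args(ranker):
    """Return the ranker flags, or an empty list when no ranker is configured."""
    if ranker.get("model_id") is None:
        # Matches the template: no model_id means no ranker flags at all.
        return []
    # Extra safety check (not part of the template): the other fields are needed too.
    missing = [key for key in ("url", "endpoint_type") if ranker.get(key) is None]
    if missing:
        raise ValueError(f"ranker.model_id is set but these fields are null: {missing}")
    return [
        "--ranker_name", ranker["model_id"],
        "--ranker_url", ranker["url"],
        "--ranker_endpoint_type", ranker["endpoint_type"],
    ]


# Defaults from this page: the ranker is disabled.
default_ranker = {"model_id": None, "url": None, "api_key": None, "endpoint_type": "nim"}
print(ranker_args(default_ranker))

# Hypothetical ranker settings for illustration only.
configured = dict(default_ranker, model_id="my-ranker", url="http://localhost:8001/v1/ranking")
print(ranker_args(configured))
```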

miracl#

MIRACL (Multilingual Information Retrieval Across a Continuum of Languages) is a multilingual retrieval dataset that focuses on search across 18 different languages.

Container

Harness: mteb

Container:

nvcr.io/nvidia/eval-factory/mteb:26.01

Container Digest:

sha256:fb0ea5360bec880d4ecbfc63015d775dc3d22601e5ab17d760a992402646cbbb

Container Arch: multiarch

Task Type: miracl

Command

{% if target.api_endpoint.api_key_name is not none %}export API_TOKEN=${{target.api_endpoint.api_key_name}} &&{% endif %} {% if config.params.extra.dataset_path is not none %} export MTEB_INTERNAL_DATASET_PATH={{config.params.extra.dataset_path}} &&{% endif %} {% if config.params.extra.ranker.api_key is not none %}export RANKER_API_TOKEN=${{config.params.extra.ranker.api_key}} &&{% endif %} mteb  --encoder_name {{target.api_endpoint.model_id}} --encoder_url {{target.api_endpoint.url}} --task "{{config.params.task}}" --workdir {{config.output_dir}} --batch_size {{config.params.extra.batch_size}} --async_limit {{config.params.parallelism}} --max_retries {{config.params.max_retries}} --request_timeout {{config.params.request_timeout}} {% if config.params.extra.cache_path is not none %} --cache_path {{config.params.extra.cache_path}}{% endif %} {% if config.params.extra.args is not none %} {{config.params.extra.args}} {% endif %} {% if config.params.extra.language is not none %} --langs {{config.params.extra.language}} {% endif %} {% if config.params.extra.query_prompt_template is not none %} --query_prompt_template "{{config.params.extra.query_prompt_template}}"{% endif %} {% if config.params.extra.document_prompt_template is not none %} --document_prompt_template "{{config.params.extra.document_prompt_template}}"{% endif %} {% if config.params.extra.ranker.model_id is not none %} --ranker_name {{config.params.extra.ranker.model_id}} --ranker_url {{config.params.extra.ranker.url}} --ranker_endpoint_type {{config.params.extra.ranker.endpoint_type}}{% endif %} --truncate {{config.params.extra.truncate}} --top_k {{config.params.extra.top_k}} {% if config.params.extra.version_lite is not none%} --version_lite {{config.params.extra.version_lite}} {% endif %} {% if config.params.extra.eval_split is not none %} --eval_split {{config.params.extra.eval_split}} {% endif %}

Defaults

framework_name: mteb
pkg_name: mteb
config:
  params:
    max_retries: 10
    parallelism: 20
    task: MIRACLRetrieval
    request_timeout: 300
    extra:
      query_prompt_template: null
      document_prompt_template: null
      ranker:
        model_id: null
        url: null
        api_key: null
        endpoint_type: nim
      top_k: 40
      truncate: END
      batch_size: 128
      eval_split: test
      dataset_path: null
      cache_path: null
      args: null
      version_lite: null
      language: null
  supported_endpoint_types:
  - embedding
  type: miracl
target:
  api_endpoint: {}
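
When only a few settings need to change, for example `language` (which the template passes as `--langs`), it helps to think of user settings as a sparse overlay on the defaults. The sketch below assumes overrides are merged key by key onto the defaults; this page does not describe the merge behaviour, so treat that as an assumption. `deep_merge` and the override values are hypothetical.

```python
import copy


def deep_merge(defaults, overrides):
    """Return a new dict with overrides applied recursively on top of defaults."""
    merged = copy.deepcopy(defaults)
    for key, value in overrides.items():
        if isinstance(value, dict) and isinstance(merged.get(key), dict):
            # Descend into nested sections such as params.extra.
            merged[key] = deep_merge(merged[key], value)
        else:
            merged[key] = copy.deepcopy(value)
    return merged


# Subset of the miracl defaults listed above.
MIRACL_CONFIG = {
    "params": {
        "parallelism": 20,
        "task": "MIRACLRetrieval",
        "extra": {"batch_size": 128, "eval_split": "test", "language": None},
    },
    "type": "miracl",
}

# Hypothetical overrides for illustration only.
overrides = {"params": {"parallelism": 4, "extra": {"language": "en"}}}

merged = deep_merge(MIRACL_CONFIG, overrides)
print(merged["params"]["extra"])
```

Keys that are not overridden, such as `task` and `batch_size`, keep their default values, and the original defaults dictionary is left unmodified.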

miracl_lite#

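MIRACL (Multilingual Information Retrieval Across a Continuum of Languages) is a multilingual retrieval dataset that focuses on search across 18 different languages. The miracl_lite task uses the same MIRACLRetrieval task name as miracl, with version_lite set to true by default.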

Container

Harness: mteb

Container:

nvcr.io/nvidia/eval-factory/mteb:26.01

Container Digest:

sha256:fb0ea5360bec880d4ecbfc63015d775dc3d22601e5ab17d760a992402646cbbb

Container Arch: multiarch

Task Type: miracl_lite

Command

{% if target.api_endpoint.api_key_name is not none %}export API_TOKEN=${{target.api_endpoint.api_key_name}} &&{% endif %} {% if config.params.extra.dataset_path is not none %} export MTEB_INTERNAL_DATASET_PATH={{config.params.extra.dataset_path}} &&{% endif %} {% if config.params.extra.ranker.api_key is not none %}export RANKER_API_TOKEN=${{config.params.extra.ranker.api_key}} &&{% endif %} mteb  --encoder_name {{target.api_endpoint.model_id}} --encoder_url {{target.api_endpoint.url}} --task "{{config.params.task}}" --workdir {{config.output_dir}} --batch_size {{config.params.extra.batch_size}} --async_limit {{config.params.parallelism}} --max_retries {{config.params.max_retries}} --request_timeout {{config.params.request_timeout}} {% if config.params.extra.cache_path is not none %} --cache_path {{config.params.extra.cache_path}}{% endif %} {% if config.params.extra.args is not none %} {{config.params.extra.args}} {% endif %} {% if config.params.extra.language is not none %} --langs {{config.params.extra.language}} {% endif %} {% if config.params.extra.query_prompt_template is not none %} --query_prompt_template "{{config.params.extra.query_prompt_template}}"{% endif %} {% if config.params.extra.document_prompt_template is not none %} --document_prompt_template "{{config.params.extra.document_prompt_template}}"{% endif %} {% if config.params.extra.ranker.model_id is not none %} --ranker_name {{config.params.extra.ranker.model_id}} --ranker_url {{config.params.extra.ranker.url}} --ranker_endpoint_type {{config.params.extra.ranker.endpoint_type}}{% endif %} --truncate {{config.params.extra.truncate}} --top_k {{config.params.extra.top_k}} {% if config.params.extra.version_lite is not none%} --version_lite {{config.params.extra.version_lite}} {% endif %} {% if config.params.extra.eval_split is not none %} --eval_split {{config.params.extra.eval_split}} {% endif %}

Defaults

framework_name: mteb
pkg_name: mteb
config:
  params:
    max_retries: 10
    parallelism: 20
    task: MIRACLRetrieval
    request_timeout: 300
    extra:
      query_prompt_template: null
      document_prompt_template: null
      ranker:
        model_id: null
        url: null
        api_key: null
        endpoint_type: nim
      top_k: 40
      truncate: END
      batch_size: 128
      eval_split: test
      dataset_path: null
      cache_path: null
      args: null
      version_lite: true
      language: null
  supported_endpoint_types:
  - embedding
  type: miracl_lite
target:
  api_endpoint: {}
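
The miracl and miracl_lite defaults are nearly identical, so a quick diff makes the difference explicit. The sketch below flattens two nested configs into dotted keys and reports the keys whose values differ. The helper names `flatten` and `diff` are hypothetical, and only a subset of each defaults block is reproduced.

```python
def flatten(config, prefix=""):
    """Flatten a nested dict into {"a.b.c": value} form."""
    flat = {}
    for key, value in config.items():
        dotted = f"{prefix}.{key}" if prefix else key
        if isinstance(value, dict):
            flat.update(flatten(value, dotted))
        else:
            flat[dotted] = value
    return flat


def diff(left, right):
    """Return {dotted_key: (left_value, right_value)} for keys that differ."""
    flat_left, flat_right = flatten(left), flatten(right)
    keys = sorted(flat_left.keys() | flat_right.keys())
    return {
        key: (flat_left.get(key), flat_right.get(key))
        for key in keys
        if flat_left.get(key) != flat_right.get(key)
    }


# Subsets of the two defaults blocks on this page.
MIRACL = {
    "config": {
        "params": {
            "task": "MIRACLRetrieval",
            "extra": {"eval_split": "test", "version_lite": None},
        },
        "type": "miracl",
    }
}
MIRACL_LITE = {
    "config": {
        "params": {
            "task": "MIRACLRetrieval",
            "extra": {"eval_split": "test", "version_lite": True},
        },
        "type": "miracl_lite",
    }
}

for key, (before, after) in diff(MIRACL, MIRACL_LITE).items():
    print(f"{key}: {before!r} -> {after!r}")
```

Only `config.params.extra.version_lite` and `config.type` differ, so the lite task adds a `--version_lite` flag to an otherwise unchanged command.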

mldr#

MLDR (Multilingual Long-Document Retrieval). The task defaults to the name MultiLongDocRetrieval.

Container

Harness: mteb

Container:

nvcr.io/nvidia/eval-factory/mteb:26.01

Container Digest:

sha256:fb0ea5360bec880d4ecbfc63015d775dc3d22601e5ab17d760a992402646cbbb

Container Arch: multiarch

Task Type: mldr

Command

{% if target.api_endpoint.api_key_name is not none %}export API_TOKEN=${{target.api_endpoint.api_key_name}} &&{% endif %} {% if config.params.extra.dataset_path is not none %} export MTEB_INTERNAL_DATASET_PATH={{config.params.extra.dataset_path}} &&{% endif %} {% if config.params.extra.ranker.api_key is not none %}export RANKER_API_TOKEN=${{config.params.extra.ranker.api_key}} &&{% endif %} mteb  --encoder_name {{target.api_endpoint.model_id}} --encoder_url {{target.api_endpoint.url}} --task "{{config.params.task}}" --workdir {{config.output_dir}} --batch_size {{config.params.extra.batch_size}} --async_limit {{config.params.parallelism}} --max_retries {{config.params.max_retries}} --request_timeout {{config.params.request_timeout}} {% if config.params.extra.cache_path is not none %} --cache_path {{config.params.extra.cache_path}}{% endif %} {% if config.params.extra.args is not none %} {{config.params.extra.args}} {% endif %} {% if config.params.extra.language is not none %} --langs {{config.params.extra.language}} {% endif %} {% if config.params.extra.query_prompt_template is not none %} --query_prompt_template "{{config.params.extra.query_prompt_template}}"{% endif %} {% if config.params.extra.document_prompt_template is not none %} --document_prompt_template "{{config.params.extra.document_prompt_template}}"{% endif %} {% if config.params.extra.ranker.model_id is not none %} --ranker_name {{config.params.extra.ranker.model_id}} --ranker_url {{config.params.extra.ranker.url}} --ranker_endpoint_type {{config.params.extra.ranker.endpoint_type}}{% endif %} --truncate {{config.params.extra.truncate}} --top_k {{config.params.extra.top_k}} {% if config.params.extra.version_lite is not none%} --version_lite {{config.params.extra.version_lite}} {% endif %} {% if config.params.extra.eval_split is not none %} --eval_split {{config.params.extra.eval_split}} {% endif %}

Defaults

framework_name: mteb
pkg_name: mteb
config:
  params:
    max_retries: 10
    parallelism: 20
    task: MultiLongDocRetrieval
    request_timeout: 300
    extra:
      query_prompt_template: null
      document_prompt_template: null
      ranker:
        model_id: null
        url: null
        api_key: null
        endpoint_type: nim
      top_k: 40
      truncate: END
      batch_size: 128
      eval_split: test
      dataset_path: null
      cache_path: null
      args: null
      version_lite: null
      language: null
  supported_endpoint_types:
  - embedding
  type: mldr
target:
  api_endpoint: {}

mlqa#

MLQA (MultiLingual Question Answering) is a benchmark dataset for evaluating cross-lingual question answering performance.

Container

Harness: mteb

Container:

nvcr.io/nvidia/eval-factory/mteb:26.01

Container Digest:

sha256:fb0ea5360bec880d4ecbfc63015d775dc3d22601e5ab17d760a992402646cbbb

Container Arch: multiarch

Task Type: mlqa

Command

{% if target.api_endpoint.api_key_name is not none %}export API_TOKEN=${{target.api_endpoint.api_key_name}} &&{% endif %} {% if config.params.extra.dataset_path is not none %} export MTEB_INTERNAL_DATASET_PATH={{config.params.extra.dataset_path}} &&{% endif %} {% if config.params.extra.ranker.api_key is not none %}export RANKER_API_TOKEN=${{config.params.extra.ranker.api_key}} &&{% endif %} mteb  --encoder_name {{target.api_endpoint.model_id}} --encoder_url {{target.api_endpoint.url}} --task "{{config.params.task}}" --workdir {{config.output_dir}} --batch_size {{config.params.extra.batch_size}} --async_limit {{config.params.parallelism}} --max_retries {{config.params.max_retries}} --request_timeout {{config.params.request_timeout}} {% if config.params.extra.cache_path is not none %} --cache_path {{config.params.extra.cache_path}}{% endif %} {% if config.params.extra.args is not none %} {{config.params.extra.args}} {% endif %} {% if config.params.extra.language is not none %} --langs {{config.params.extra.language}} {% endif %} {% if config.params.extra.query_prompt_template is not none %} --query_prompt_template "{{config.params.extra.query_prompt_template}}"{% endif %} {% if config.params.extra.document_prompt_template is not none %} --document_prompt_template "{{config.params.extra.document_prompt_template}}"{% endif %} {% if config.params.extra.ranker.model_id is not none %} --ranker_name {{config.params.extra.ranker.model_id}} --ranker_url {{config.params.extra.ranker.url}} --ranker_endpoint_type {{config.params.extra.ranker.endpoint_type}}{% endif %} --truncate {{config.params.extra.truncate}} --top_k {{config.params.extra.top_k}} {% if config.params.extra.version_lite is not none%} --version_lite {{config.params.extra.version_lite}} {% endif %} {% if config.params.extra.eval_split is not none %} --eval_split {{config.params.extra.eval_split}} {% endif %}

Defaults

framework_name: mteb
pkg_name: mteb
config:
  params:
    max_retries: 10
    parallelism: 20
    task: MLQARetrieval
    request_timeout: 300
    extra:
      query_prompt_template: null
      document_prompt_template: null
      ranker:
        model_id: null
        url: null
        api_key: null
        endpoint_type: nim
      top_k: 40
      truncate: END
      batch_size: 128
      eval_split: test
      dataset_path: null
      cache_path: null
      args: null
      version_lite: null
      language: null
  supported_endpoint_types:
  - embedding
  type: mlqa
target:
  api_endpoint: {}

nano_fiqa#

NanoFiQA2018 is a smaller subset of the Financial Opinion Mining and Question Answering dataset.

Container

Harness: mteb

Container:

nvcr.io/nvidia/eval-factory/mteb:26.01

Container Digest:

sha256:fb0ea5360bec880d4ecbfc63015d775dc3d22601e5ab17d760a992402646cbbb

Container Arch: multiarch

Task Type: nano_fiqa

Command

{% if target.api_endpoint.api_key_name is not none %}export API_TOKEN=${{target.api_endpoint.api_key_name}} &&{% endif %} {% if config.params.extra.dataset_path is not none %} export MTEB_INTERNAL_DATASET_PATH={{config.params.extra.dataset_path}} &&{% endif %} {% if config.params.extra.ranker.api_key is not none %}export RANKER_API_TOKEN=${{config.params.extra.ranker.api_key}} &&{% endif %} mteb  --encoder_name {{target.api_endpoint.model_id}} --encoder_url {{target.api_endpoint.url}} --task "{{config.params.task}}" --workdir {{config.output_dir}} --batch_size {{config.params.extra.batch_size}} --async_limit {{config.params.parallelism}} --max_retries {{config.params.max_retries}} --request_timeout {{config.params.request_timeout}} {% if config.params.extra.cache_path is not none %} --cache_path {{config.params.extra.cache_path}}{% endif %} {% if config.params.extra.args is not none %} {{config.params.extra.args}} {% endif %} {% if config.params.extra.language is not none %} --langs {{config.params.extra.language}} {% endif %} {% if config.params.extra.query_prompt_template is not none %} --query_prompt_template "{{config.params.extra.query_prompt_template}}"{% endif %} {% if config.params.extra.document_prompt_template is not none %} --document_prompt_template "{{config.params.extra.document_prompt_template}}"{% endif %} {% if config.params.extra.ranker.model_id is not none %} --ranker_name {{config.params.extra.ranker.model_id}} --ranker_url {{config.params.extra.ranker.url}} --ranker_endpoint_type {{config.params.extra.ranker.endpoint_type}}{% endif %} --truncate {{config.params.extra.truncate}} --top_k {{config.params.extra.top_k}} {% if config.params.extra.version_lite is not none%} --version_lite {{config.params.extra.version_lite}} {% endif %} {% if config.params.extra.eval_split is not none %} --eval_split {{config.params.extra.eval_split}} {% endif %}

Defaults

framework_name: mteb
pkg_name: mteb
config:
  params:
    max_retries: 10
    parallelism: 20
    task: NanoFiQA2018Retrieval
    request_timeout: 300
    extra:
      query_prompt_template: null
      document_prompt_template: null
      ranker:
        model_id: null
        url: null
        api_key: null
        endpoint_type: nim
      top_k: 40
      truncate: END
      batch_size: 128
      eval_split: train
      dataset_path: null
      cache_path: null
      args: null
      version_lite: null
      language: null
  supported_endpoint_types:
  - embedding
  type: nano_fiqa
target:
  api_endpoint: {}

nq#

Natural Questions (NQ) contains real user questions issued to Google Search, with answers found in Wikipedia by annotators. NQ is designed for training and evaluating automatic question answering systems.

Container

Harness: mteb

Container:

nvcr.io/nvidia/eval-factory/mteb:26.01

Container Digest:

sha256:fb0ea5360bec880d4ecbfc63015d775dc3d22601e5ab17d760a992402646cbbb

Container Arch: multiarch

Task Type: nq

Command

{% if target.api_endpoint.api_key_name is not none %}export API_TOKEN=${{target.api_endpoint.api_key_name}} &&{% endif %} {% if config.params.extra.dataset_path is not none %} export MTEB_INTERNAL_DATASET_PATH={{config.params.extra.dataset_path}} &&{% endif %} {% if config.params.extra.ranker.api_key is not none %}export RANKER_API_TOKEN=${{config.params.extra.ranker.api_key}} &&{% endif %} mteb  --encoder_name {{target.api_endpoint.model_id}} --encoder_url {{target.api_endpoint.url}} --task "{{config.params.task}}" --workdir {{config.output_dir}} --batch_size {{config.params.extra.batch_size}} --async_limit {{config.params.parallelism}} --max_retries {{config.params.max_retries}} --request_timeout {{config.params.request_timeout}} {% if config.params.extra.cache_path is not none %} --cache_path {{config.params.extra.cache_path}}{% endif %} {% if config.params.extra.args is not none %} {{config.params.extra.args}} {% endif %} {% if config.params.extra.language is not none %} --langs {{config.params.extra.language}} {% endif %} {% if config.params.extra.query_prompt_template is not none %} --query_prompt_template "{{config.params.extra.query_prompt_template}}"{% endif %} {% if config.params.extra.document_prompt_template is not none %} --document_prompt_template "{{config.params.extra.document_prompt_template}}"{% endif %} {% if config.params.extra.ranker.model_id is not none %} --ranker_name {{config.params.extra.ranker.model_id}} --ranker_url {{config.params.extra.ranker.url}} --ranker_endpoint_type {{config.params.extra.ranker.endpoint_type}}{% endif %} --truncate {{config.params.extra.truncate}} --top_k {{config.params.extra.top_k}} {% if config.params.extra.version_lite is not none%} --version_lite {{config.params.extra.version_lite}} {% endif %} {% if config.params.extra.eval_split is not none %} --eval_split {{config.params.extra.eval_split}} {% endif %}

Defaults

framework_name: mteb
pkg_name: mteb
config:
  params:
    max_retries: 10
    parallelism: 20
    task: NQ
    request_timeout: 300
    extra:
      query_prompt_template: null
      document_prompt_template: null
      ranker:
        model_id: null
        url: null
        api_key: null
        endpoint_type: nim
      top_k: 40
      truncate: END
      batch_size: 128
      eval_split: test
      dataset_path: null
      cache_path: null
      args: null
      version_lite: null
      language: null
  supported_endpoint_types:
  - embedding
  type: nq
target:
  api_endpoint: {}

nvidia_digital_corpora_10k#

NVIDIA Internal - Digital Corpora10k Retrieval

Container

Harness: mteb

Container:

nvcr.io/nvidia/eval-factory/mteb:26.01

Container Digest:

sha256:fb0ea5360bec880d4ecbfc63015d775dc3d22601e5ab17d760a992402646cbbb

Container Arch: multiarch

Task Type: nvidia_digital_corpora_10k

Command

{% if target.api_endpoint.api_key_name is not none %}export API_TOKEN=${{target.api_endpoint.api_key_name}} &&{% endif %} {% if config.params.extra.dataset_path is not none %} export MTEB_INTERNAL_DATASET_PATH={{config.params.extra.dataset_path}} &&{% endif %} {% if config.params.extra.ranker.api_key is not none %}export RANKER_API_TOKEN=${{config.params.extra.ranker.api_key}} &&{% endif %} mteb  --encoder_name {{target.api_endpoint.model_id}} --encoder_url {{target.api_endpoint.url}} --task "{{config.params.task}}" --workdir {{config.output_dir}} --batch_size {{config.params.extra.batch_size}} --async_limit {{config.params.parallelism}} --max_retries {{config.params.max_retries}} --request_timeout {{config.params.request_timeout}} {% if config.params.extra.cache_path is not none %} --cache_path {{config.params.extra.cache_path}}{% endif %} {% if config.params.extra.args is not none %} {{config.params.extra.args}} {% endif %} {% if config.params.extra.language is not none %} --langs {{config.params.extra.language}} {% endif %} {% if config.params.extra.query_prompt_template is not none %} --query_prompt_template "{{config.params.extra.query_prompt_template}}"{% endif %} {% if config.params.extra.document_prompt_template is not none %} --document_prompt_template "{{config.params.extra.document_prompt_template}}"{% endif %} {% if config.params.extra.ranker.model_id is not none %} --ranker_name {{config.params.extra.ranker.model_id}} --ranker_url {{config.params.extra.ranker.url}} --ranker_endpoint_type {{config.params.extra.ranker.endpoint_type}}{% endif %} --truncate {{config.params.extra.truncate}} --top_k {{config.params.extra.top_k}} {% if config.params.extra.version_lite is not none%} --version_lite {{config.params.extra.version_lite}} {% endif %} {% if config.params.extra.eval_split is not none %} --eval_split {{config.params.extra.eval_split}} {% endif %}

Defaults

framework_name: mteb
pkg_name: mteb
config:
  params:
    max_retries: 10
    parallelism: 20
    task: nvidia_digital_corpora_10k
    request_timeout: 300
    extra:
      query_prompt_template: null
      document_prompt_template: null
      ranker:
        model_id: null
        url: null
        api_key: null
        endpoint_type: nim
      top_k: 40
      truncate: END
      batch_size: 128
      eval_split: test
      dataset_path: null
      cache_path: null
      args: null
      version_lite: null
      language: null
  supported_endpoint_types:
  - embedding
  type: nvidia_digital_corpora_10k
target:
  api_endpoint: {}

nvidia_digital_corpora_10k_text#

NVIDIA Internal - Digital Corpora10k Text Retrieval

Container

Harness: mteb

Container:

nvcr.io/nvidia/eval-factory/mteb:26.01

Container Digest:

sha256:fb0ea5360bec880d4ecbfc63015d775dc3d22601e5ab17d760a992402646cbbb

Container Arch: multiarch

Task Type: nvidia_digital_corpora_10k_text

Command

{% if target.api_endpoint.api_key_name is not none %}export API_TOKEN=${{target.api_endpoint.api_key_name}} &&{% endif %} {% if config.params.extra.dataset_path is not none %} export MTEB_INTERNAL_DATASET_PATH={{config.params.extra.dataset_path}} &&{% endif %} {% if config.params.extra.ranker.api_key is not none %}export RANKER_API_TOKEN=${{config.params.extra.ranker.api_key}} &&{% endif %} mteb  --encoder_name {{target.api_endpoint.model_id}} --encoder_url {{target.api_endpoint.url}} --task "{{config.params.task}}" --workdir {{config.output_dir}} --batch_size {{config.params.extra.batch_size}} --async_limit {{config.params.parallelism}} --max_retries {{config.params.max_retries}} --request_timeout {{config.params.request_timeout}} {% if config.params.extra.cache_path is not none %} --cache_path {{config.params.extra.cache_path}}{% endif %} {% if config.params.extra.args is not none %} {{config.params.extra.args}} {% endif %} {% if config.params.extra.language is not none %} --langs {{config.params.extra.language}} {% endif %} {% if config.params.extra.query_prompt_template is not none %} --query_prompt_template "{{config.params.extra.query_prompt_template}}"{% endif %} {% if config.params.extra.document_prompt_template is not none %} --document_prompt_template "{{config.params.extra.document_prompt_template}}"{% endif %} {% if config.params.extra.ranker.model_id is not none %} --ranker_name {{config.params.extra.ranker.model_id}} --ranker_url {{config.params.extra.ranker.url}} --ranker_endpoint_type {{config.params.extra.ranker.endpoint_type}}{% endif %} --truncate {{config.params.extra.truncate}} --top_k {{config.params.extra.top_k}} {% if config.params.extra.version_lite is not none%} --version_lite {{config.params.extra.version_lite}} {% endif %} {% if config.params.extra.eval_split is not none %} --eval_split {{config.params.extra.eval_split}} {% endif %}

Defaults

framework_name: mteb
pkg_name: mteb
config:
  params:
    max_retries: 10
    parallelism: 20
    task: nvidia_digital_corpora_10k_text
    request_timeout: 300
    extra:
      query_prompt_template: null
      document_prompt_template: null
      ranker:
        model_id: null
        url: null
        api_key: null
        endpoint_type: nim
      top_k: 40
      truncate: END
      batch_size: 128
      eval_split: test
      dataset_path: null
      cache_path: null
      args: null
      version_lite: null
      language: null
  supported_endpoint_types:
  - embedding
  type: nvidia_digital_corpora_10k_text
target:
  api_endpoint: {}

nvidia_earnings_v2#

NVIDIA Internal - Earnings V2 Multimodal Retrieval

Container

Harness: mteb

Container:

nvcr.io/nvidia/eval-factory/mteb:26.01

Container Digest:

sha256:fb0ea5360bec880d4ecbfc63015d775dc3d22601e5ab17d760a992402646cbbb

Container Arch: multiarch

Task Type: nvidia_earnings_v2

Command

{% if target.api_endpoint.api_key_name is not none %}export API_TOKEN=${{target.api_endpoint.api_key_name}} &&{% endif %} {% if config.params.extra.dataset_path is not none %} export MTEB_INTERNAL_DATASET_PATH={{config.params.extra.dataset_path}} &&{% endif %} {% if config.params.extra.ranker.api_key is not none %}export RANKER_API_TOKEN=${{config.params.extra.ranker.api_key}} &&{% endif %} mteb  --encoder_name {{target.api_endpoint.model_id}} --encoder_url {{target.api_endpoint.url}} --task "{{config.params.task}}" --workdir {{config.output_dir}} --batch_size {{config.params.extra.batch_size}} --async_limit {{config.params.parallelism}} --max_retries {{config.params.max_retries}} --request_timeout {{config.params.request_timeout}} {% if config.params.extra.cache_path is not none %} --cache_path {{config.params.extra.cache_path}}{% endif %} {% if config.params.extra.args is not none %} {{config.params.extra.args}} {% endif %} {% if config.params.extra.language is not none %} --langs {{config.params.extra.language}} {% endif %} {% if config.params.extra.query_prompt_template is not none %} --query_prompt_template "{{config.params.extra.query_prompt_template}}"{% endif %} {% if config.params.extra.document_prompt_template is not none %} --document_prompt_template "{{config.params.extra.document_prompt_template}}"{% endif %} {% if config.params.extra.ranker.model_id is not none %} --ranker_name {{config.params.extra.ranker.model_id}} --ranker_url {{config.params.extra.ranker.url}} --ranker_endpoint_type {{config.params.extra.ranker.endpoint_type}}{% endif %} --truncate {{config.params.extra.truncate}} --top_k {{config.params.extra.top_k}} {% if config.params.extra.version_lite is not none%} --version_lite {{config.params.extra.version_lite}} {% endif %} {% if config.params.extra.eval_split is not none %} --eval_split {{config.params.extra.eval_split}} {% endif %}

Defaults

framework_name: mteb
pkg_name: mteb
config:
  params:
    max_retries: 10
    parallelism: 20
    task: nvidia_earnings_v2
    request_timeout: 300
    extra:
      query_prompt_template: null
      document_prompt_template: null
      ranker:
        model_id: null
        url: null
        api_key: null
        endpoint_type: nim
      top_k: 40
      truncate: END
      batch_size: 128
      eval_split: test
      dataset_path: null
      cache_path: null
      args: null
      version_lite: null
      language: null
  supported_endpoint_types:
  - embedding
  type: nvidia_earnings_v2
target:
  api_endpoint: {}
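
The command template above emits several flags only when the matching setting is not null. The following is a minimal Python sketch of that conditional logic, not the harness's own rendering code. It assumes the defaults listed above; the `model_id`, `url`, and `output_dir` values are hypothetical placeholders (the default `target.api_endpoint` is empty), and `build_mteb_args` is an illustrative helper name. The environment-variable exports at the start of the template are left out.

```python
import shlex

# Defaults copied from the listing above.
extra = {
    "query_prompt_template": None,
    "document_prompt_template": None,
    "ranker": {"model_id": None, "url": None, "api_key": None, "endpoint_type": "nim"},
    "top_k": 40,
    "truncate": "END",
    "batch_size": 128,
    "eval_split": "test",
    "dataset_path": None,
    "cache_path": None,
    "args": None,
    "version_lite": None,
    "language": None,
}
params = {"max_retries": 10, "parallelism": 20, "task": "nvidia_earnings_v2",
          "request_timeout": 300, "extra": extra}


def build_mteb_args(params, model_id, url, output_dir):
    """Approximate the argument list that the template renders (hypothetical helper)."""
    extra = params["extra"]
    argv = [
        "mteb",
        "--encoder_name", model_id,
        "--encoder_url", url,
        "--task", params["task"],
        "--workdir", output_dir,
        "--batch_size", str(extra["batch_size"]),
        "--async_limit", str(params["parallelism"]),
        "--max_retries", str(params["max_retries"]),
        "--request_timeout", str(params["request_timeout"]),
    ]
    if extra["cache_path"] is not None:
        argv += ["--cache_path", extra["cache_path"]]
    if extra["args"] is not None:
        argv += shlex.split(extra["args"])  # free-form extra arguments, unquoted in the template
    if extra["language"] is not None:
        argv += ["--langs", *shlex.split(extra["language"])]  # unquoted in the template
    if extra["query_prompt_template"] is not None:
        argv += ["--query_prompt_template", extra["query_prompt_template"]]
    if extra["document_prompt_template"] is not None:
        argv += ["--document_prompt_template", extra["document_prompt_template"]]
    ranker = extra["ranker"]
    if ranker["model_id"] is not None:
        # All three ranker flags are gated on ranker.model_id alone.
        argv += ["--ranker_name", ranker["model_id"],
                 "--ranker_url", str(ranker["url"]),
                 "--ranker_endpoint_type", ranker["endpoint_type"]]
    # --truncate and --top_k are always emitted.
    argv += ["--truncate", extra["truncate"], "--top_k", str(extra["top_k"])]
    if extra["version_lite"] is not None:
        argv += ["--version_lite", str(extra["version_lite"])]
    if extra["eval_split"] is not None:
        argv += ["--eval_split", extra["eval_split"]]
    return argv


# Hypothetical endpoint values, for illustration only.
argv = build_mteb_args(params, "my-embedding-model",
                       "http://localhost:8000/v1/embeddings", "/tmp/results")
print(shlex.join(argv))
```

With the defaults unchanged, no ranker, cache, language, or prompt-template flags appear in the result, and the argument list ends with `--eval_split test`.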

nvidia_earnings_v2_text#

NVIDIA Internal - Earnings V2 Text Retrieval

Container

Harness: mteb

Container: nvcr.io/nvidia/eval-factory/mteb:26.01

Container Digest: sha256:fb0ea5360bec880d4ecbfc63015d775dc3d22601e5ab17d760a992402646cbbb

Container Arch: multiarch

Task Type: nvidia_earnings_v2_text

Command

{% if target.api_endpoint.api_key_name is not none %}export API_TOKEN=${{target.api_endpoint.api_key_name}} &&{% endif %} {% if config.params.extra.dataset_path is not none %} export MTEB_INTERNAL_DATASET_PATH={{config.params.extra.dataset_path}} &&{% endif %} {% if config.params.extra.ranker.api_key is not none %}export RANKER_API_TOKEN=${{config.params.extra.ranker.api_key}} &&{% endif %} mteb  --encoder_name {{target.api_endpoint.model_id}} --encoder_url {{target.api_endpoint.url}} --task "{{config.params.task}}" --workdir {{config.output_dir}} --batch_size {{config.params.extra.batch_size}} --async_limit {{config.params.parallelism}} --max_retries {{config.params.max_retries}} --request_timeout {{config.params.request_timeout}} {% if config.params.extra.cache_path is not none %} --cache_path {{config.params.extra.cache_path}}{% endif %} {% if config.params.extra.args is not none %} {{config.params.extra.args}} {% endif %} {% if config.params.extra.language is not none %} --langs {{config.params.extra.language}} {% endif %} {% if config.params.extra.query_prompt_template is not none %} --query_prompt_template "{{config.params.extra.query_prompt_template}}"{% endif %} {% if config.params.extra.document_prompt_template is not none %} --document_prompt_template "{{config.params.extra.document_prompt_template}}"{% endif %} {% if config.params.extra.ranker.model_id is not none %} --ranker_name {{config.params.extra.ranker.model_id}} --ranker_url {{config.params.extra.ranker.url}} --ranker_endpoint_type {{config.params.extra.ranker.endpoint_type}}{% endif %} --truncate {{config.params.extra.truncate}} --top_k {{config.params.extra.top_k}} {% if config.params.extra.version_lite is not none%} --version_lite {{config.params.extra.version_lite}} {% endif %} {% if config.params.extra.eval_split is not none %} --eval_split {{config.params.extra.eval_split}} {% endif %}

Defaults

framework_name: mteb
pkg_name: mteb
config:
  params:
    max_retries: 10
    parallelism: 20
    task: nvidia_earnings_v2_text
    request_timeout: 300
    extra:
      query_prompt_template: null
      document_prompt_template: null
      ranker:
        model_id: null
        url: null
        api_key: null
        endpoint_type: nim
      top_k: 40
      truncate: END
      batch_size: 128
      eval_split: test
      dataset_path: null
      cache_path: null
      args: null
      version_lite: null
      language: null
  supported_endpoint_types:
  - embedding
  type: nvidia_earnings_v2_text
target:
  api_endpoint: {}

nvidia_vidore_v1#

NVIDIA ViDoReV1

Container

Harness: mteb

Container: nvcr.io/nvidia/eval-factory/mteb:26.01

Container Digest: sha256:fb0ea5360bec880d4ecbfc63015d775dc3d22601e5ab17d760a992402646cbbb

Container Arch: multiarch

Task Type: nvidia_vidore_v1

Command

{% if target.api_endpoint.api_key_name is not none %}export API_TOKEN=${{target.api_endpoint.api_key_name}} &&{% endif %} {% if config.params.extra.dataset_path is not none %} export MTEB_INTERNAL_DATASET_PATH={{config.params.extra.dataset_path}} &&{% endif %} {% if config.params.extra.ranker.api_key is not none %}export RANKER_API_TOKEN=${{config.params.extra.ranker.api_key}} &&{% endif %} mteb  --encoder_name {{target.api_endpoint.model_id}} --encoder_url {{target.api_endpoint.url}} --task "{{config.params.task}}" --workdir {{config.output_dir}} --batch_size {{config.params.extra.batch_size}} --async_limit {{config.params.parallelism}} --max_retries {{config.params.max_retries}} --request_timeout {{config.params.request_timeout}} {% if config.params.extra.cache_path is not none %} --cache_path {{config.params.extra.cache_path}}{% endif %} {% if config.params.extra.args is not none %} {{config.params.extra.args}} {% endif %} {% if config.params.extra.language is not none %} --langs {{config.params.extra.language}} {% endif %} {% if config.params.extra.query_prompt_template is not none %} --query_prompt_template "{{config.params.extra.query_prompt_template}}"{% endif %} {% if config.params.extra.document_prompt_template is not none %} --document_prompt_template "{{config.params.extra.document_prompt_template}}"{% endif %} {% if config.params.extra.ranker.model_id is not none %} --ranker_name {{config.params.extra.ranker.model_id}} --ranker_url {{config.params.extra.ranker.url}} --ranker_endpoint_type {{config.params.extra.ranker.endpoint_type}}{% endif %} --truncate {{config.params.extra.truncate}} --top_k {{config.params.extra.top_k}} {% if config.params.extra.version_lite is not none%} --version_lite {{config.params.extra.version_lite}} {% endif %} {% if config.params.extra.eval_split is not none %} --eval_split {{config.params.extra.eval_split}} {% endif %}

Defaults

framework_name: mteb
pkg_name: mteb
config:
  params:
    max_retries: 10
    parallelism: 20
    task: nvidia_vidore_v1
    request_timeout: 300
    extra:
      query_prompt_template: null
      document_prompt_template: null
      ranker:
        model_id: null
        url: null
        api_key: null
        endpoint_type: nim
      top_k: 40
      truncate: END
      batch_size: 128
      eval_split: test
      dataset_path: null
      cache_path: null
      args: null
      version_lite: null
      language: null
  supported_endpoint_types:
  - embedding
  type: nvidia_vidore_v1
target:
  api_endpoint: {}
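
In the command template, `target.api_endpoint.api_key_name` holds the name of an environment variable, not the key itself: `${{target.api_endpoint.api_key_name}}` renders to `$` followed by that name, and the shell resolves the value at run time. The sketch below illustrates this indirection under stated assumptions: `render_token_export` is a hypothetical helper that mirrors only the first conditional of the template, `MY_EMBEDDING_KEY` and its value are made up, and `os.path.expandvars` stands in for the shell's variable expansion.

```python
import os


def render_token_export(api_key_name):
    """Mirror the template's first conditional: no key name, no export."""
    if api_key_name is None:
        return ""
    # The template emits "$" followed by the variable NAME, so the secret
    # itself never appears in the rendered command.
    return "export API_TOKEN=$" + api_key_name + " &&"


os.environ["MY_EMBEDDING_KEY"] = "secret-123"  # hypothetical name and value

rendered = render_token_export("MY_EMBEDDING_KEY")
print(rendered)                         # export API_TOKEN=$MY_EMBEDDING_KEY &&
print(os.path.expandvars(rendered))     # export API_TOKEN=secret-123 &&
print(repr(render_token_export(None)))  # ''
```

The template applies the same pattern to `config.params.extra.ranker.api_key`, which is exported as `RANKER_API_TOKEN`.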

nvidia_vidore_v1_text#

NVIDIA Internal - ViDoReV1 Text Retrieval

Container

Harness: mteb

Container: nvcr.io/nvidia/eval-factory/mteb:26.01

Container Digest: sha256:fb0ea5360bec880d4ecbfc63015d775dc3d22601e5ab17d760a992402646cbbb

Container Arch: multiarch

Task Type: nvidia_vidore_v1_text

Command

{% if target.api_endpoint.api_key_name is not none %}export API_TOKEN=${{target.api_endpoint.api_key_name}} &&{% endif %} {% if config.params.extra.dataset_path is not none %} export MTEB_INTERNAL_DATASET_PATH={{config.params.extra.dataset_path}} &&{% endif %} {% if config.params.extra.ranker.api_key is not none %}export RANKER_API_TOKEN=${{config.params.extra.ranker.api_key}} &&{% endif %} mteb  --encoder_name {{target.api_endpoint.model_id}} --encoder_url {{target.api_endpoint.url}} --task "{{config.params.task}}" --workdir {{config.output_dir}} --batch_size {{config.params.extra.batch_size}} --async_limit {{config.params.parallelism}} --max_retries {{config.params.max_retries}} --request_timeout {{config.params.request_timeout}} {% if config.params.extra.cache_path is not none %} --cache_path {{config.params.extra.cache_path}}{% endif %} {% if config.params.extra.args is not none %} {{config.params.extra.args}} {% endif %} {% if config.params.extra.language is not none %} --langs {{config.params.extra.language}} {% endif %} {% if config.params.extra.query_prompt_template is not none %} --query_prompt_template "{{config.params.extra.query_prompt_template}}"{% endif %} {% if config.params.extra.document_prompt_template is not none %} --document_prompt_template "{{config.params.extra.document_prompt_template}}"{% endif %} {% if config.params.extra.ranker.model_id is not none %} --ranker_name {{config.params.extra.ranker.model_id}} --ranker_url {{config.params.extra.ranker.url}} --ranker_endpoint_type {{config.params.extra.ranker.endpoint_type}}{% endif %} --truncate {{config.params.extra.truncate}} --top_k {{config.params.extra.top_k}} {% if config.params.extra.version_lite is not none%} --version_lite {{config.params.extra.version_lite}} {% endif %} {% if config.params.extra.eval_split is not none %} --eval_split {{config.params.extra.eval_split}} {% endif %}

Defaults

framework_name: mteb
pkg_name: mteb
config:
  params:
    max_retries: 10
    parallelism: 20
    task: NVIDIA ViDoRe V1 (Text)
    request_timeout: 300
    extra:
      query_prompt_template: null
      document_prompt_template: null
      ranker:
        model_id: null
        url: null
        api_key: null
        endpoint_type: nim
      top_k: 40
      truncate: END
      batch_size: 128
      eval_split: test
      dataset_path: null
      cache_path: null
      args: null
      version_lite: null
      language: null
  supported_endpoint_types:
  - embedding
  type: nvidia_vidore_v1_text
target:
  api_endpoint: {}
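
The default `task` value for this entry contains spaces and parentheses, which is why the command template wraps it in double quotes (`--task "{{config.params.task}}"`). The short sketch below uses Python's `shlex` module as a stand-in for shell word splitting to show the effect of the quotes; the trailing `--top_k 40` is included only to show where the task argument ends.

```python
import shlex

task = "NVIDIA ViDoRe V1 (Text)"  # default task value from the listing above

# Quoted, as in the template: the whole task name stays one argument.
tokens = shlex.split(f'mteb --task "{task}" --top_k 40')
print(tokens)
# ['mteb', '--task', 'NVIDIA ViDoRe V1 (Text)', '--top_k', '40']

# Unquoted, for contrast: the name is split on whitespace into four words.
unquoted = shlex.split(f"mteb --task {task}")
print(len(unquoted))  # 6
```

A real shell is stricter than `shlex` here: bare parentheses are shell syntax, so the unquoted form would typically fail to parse rather than merely split.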

nvidia_vidore_v2#

NVIDIA ViDoReV2

Container

Harness: mteb

Container: nvcr.io/nvidia/eval-factory/mteb:26.01

Container Digest: sha256:fb0ea5360bec880d4ecbfc63015d775dc3d22601e5ab17d760a992402646cbbb

Container Arch: multiarch

Task Type: nvidia_vidore_v2

Command

{% if target.api_endpoint.api_key_name is not none %}export API_TOKEN=${{target.api_endpoint.api_key_name}} &&{% endif %} {% if config.params.extra.dataset_path is not none %} export MTEB_INTERNAL_DATASET_PATH={{config.params.extra.dataset_path}} &&{% endif %} {% if config.params.extra.ranker.api_key is not none %}export RANKER_API_TOKEN=${{config.params.extra.ranker.api_key}} &&{% endif %} mteb  --encoder_name {{target.api_endpoint.model_id}} --encoder_url {{target.api_endpoint.url}} --task "{{config.params.task}}" --workdir {{config.output_dir}} --batch_size {{config.params.extra.batch_size}} --async_limit {{config.params.parallelism}} --max_retries {{config.params.max_retries}} --request_timeout {{config.params.request_timeout}} {% if config.params.extra.cache_path is not none %} --cache_path {{config.params.extra.cache_path}}{% endif %} {% if config.params.extra.args is not none %} {{config.params.extra.args}} {% endif %} {% if config.params.extra.language is not none %} --langs {{config.params.extra.language}} {% endif %} {% if config.params.extra.query_prompt_template is not none %} --query_prompt_template "{{config.params.extra.query_prompt_template}}"{% endif %} {% if config.params.extra.document_prompt_template is not none %} --document_prompt_template "{{config.params.extra.document_prompt_template}}"{% endif %} {% if config.params.extra.ranker.model_id is not none %} --ranker_name {{config.params.extra.ranker.model_id}} --ranker_url {{config.params.extra.ranker.url}} --ranker_endpoint_type {{config.params.extra.ranker.endpoint_type}}{% endif %} --truncate {{config.params.extra.truncate}} --top_k {{config.params.extra.top_k}} {% if config.params.extra.version_lite is not none%} --version_lite {{config.params.extra.version_lite}} {% endif %} {% if config.params.extra.eval_split is not none %} --eval_split {{config.params.extra.eval_split}} {% endif %}

Defaults

framework_name: mteb
pkg_name: mteb
config:
  params:
    max_retries: 10
    parallelism: 20
    task: nvidia_vidore_v2
    request_timeout: 300
    extra:
      query_prompt_template: null
      document_prompt_template: null
      ranker:
        model_id: null
        url: null
        api_key: null
        endpoint_type: nim
      top_k: 40
      truncate: END
      batch_size: 128
      eval_split: test
      dataset_path: null
      cache_path: null
      args: null
      version_lite: null
      language: null
  supported_endpoint_types:
  - embedding
  type: nvidia_vidore_v2
target:
  api_endpoint: {}

nvidia_vidore_v2_text#

NVIDIA Internal - ViDoReV2 Text Retrieval

Container

Harness: mteb

Container: nvcr.io/nvidia/eval-factory/mteb:26.01

Container Digest: sha256:fb0ea5360bec880d4ecbfc63015d775dc3d22601e5ab17d760a992402646cbbb

Container Arch: multiarch

Task Type: nvidia_vidore_v2_text

Command

{% if target.api_endpoint.api_key_name is not none %}export API_TOKEN=${{target.api_endpoint.api_key_name}} &&{% endif %} {% if config.params.extra.dataset_path is not none %} export MTEB_INTERNAL_DATASET_PATH={{config.params.extra.dataset_path}} &&{% endif %} {% if config.params.extra.ranker.api_key is not none %}export RANKER_API_TOKEN=${{config.params.extra.ranker.api_key}} &&{% endif %} mteb  --encoder_name {{target.api_endpoint.model_id}} --encoder_url {{target.api_endpoint.url}} --task "{{config.params.task}}" --workdir {{config.output_dir}} --batch_size {{config.params.extra.batch_size}} --async_limit {{config.params.parallelism}} --max_retries {{config.params.max_retries}} --request_timeout {{config.params.request_timeout}} {% if config.params.extra.cache_path is not none %} --cache_path {{config.params.extra.cache_path}}{% endif %} {% if config.params.extra.args is not none %} {{config.params.extra.args}} {% endif %} {% if config.params.extra.language is not none %} --langs {{config.params.extra.language}} {% endif %} {% if config.params.extra.query_prompt_template is not none %} --query_prompt_template "{{config.params.extra.query_prompt_template}}"{% endif %} {% if config.params.extra.document_prompt_template is not none %} --document_prompt_template "{{config.params.extra.document_prompt_template}}"{% endif %} {% if config.params.extra.ranker.model_id is not none %} --ranker_name {{config.params.extra.ranker.model_id}} --ranker_url {{config.params.extra.ranker.url}} --ranker_endpoint_type {{config.params.extra.ranker.endpoint_type}}{% endif %} --truncate {{config.params.extra.truncate}} --top_k {{config.params.extra.top_k}} {% if config.params.extra.version_lite is not none%} --version_lite {{config.params.extra.version_lite}} {% endif %} {% if config.params.extra.eval_split is not none %} --eval_split {{config.params.extra.eval_split}} {% endif %}

Defaults

framework_name: mteb
pkg_name: mteb
config:
  params:
    max_retries: 10
    parallelism: 20
    task: NVIDIA ViDoRe V2 (Text)
    request_timeout: 300
    extra:
      query_prompt_template: null
      document_prompt_template: null
      ranker:
        model_id: null
        url: null
        api_key: null
        endpoint_type: nim
      top_k: 40
      truncate: END
      batch_size: 128
      eval_split: test
      dataset_path: null
      cache_path: null
      args: null
      version_lite: null
      language: null
  supported_endpoint_types:
  - embedding
  type: nvidia_vidore_v2_text
target:
  api_endpoint: {}

nvidia_vidore_v3#

NVIDIA ViDoReV3

Container

Harness: mteb

Container: nvcr.io/nvidia/eval-factory/mteb:26.01

Container Digest: sha256:fb0ea5360bec880d4ecbfc63015d775dc3d22601e5ab17d760a992402646cbbb

Container Arch: multiarch

Task Type: nvidia_vidore_v3

Command

{% if target.api_endpoint.api_key_name is not none %}export API_TOKEN=${{target.api_endpoint.api_key_name}} &&{% endif %} {% if config.params.extra.dataset_path is not none %} export MTEB_INTERNAL_DATASET_PATH={{config.params.extra.dataset_path}} &&{% endif %} {% if config.params.extra.ranker.api_key is not none %}export RANKER_API_TOKEN=${{config.params.extra.ranker.api_key}} &&{% endif %} mteb  --encoder_name {{target.api_endpoint.model_id}} --encoder_url {{target.api_endpoint.url}} --task "{{config.params.task}}" --workdir {{config.output_dir}} --batch_size {{config.params.extra.batch_size}} --async_limit {{config.params.parallelism}} --max_retries {{config.params.max_retries}} --request_timeout {{config.params.request_timeout}} {% if config.params.extra.cache_path is not none %} --cache_path {{config.params.extra.cache_path}}{% endif %} {% if config.params.extra.args is not none %} {{config.params.extra.args}} {% endif %} {% if config.params.extra.language is not none %} --langs {{config.params.extra.language}} {% endif %} {% if config.params.extra.query_prompt_template is not none %} --query_prompt_template "{{config.params.extra.query_prompt_template}}"{% endif %} {% if config.params.extra.document_prompt_template is not none %} --document_prompt_template "{{config.params.extra.document_prompt_template}}"{% endif %} {% if config.params.extra.ranker.model_id is not none %} --ranker_name {{config.params.extra.ranker.model_id}} --ranker_url {{config.params.extra.ranker.url}} --ranker_endpoint_type {{config.params.extra.ranker.endpoint_type}}{% endif %} --truncate {{config.params.extra.truncate}} --top_k {{config.params.extra.top_k}} {% if config.params.extra.version_lite is not none%} --version_lite {{config.params.extra.version_lite}} {% endif %} {% if config.params.extra.eval_split is not none %} --eval_split {{config.params.extra.eval_split}} {% endif %}

Defaults

framework_name: mteb
pkg_name: mteb
config:
  params:
    max_retries: 10
    parallelism: 20
    task: nvidia_vidore_v3
    request_timeout: 300
    extra:
      query_prompt_template: null
      document_prompt_template: null
      ranker:
        model_id: null
        url: null
        api_key: null
        endpoint_type: nim
      top_k: 40
      truncate: END
      batch_size: 128
      eval_split: test
      dataset_path: null
      cache_path: null
      args: null
      version_lite: null
      language: null
  supported_endpoint_types:
  - embedding
  type: nvidia_vidore_v3
target:
  api_endpoint: {}

nvidia_vidore_v3_text#

NVIDIA Internal - ViDoReV3 Text Retrieval

Container

Harness: mteb

Container: nvcr.io/nvidia/eval-factory/mteb:26.01

Container Digest: sha256:fb0ea5360bec880d4ecbfc63015d775dc3d22601e5ab17d760a992402646cbbb

Container Arch: multiarch

Task Type: nvidia_vidore_v3_text

Command

{% if target.api_endpoint.api_key_name is not none %}export API_TOKEN=${{target.api_endpoint.api_key_name}} &&{% endif %} {% if config.params.extra.dataset_path is not none %} export MTEB_INTERNAL_DATASET_PATH={{config.params.extra.dataset_path}} &&{% endif %} {% if config.params.extra.ranker.api_key is not none %}export RANKER_API_TOKEN=${{config.params.extra.ranker.api_key}} &&{% endif %} mteb  --encoder_name {{target.api_endpoint.model_id}} --encoder_url {{target.api_endpoint.url}} --task "{{config.params.task}}" --workdir {{config.output_dir}} --batch_size {{config.params.extra.batch_size}} --async_limit {{config.params.parallelism}} --max_retries {{config.params.max_retries}} --request_timeout {{config.params.request_timeout}} {% if config.params.extra.cache_path is not none %} --cache_path {{config.params.extra.cache_path}}{% endif %} {% if config.params.extra.args is not none %} {{config.params.extra.args}} {% endif %} {% if config.params.extra.language is not none %} --langs {{config.params.extra.language}} {% endif %} {% if config.params.extra.query_prompt_template is not none %} --query_prompt_template "{{config.params.extra.query_prompt_template}}"{% endif %} {% if config.params.extra.document_prompt_template is not none %} --document_prompt_template "{{config.params.extra.document_prompt_template}}"{% endif %} {% if config.params.extra.ranker.model_id is not none %} --ranker_name {{config.params.extra.ranker.model_id}} --ranker_url {{config.params.extra.ranker.url}} --ranker_endpoint_type {{config.params.extra.ranker.endpoint_type}}{% endif %} --truncate {{config.params.extra.truncate}} --top_k {{config.params.extra.top_k}} {% if config.params.extra.version_lite is not none%} --version_lite {{config.params.extra.version_lite}} {% endif %} {% if config.params.extra.eval_split is not none %} --eval_split {{config.params.extra.eval_split}} {% endif %}

Defaults

framework_name: mteb
pkg_name: mteb
config:
  params:
    max_retries: 10
    parallelism: 20
    task: NVIDIA ViDoRe V3 (Text)
    request_timeout: 300
    extra:
      query_prompt_template: null
      document_prompt_template: null
      ranker:
        model_id: null
        url: null
        api_key: null
        endpoint_type: nim
      top_k: 40
      truncate: END
      batch_size: 128
      eval_split: test
      dataset_path: null
      cache_path: null
      args: null
      version_lite: null
      language: null
  supported_endpoint_types:
  - embedding
  type: nvidia_vidore_v3_text
target:
  api_endpoint: {}

techqa#

NVIDIA TechQA

Container

Harness: mteb

Container: nvcr.io/nvidia/eval-factory/mteb:26.01

Container Digest: sha256:fb0ea5360bec880d4ecbfc63015d775dc3d22601e5ab17d760a992402646cbbb

Container Arch: multiarch

Task Type: techqa

Command

{% if target.api_endpoint.api_key_name is not none %}export API_TOKEN=${{target.api_endpoint.api_key_name}} &&{% endif %} {% if config.params.extra.dataset_path is not none %} export MTEB_INTERNAL_DATASET_PATH={{config.params.extra.dataset_path}} &&{% endif %} {% if config.params.extra.ranker.api_key is not none %}export RANKER_API_TOKEN=${{config.params.extra.ranker.api_key}} &&{% endif %} mteb  --encoder_name {{target.api_endpoint.model_id}} --encoder_url {{target.api_endpoint.url}} --task "{{config.params.task}}" --workdir {{config.output_dir}} --batch_size {{config.params.extra.batch_size}} --async_limit {{config.params.parallelism}} --max_retries {{config.params.max_retries}} --request_timeout {{config.params.request_timeout}} {% if config.params.extra.cache_path is not none %} --cache_path {{config.params.extra.cache_path}}{% endif %} {% if config.params.extra.args is not none %} {{config.params.extra.args}} {% endif %} {% if config.params.extra.language is not none %} --langs {{config.params.extra.language}} {% endif %} {% if config.params.extra.query_prompt_template is not none %} --query_prompt_template "{{config.params.extra.query_prompt_template}}"{% endif %} {% if config.params.extra.document_prompt_template is not none %} --document_prompt_template "{{config.params.extra.document_prompt_template}}"{% endif %} {% if config.params.extra.ranker.model_id is not none %} --ranker_name {{config.params.extra.ranker.model_id}} --ranker_url {{config.params.extra.ranker.url}} --ranker_endpoint_type {{config.params.extra.ranker.endpoint_type}}{% endif %} --truncate {{config.params.extra.truncate}} --top_k {{config.params.extra.top_k}} {% if config.params.extra.version_lite is not none%} --version_lite {{config.params.extra.version_lite}} {% endif %} {% if config.params.extra.eval_split is not none %} --eval_split {{config.params.extra.eval_split}} {% endif %}

Defaults

framework_name: mteb
pkg_name: mteb
config:
  params:
    max_retries: 10
    parallelism: 20
    task: TechQA
    request_timeout: 300
    extra:
      query_prompt_template: null
      document_prompt_template: null
      ranker:
        model_id: null
        url: null
        api_key: null
        endpoint_type: nim
      top_k: 40
      truncate: END
      batch_size: 128
      eval_split: test
      dataset_path: null
      cache_path: null
      args: null
      version_lite: null
      language: null
  supported_endpoint_types:
  - embedding
  type: techqa
target:
  api_endpoint: {}