nemo_rl.data.datasets.response_datasets.clevr#

Module Contents#

Classes#

Functions#

format_answer_fromtags

Extract content between tags and strip whitespace.

format_clevr_cogent_dataset

Format the CLEVR-CoGenT dataset into an OpenAI-API-like message log.

prepare_clevr_cogent_dataset

API#

nemo_rl.data.datasets.response_datasets.clevr.format_answer_fromtags(answer: str) str#

Extract content between tags and strip whitespace.

nemo_rl.data.datasets.response_datasets.clevr.format_clevr_cogent_dataset(
example: dict[str, Any],
return_pil: bool = False,
) dict[str, Any]#

Format the CLEVR-CoGenT dataset into an OpenAI-API-like message log.

nemo_rl.data.datasets.response_datasets.clevr.prepare_clevr_cogent_dataset(
split: str = 'trainA',
task_name: Optional[str] = None,
)#
class nemo_rl.data.datasets.response_datasets.clevr.CLEVRCoGenTDataset(
split: str = 'trainA',
prompt_file: Optional[str] = None,
)#

Initialization

Simple wrapper around the CLEVR-CoGenT dataset.

Parameters:
  • split – The split of the dataset to use.

  • prompt_file – The file containing the prompt for the dataset.