nemo_rl.data.datasets.response_datasets.clevr#

Module Contents#

Classes#

CLEVRCoGenTDataset

Simple wrapper around the CLEVR-CoGenT dataset.

Functions#

format_answer_fromtags

Extract content between tags and strip whitespace.

format_clevr_cogent_dataset

Format the CLEVR-CoGenT dataset into an OpenAI-API-like message log.

API#

nemo_rl.data.datasets.response_datasets.clevr.format_answer_fromtags(answer: str) str#

Extract content between tags and strip whitespace.

nemo_rl.data.datasets.response_datasets.clevr.format_clevr_cogent_dataset(
example: dict[str, Any],
return_pil: bool = False,
) dict[str, Any]#

Format the CLEVR-CoGenT dataset into an OpenAI-API-like message log.

class nemo_rl.data.datasets.response_datasets.clevr.CLEVRCoGenTDataset(split: str = 'train', **kwargs)#

Bases: nemo_rl.data.datasets.raw_dataset.RawDataset

Simple wrapper around the CLEVR-CoGenT dataset.

Parameters:

split – Split name for the dataset, default is “train”

Initialization