`nemo_rl.data.datasets.response_datasets.dapo_math`#

Module Contents#

`format_dapo_math_17k`
`prepare_dapo_math_17k_dataset`	Load and split the DeepScaler dataset into train and test sets.

nemo_rl.data.datasets.response_datasets.dapo_math.format_dapo_math_17k( data: dict[str, str | float | int], task_name: str = 'DAPOMath17K', ) → dict[str, list[Any] | str]#

nemo_rl.data.datasets.response_datasets.dapo_math.prepare_dapo_math_17k_dataset( seed: int = 42, task_name: str = 'DAPOMath17K', ) → dict[str, datasets.Dataset | None]#: Load and split the DeepScaler dataset into train and test sets.

class nemo_rl.data.datasets.response_datasets.dapo_math.DAPOMath17KDataset(seed: int = 42)#

Initialization

Initialize the DAPO Math 17K dataset with train split.