nemo_rl.data.datasets.response_datasets.dapo_math
Module Contents
Classes
- DAPOMath17KDataset
Functions
- format_dapo_math_17k
- prepare_dapo_math_17k_dataset: Load and split the DAPO Math 17K dataset into train and test sets.
API
- nemo_rl.data.datasets.response_datasets.dapo_math.format_dapo_math_17k(data: dict[str, str | float | int], task_name: str = 'DAPOMath17K')
- nemo_rl.data.datasets.response_datasets.dapo_math.prepare_dapo_math_17k_dataset(seed: int = 42, task_name: str = 'DAPOMath17K')
Load and split the DAPO Math 17K dataset into train and test sets.
- class nemo_rl.data.datasets.response_datasets.dapo_math.DAPOMath17KDataset(seed: int = 42)
Bases: nemo_rl.data.datasets.raw_dataset.RawDataset
Initialization
Initialize the DAPO Math 17K dataset with train split.
- Parameters:
seed – Random seed for reproducible splitting
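The seed parameter exists so that the same split is produced on every run. The sketch below illustrates that idea in plain Python. It is not the code used by this module: the helper name split_train_test, the test_fraction argument, and the toy records are all hypothetical and chosen only for illustration.

```python
import random


def split_train_test(records, test_fraction=0.1, seed=42):
    """Hypothetical helper: shuffle with a private seeded RNG, then slice."""
    rng = random.Random(seed)  # local RNG, so the global random state is untouched
    shuffled = list(records)   # copy, so the caller's list is not mutated
    rng.shuffle(shuffled)
    n_test = int(len(shuffled) * test_fraction)
    # The first n_test shuffled records become the test set, the rest the train set.
    return shuffled[n_test:], shuffled[:n_test]


# Toy records standing in for dataset rows (made up for this example).
records = [{"problem": f"q{i}", "answer": i} for i in range(20)]

train_a, test_a = split_train_test(records, test_fraction=0.25, seed=42)
train_b, test_b = split_train_test(records, test_fraction=0.25, seed=42)
```

Calling the helper twice with the same seed yields identical train and test lists, which is the property a fixed default such as seed = 42 is meant to provide.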