nemo_rl.data.datasets.response_datasets.tulu3#
Module Contents#
Classes#
Tulu3 SFT mixture dataset. |
Functions#
Format for Tulu3 SFT data. |
API#
- nemo_rl.data.datasets.response_datasets.tulu3.format_tulu3_sft_mixture(
- data: dict[str, Any],
- task_name: str = 'tulu3_sft_mixture',
Format for Tulu3 SFT data.
- class nemo_rl.data.datasets.response_datasets.tulu3.Tulu3SftMixtureDataset(
- seed: int = 42,
- test_size: float = 0.05,
- prompt_file: str | None = None,
- max_samples: int | None = None,
Bases:
nemo_rl.data.datasets.raw_dataset.RawDatasetTulu3 SFT mixture dataset.
Initialization
Initialize the Tulu3 SFT mixture dataset.
- Parameters:
seed – Random seed for train/validation split
test_size – Proportion of data to use for validation (0.0-1.0)
prompt_file – Optional prompt file path to be applied via TaskDataSpec
max_samples – Optional maximum number of samples to use from the dataset