nemo_rl.data.hf_datasets.oasst
#
Module Contents#
Classes#
Functions#
Recusive function that returns all the sub converstaions in a list starting from node tree_obj. |
|
Data#
API#
- nemo_rl.data.hf_datasets.oasst.SYSTEM_PROMPT = <Multiline-String>#
- nemo_rl.data.hf_datasets.oasst.parse_conversations(tree_obj, first: bool = False)#
Recusive function that returns all the sub converstaions in a list starting from node tree_obj.
- Parameters:
tree_obj (obj) – current conversation node
- Returns:
a list of sub conversation threads including the current conversation node
- nemo_rl.data.hf_datasets.oasst.get_data_records(objs)#
- nemo_rl.data.hf_datasets.oasst.download_and_process_oasst(
- output_directory: str = '.',
- seed: int = 42,
- split_ratio: float = 0.95,
- class nemo_rl.data.hf_datasets.oasst.OasstDataset(output_dir: str = '.', seed: int = 42)#
Initialization