nemo_curator.backends.ray_data.executor
nemo_curator.backends.ray_data.executor
nemo_curator.backends.ray_data.executor
Bases: BaseExecutor
Ray Data-based executor for pipeline execution.
This executor:
Convert Ray Data dataset back to list of tasks.
Parameters:
Ray Data dataset containing Task objects
Returns: list[Task]
List of Task objects
Convert list of tasks to Ray Data dataset.
Parameters:
List of Task objects
Returns: Dataset
Ray Data dataset containing Task objects directly
Execute the pipeline stages using Ray Data.
Parameters:
List of processing stages to execute
Initial tasks to process (can be None for empty start)
Returns: list[Task]
list[Task]: List of final processed tasks