nemo_curator.backends.experimental.ray_actor_pool.shuffle_adapter
Module Contents
Classes
API
Bases: BaseStageAdapter
Ray actor that wraps a shuffle stage and its actor.
This adapter manages the lifecycle of a shuffle actor (like LSHActor) and provides a uniform interface for the executor.
_batch_size
output_nparts
Extract shuffled data and write to output files.
Get the batch size for this stage.
Finish the insertion phase and trigger shuffle.
Read and insert tasks into the shuffler.
Set up shuffle workers and stage.
Note: This method is not used in the current implementation since we use the Ray Data pattern of calling setup_on_node before actor creation.
Set up the root actor.
Set up UCXX communication.
Clean up resources.
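The method descriptions above suggest a three-phase lifecycle: insert tasks in batches, finish the insertion phase, then extract the shuffled partitions. The sketch below illustrates that flow with a toy in-memory shuffler and no Ray or UCXX dependency. Every name in it (ToyShuffler, ToyShuffleAdapter, run_lifecycle, and the method names get_batch_size, read_and_insert, insert_finished, extract_and_write, teardown) is an assumption inferred from the one-line descriptions, not the module's confirmed API, and the modulo partitioning is a stand-in for whatever the real shuffle actor does.

```python
from collections import defaultdict


class ToyShuffler:
    """Hypothetical in-memory stand-in for a shuffle actor (not LSHActor)."""

    def __init__(self, output_nparts: int) -> None:
        self.output_nparts = output_nparts
        self._partitions = defaultdict(list)
        self._finished = False

    def insert(self, records):
        # Route each (key, value) record to a partition by key modulo nparts.
        if self._finished:
            raise RuntimeError("insertion phase already finished")
        for key, value in records:
            self._partitions[key % self.output_nparts].append((key, value))

    def finish(self):
        # Close the insertion phase; extraction is only allowed afterwards.
        self._finished = True

    def extract(self):
        if not self._finished:
            raise RuntimeError("call finish() before extract()")
        return {part: sorted(rows) for part, rows in sorted(self._partitions.items())}


class ToyShuffleAdapter:
    """Hypothetical adapter giving an executor one uniform interface."""

    def __init__(self, shuffler: ToyShuffler, batch_size: int = 2) -> None:
        self._shuffler = shuffler
        self._batch_size = batch_size
        self.output_nparts = shuffler.output_nparts

    def get_batch_size(self) -> int:
        return self._batch_size

    def read_and_insert(self, tasks):
        # Each task is a list of (key, value) records in this sketch.
        for task in tasks:
            self._shuffler.insert(task)

    def insert_finished(self):
        self._shuffler.finish()

    def extract_and_write(self):
        # The real adapter writes output files; this sketch returns a dict.
        return self._shuffler.extract()

    def teardown(self):
        self._shuffler = None


def run_lifecycle(tasks, output_nparts=2, batch_size=2):
    """Drive the adapter the way an executor might: insert, finish, extract."""
    adapter = ToyShuffleAdapter(ToyShuffler(output_nparts), batch_size)
    step = adapter.get_batch_size()
    for start in range(0, len(tasks), step):
        adapter.read_and_insert(tasks[start:start + step])
    adapter.insert_finished()
    result = adapter.extract_and_write()
    adapter.teardown()
    return result
```

Calling run_lifecycle with a list of tasks returns a dict mapping each partition index to its sorted records. The explicit finish step matters because extraction is only meaningful once every record has been inserted, which is why the sketch raises an error if the phases are called out of order.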