backends.internal.raft.ray_comms
#
Module Contents#
Classes#
Initializes and manages underlying NCCL comms handles across the a pool of
Ray actors. It is expected that |
API#
- class backends.internal.raft.ray_comms.Comms(verbose: bool = False, nccl_root_location: str = 'ray-actor')#
Initializes and manages underlying NCCL comms handles across the a pool of Ray actors. It is expected that
init()
will be called explicitly. It is recommended to also calldestroy()
when the comms are no longer needed so the underlying resources can be cleaned up. This class is not meant to be thread-safe.Initialization
Args: verbose (bool): Print verbose logging. Defaults to False. nccl_root_location (str): Indicates where the NCCL’s root node should be located. [‘client’, ‘worker’, ‘scheduler’, ‘ray-actor’]. Defaults to “ray-actor”.
- create_nccl_uniqueid() None #
- init() None #
Initializes the underlying comms.
- valid_nccl_placements#
‘ray-actor’