core.resharding.copy_services.gloo_copy_service
Module Contents
Classes
GlooCopyService | CopyService implementation that routes refit traffic over a CPU/Gloo process group instead of NCCL.
Data
API
- core.resharding.copy_services.gloo_copy_service.logger
‘getLogger(…)’
- class core.resharding.copy_services.gloo_copy_service.GlooCopyService(group=None)
Bases: core.resharding.copy_services.base.CopyService
CopyService implementation that routes refit traffic over a CPU/Gloo process group instead of NCCL.
Initialization
- submit_send(src_tensor: torch.Tensor, dest_rank: int, task_id: Optional[int] = None)
- submit_recv(dest_tensor: torch.Tensor, src_rank: int, task_id: Optional[int] = None)
- run()
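The method names above suggest a queue-then-execute pattern: transfers are registered with submit_send and submit_recv, and run() carries them out. The sketch below illustrates that call shape only. It is not the GlooCopyService implementation and does not use torch or Gloo: `ToyFabric`, `ToyCopyService`, and `demo` are hypothetical names, Python lists stand in for CPU tensors, threads stand in for ranks, and both the "post all sends, then receive" ordering and the task_id check are assumptions made for the illustration.

```python
import queue
import threading
from collections import defaultdict
from typing import Optional


class ToyFabric:
    """Hypothetical in-process stand-in for a CPU process group.

    Holds one FIFO queue per (source rank, destination rank) pair.
    """

    def __init__(self) -> None:
        self._channels = defaultdict(queue.Queue)
        self._lock = threading.Lock()

    def channel(self, src: int, dst: int) -> queue.Queue:
        # The lock guards lazy creation of a channel from several threads.
        with self._lock:
            return self._channels[(src, dst)]


class ToyCopyService:
    """Hypothetical mock mirroring the submit_send / submit_recv / run shape."""

    def __init__(self, rank: int, fabric: ToyFabric) -> None:
        self.rank = rank
        self.fabric = fabric
        self._sends = []
        self._recvs = []

    def submit_send(self, src_buf: list, dest_rank: int,
                    task_id: Optional[int] = None) -> None:
        # Only record the request; nothing is transferred yet.
        self._sends.append((src_buf, dest_rank, task_id))

    def submit_recv(self, dest_buf: list, src_rank: int,
                    task_id: Optional[int] = None) -> None:
        self._recvs.append((dest_buf, src_rank, task_id))

    def run(self) -> None:
        # Assumption: post every send before blocking on any receive,
        # so two peers exchanging data never wait on each other.
        for src_buf, dest_rank, task_id in self._sends:
            self.fabric.channel(self.rank, dest_rank).put((task_id, list(src_buf)))
        for dest_buf, src_rank, task_id in self._recvs:
            got_id, payload = self.fabric.channel(src_rank, self.rank).get(timeout=5)
            if got_id != task_id:
                raise RuntimeError(f"expected task {task_id}, got {got_id}")
            dest_buf[:] = payload  # overwrite the destination buffer in place
        self._sends.clear()
        self._recvs.clear()


def demo() -> tuple:
    """Exchange two buffers between simulated ranks 0 and 1."""
    fabric = ToyFabric()
    svc0, svc1 = ToyCopyService(0, fabric), ToyCopyService(1, fabric)
    out0, out1 = [0, 0], [0, 0, 0]

    svc0.submit_send([1, 2, 3], dest_rank=1, task_id=7)
    svc0.submit_recv(out0, src_rank=1, task_id=8)
    svc1.submit_send([4, 5], dest_rank=0, task_id=8)
    svc1.submit_recv(out1, src_rank=0, task_id=7)

    # Each simulated rank executes its queued transfers in its own thread.
    threads = [threading.Thread(target=s.run) for s in (svc0, svc1)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return out0, out1
```

In this toy version, each send must have a matching receive on the peer with the same task_id and the same submission order per channel. Whether GlooCopyService imposes the same requirement is not stated on this page.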