nemo_automodel.components.moe.uccl_ep#
UCCL-EP integration for expert parallelism.
UCCL-EP (https://github.com/uccl-project/uccl/tree/main/ep) has the same interface and functionality as DeepEP, and enables GPU-initiated communication for MoE models across heterogeneous GPUs and NICs.
Vendored files (from https://github.com/uccl-project/uccl/tree/main/ep/deep_ep_wrapper): _buffer.py <- deep_ep_wrapper/deep_ep/buffer.py _utils.py <- deep_ep_wrapper/deep_ep/utils.py
Usage: Set dispatcher: uccl_ep in the model backend config.
Submodules#
Package Contents#
Data#
API#
- nemo_automodel.components.moe.uccl_ep.__all__#
[‘UCCLBuffer’]