Network Transport and Fabric Visibility
Backend Switch Fabric API
The purpose of this API is to expose enough information about the cluster's network topology to enable efficient scheduling, placement, and optimization of multi-node GPU workloads. Understanding the network hierarchy between compute instances and switches, as well as both intra- and inter-node NVLink domains, is essential for minimizing communication latency and maximizing throughput. The API therefore covers the North-South, East-West, and NVLink networks, but not the management (MGMT) network. See the appendix for a DGXC recommended reference implementation.
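To illustrate how a scheduler might consume this kind of topology information, the following is a minimal sketch and not the reference implementation. The record type `NodeTopology`, its fields (`instance_id`, `switch_path`, `nvlink_domain`), and the function `topology_distance` are all hypothetical names chosen for this example; the actual API schema may differ. The sketch assumes each instance reports its switch hierarchy from the nearest switch outward, plus an optional inter-node NVLink domain identifier.

```python
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass(frozen=True)
class NodeTopology:
    """Hypothetical per-instance topology record (field names are assumptions)."""
    instance_id: str
    # Switches ordered from the closest (leaf) to the farthest (e.g. spine).
    switch_path: Tuple[str, ...]
    # Inter-node NVLink domain the instance belongs to, if any.
    nvlink_domain: Optional[str] = None


def topology_distance(a: NodeTopology, b: NodeTopology) -> int:
    """Coarse distance between two instances.

    0     -> both instances share an NVLink domain
    k + 1 -> the first shared switch is at level k of the hierarchy
    else  -> one more than the deepest hierarchy (no shared switch found)
    """
    if a.nvlink_domain is not None and a.nvlink_domain == b.nvlink_domain:
        return 0
    for level, (sw_a, sw_b) in enumerate(zip(a.switch_path, b.switch_path)):
        if sw_a == sw_b:
            return level + 1
    return max(len(a.switch_path), len(b.switch_path)) + 1


# Illustrative data only; identifiers are made up for this example.
n1 = NodeTopology("node-1", ("leaf-a", "spine-1"), nvlink_domain="nvl-0")
n2 = NodeTopology("node-2", ("leaf-a", "spine-1"), nvlink_domain="nvl-0")
n3 = NodeTopology("node-3", ("leaf-b", "spine-1"))
n4 = NodeTopology("node-4", ("leaf-c", "spine-2"))

# Rank candidate peers for node-1 from closest to farthest.
ranked = sorted([n4, n3, n2], key=lambda n: topology_distance(n1, n))
print([n.instance_id for n in ranked])
```

A placement policy built on such a metric would prefer peers in the same NVLink domain, then peers under the same leaf switch, and only then peers reachable through higher switch tiers.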