Hardware and Software Requirements#
GPU Architectures |
Volta ( |
CUDA |
12.2.2 and above |
CPU architectures |
x86_64, arm64-sbsa |
Operating System |
Linux |
Recommended NVIDIA InfiniBand solutions for accelerated inter-node communication
Required Packages
CUDA Toolkit 12.2.2 and above (https://developer.nvidia.com/cuda-downloads)
HPC-X v2.17.1 and above (https://developer.nvidia.com/networking/hpc-x) - contains OpenUCC and OpenUCX that satisfy cuBLASMp requirements.
NCCL v2.18.5 and above (https://developer.nvidia.com/nccl) - required to achieve better performance
NVSHMEM v3.1 and above (https://developer.nvidia.com/nvshmem)
Recommended Packages
GDRCopy v2.0+ (NVIDIA/gdrcopy) and nv_peer_mem (Mellanox/nv_peer_memory) - Allows underlying communication packages to use GPUDirect RDMA. If you install OpenUCX yourself, it should be configured with GDRCopy support.
Mellanox OFED (https://www.mellanox.com/products/infiniband-drivers/linux/mlnx_ofed) - drivers for NVIDIA InfiniBand Adapters (https://www.nvidia.com/en-us/networking/products/infiniband). If you install OpenUCX yourself, it should be configured with IB communications support.