Changes and New Features

HPC-X current version provides the following changes and new features:

Category

Description

TL/UCP Special Service Worker

Added support for having a separate UCX UCP worker use UCC service collectives.

For further information, please see TL/UCP Special Service Worker section.

Data Type Support in CUDA Executor Component (EC)

Added out-of-box support for all datatypes and reduction operations for UCC collectives for GPUs.

For further information, please see Data Type Support in CUDA Executor Component section.

EC/CUDA One-shot Kernel with Cooperative Launch

Added support for using a single CUDA kernel for CUDA operations in UCC GPU collectives.

For further information, please see EC/CUDA One-shot Kernel with Cooperative Launch section.

Out-Of-Box Native GPU Allreduce

Added support for the UCC library to detect the NVIDIA NVLink topology and select the best GPU-based algorithms for supported collectives (Allgather/v, Reducescatter/v).

For further information, please seeOut-Of-Box Native GPU Allreduce section.

Bug Fixes

See Bug Fixes.

© Copyright 2023, NVIDIA. Last updated on May 23, 2023.