NVIDIA cuFFTMp documentation

Welcome to the cuFFTMp (cuFFT Multi-process) library.

You can find here

Highlights

  • 2D and 3D distributed-memory FFTs

  • Slabs (1D) and pencils (2D) data decomposition, with arbitrary block sizes

  • MPI interface

  • Low-latency implementation using NVSHMEM, optimized for single-node and multi-node FFT’s