The cuFFT Device Extensions (cuFFTDx) library enables you to perform Fast Fourier Transform (FFT) calculations inside your CUDA kernel. Fusing the FFT with other operations can decrease latency and improve the performance of your application.

cuFFTDx offers the following key features:

  • Fast Fourier Transform (FFT) embeddable into a CUDA kernel

  • High performance, with no unnecessary data movement to and from global memory

  • Customizability, with options to adjust the selection of the FFT routine for different needs (size, precision, batches, etc.)

  • Ability to fuse FFT kernels with other operations, saving round trips to global memory

  • Compatibility with future versions of the CUDA Toolkit