The International Conference for High Performance Computing, Networking, Storage, and Analysis

Workshops Archive

Development of a performance portable distributed FFT interface on top of the Kokkos ecosystem


Workshop: 2025 International Workshop on Performance, Portability, and Productivity in HPC (P3HPC)

Authors: Yuuichi Asahi and Trévis Morvany (CEA Saclay, Maison de la Simulation); Thomas Padioleau and Julien Bigot (CEA Saclay)

Abstract: This paper presents the development of a performance-portable distributed FFT implementation on top of the Kokkos ecosystem. By building on Kokkos and kokkos-fft, we greatly simplify the implementation of a distributed FFT while retaining performance portability. We introduce new features such as batched distributed FFTs and interfaces to vendor distributed FFT libraries. We demonstrate that our distributed FFT runs efficiently on NVIDIA A100 and AMD MI250X GPUs while maintaining reasonable performance on CPUs.

