Poster Type: Research Posters
Author: Kohei Fujita (The University of Tokyo, RIKEN), Tsuyoshi Ichimura (The University of Tokyo, RIKEN), Muneo Hori (Japan Agency for Marine-Earth Science and Technology), Lalith Maddegedara (The University of Tokyo)
Supervisor:
Abstract: Using low-precision cores for acceleration of PDE-based simulations with sparse or small matrices is often challenging due to the frequent data conversion between high- and low-precision variables, and that the required precision varies in time/space due to the heterogeneity of the target problem. As an example of accelerating such PDE-based simulations, we develop an integer-based variable-precision computing method with low data-conversion costs for low-order explicit finite-element wave propagation simulations. Here, the precision level used for solving the problem is chosen locally to attain simulation accuracy, and is accelerated using INT8 Tensor Cores. This leads to a 3.3-fold speedup from a baseline FP64 CUDA-core-based implementation with equivalent simulation accuracy, with 87% weak efficiency up to 256 compute nodes of the GH200-based Miyabi supercomputer. These ideas are expected to be useful for accelerating other PDE-based problems with sparse or small matrices on computer architectures with high-performance, low-precision cores.
Best Poster Finalist (BP): no
Poster: PDF
Poster Summary: PDF