The International Conference for High Performance Computing, Networking, Storage, and Analysis

Research and ACM SRC Posters Archive

Explicit Low-Order Finite-Element Wave Simulation Accelerated with Variable-Precision Computing Using INT8 Tensor Cores


Poster Type: Research Posters

Author: Kohei Fujita (The University of Tokyo, RIKEN), Tsuyoshi Ichimura (The University of Tokyo, RIKEN), Muneo Hori (Japan Agency for Marine-Earth Science and Technology), Lalith Maddegedara (The University of Tokyo)

Supervisor:

Abstract: Using low-precision cores for acceleration of PDE-based simulations with sparse or small matrices is often challenging due to the frequent data conversion between high- and low-precision variables, and that the required precision varies in time/space due to the heterogeneity of the target problem. As an example of accelerating such PDE-based simulations, we develop an integer-based variable-precision computing method with low data-conversion costs for low-order explicit finite-element wave propagation simulations. Here, the precision level used for solving the problem is chosen locally to attain simulation accuracy, and is accelerated using INT8 Tensor Cores. This leads to a 3.3-fold speedup from a baseline FP64 CUDA-core-based implementation with equivalent simulation accuracy, with 87% weak efficiency up to 256 compute nodes of the GH200-based Miyabi supercomputer. These ideas are expected to be useful for accelerating other PDE-based problems with sparse or small matrices on computer architectures with high-performance, low-precision cores.

Best Poster Finalist (BP): no
Poster: PDF
Poster Summary: PDF


Back to Poster Archive Listing