The International Conference for High Performance Computing, Networking, Storage, and Analysis

Research and ACM SRC Posters Archive

Evaluating the Power-Monitoring Capabilities of Aurora


Poster Type: ACM Student Research Competition, Graduate

Author: Precious Eyabi (Argonne National Laboratory (ANL))

Supervisor: Kazutomo Yoshii (Argonne National Laboratory (ANL))

Abstract: Exascale systems like Aurora push performance bounds but they draw tens of megawatts, making precise, low-overhead power monitoring essential for efficiency and cost control. We present an ongoing evaluation of the two primary power-monitoring interfaces on Aurora, quantifying accuracy and temporal granularity from a single node to a system level. Our contribution is a reproducible methodology, combining HPC benchmarks, mini-apps, and spectral analysis, to determine when each tool is trustworthy and how to configure sampling. Preliminary results characterize sampling limits and overhead trade-offs. Complete results are in progress and we seek to deduce if our current methods of power monitoring are suitable for exascale levels. In the poster, we will share the evaluation framework, early comparative results, and actionable best practices for exascale power studies.

Best Poster Finalist (BP): no
Poster: PDF
Poster Summary: PDF


Back to Poster Archive Listing