SC25 Proceedings

Workshops Archive

Using Code Coverage to Assess Feature Gaps in MPI Correctness Tool Classification Tests

Workshop: 9th International Workshop on Software Correctness for HPC Applications (Correctness '25)

Authors: Alexander Hück (Technical University of Darmstadt), Simon Schwitanski (RWTH Aachen University), Tim Jammer (Technical University of Darmstadt), Joachim Jenke (RWTH Aachen University), and Yussur Mustafa Oraji and Christian Bischof (Technical University of Darmstadt)

Abstract: We examine the code generator-based MPI correctness benchmark MPI-BugBench (MBB) by analyzing the code coverage it triggers in three tools: MUST, PARCOACH, and clang-tidy. Our analysis complements MBB’s design, which prunes potentially exhaustive test sets based on real-world MPI usage. Our assessment identifies two key limitations in the generated tests: incomplete coverage of MPI features, such as varying-count collectives, and limited structural diversity of the generated tests, such as lack of loops and lack of array-based MPI handles. We find increasing test volume alone offers limited benefit for exercising the tool's analysis code in our assessment. To address these gaps, we propose a new generation level with the missing features and more varied code structures. To that end, we implemented 34 additional tests to exercise previously uncovered analysis code, adding as many as 770 lines of code coverage in MUST with a single test for varying-count collectives.

Back to 9th International Workshop on Software Correctness for HPC Applications (Correctness '25) Archive Listing Back to Full Workshop Archive Listing