SC25 Proceedings

Research and ACM SRC Posters

Accelerating AI Co-scientists with HPC Infrastructure. Suryatejas Appana (University of California, Berkeley)

Accelerating Linear Solve with Mixed Precision Nested Recursive Subdivision on AI Hardware. Vicki Carrica (Massachusetts Institute of Technology (MIT))

Accelerating Scientific Workflows with LLM-Driven Compiler Optimizations for Generated High-Performance Hardware. Robert Ramstad, Nicolas Bohm Agostini, and Antonino Tumeo (Pacific Northwest National Laboratory (PNNL))

Advancing EEG Signal Analysis with Quantum Machine Learning. Stephanie Murray (University of Washington Bothell, University of Hawaii at Manoa) and Erika Parsons (University of Washington Bothell)

AdversaGuard: A Distributed Data Poisoning Benchmark for Parallel AI. Yulia Kumar (Kean University, Rutgers University); Solomon Thomas, Dejaun Gayle, and J. Jenny Li (Kean University); and Dov Kruger (Rutgers University)

An Agent-Based Viral Venture: Adaptive Tool Selection for Scalable Genomics. Naomi Kolodisner (University of Arizona), Alok Kamatar (Advisor) (University of Chicago), and J. Greg Pauloski (Advisor) (NVIDIA Corporation)

Algorithms and Applications of Dynamic Network Analysis using CANDY. Aashish Pandey (University of North Texas), Arindam Khanda and S.M Shovan (Missouri University of Science and Technology), Ali Y. Khan (University of North Texas), Boyana Norris (University of Oregon), Sajal K. Das (Missouri University of Science and Technology), and Sanjukta Bhowmick (University of North Texas)

Analyzing Dataset Popularity for Optimizing In-network Storage. Gunwoo Kim (University of California, Davis) and Alex Sim and Kesheng Wu (ESnet; Lawrence Berkeley National Laboratory (LBNL))

Applying Lossy Compression Techniques to GNN Training. Milan Shah, Reece Neff, and Michela Becchi (North Carolina State University)

An Approach for Correlating Compiler Optimizations with Runtime Performance. Befikir Bogale (University of Tennessee, Knoxville); Olga Pearce (Lawrence Livermore National Laboratory); Tom Scogland (Lawrence Livermore National Laboratory (LLNL)); and Michela Taufer (University of Tennessee, Knoxville)

AutoSlim: Intelligent Automata Graph Optimization for Efficient Acceleration. Tiffany Yu and Rasha Karakchi (University of South Carolina)

Between the NIC and a Hard Place: Evaluating 400 Gb/s Ethernet for HPC Data Transfers. Adelle Ferris, Evelyn Needham, Nikole Grandez, Jesse Martinez, and Doug Egan (Los Alamos National Laboratory (LANL))

Bridging the Quantum Coding Gap: Instruction-Tuned LLMs for Qiskit. Sixu Chen, Yuqi Zhang, and Qiang Guan (Kent State University)

Building the Foundation for Machine Learning-Based Mars Weather Forecasting. Mohammad Altiwainy (Wayne State University)

C++ Standard Parallelism for GPU Programming in a Particle-In-Cell Application. Ester El khoury, Mathieu Lobet, and Julien Bigot (CEA Saclay) and Laurent Colombet (CEA Dam)

Can Long-Haul RDMA Benefit Federated Learning?. Zhonghao Chen, Yuke Li, Duo Zhang, and Xiaoyi Lu (University of California, Merced)

Can Lossy Compression Benefit NVMe-based IO?. Darren Ng and Duo Zhang (University of California, Merced); Sheng Di (Argonne National Laboratory (ANL)); Zhaorui Zhang (The Hong Kong Polytechnic University); and Xiaoyi Lu (University of California, Merced)

CATIOS: Time-Resolved I/O-Aware Job Scheduling for HPC Systems. YuTsen Tseng (Tohoku University, Graduate School of Information Sciences); Masatoshi Kawai (Tohoku University); Keichi Takahashi (University of Osaka); and Hiroyuki Takizawa (Tohoku University)

Chameleon Concierge: Retrieval-Augmented Generation (RAG) To Enhance Open Testbed Documentation. Saieda Ali Zada (University of Delaware)

Characterizing Performance and Energy Trade-offs on the Aurora Supercomputer. Solomon Bekele, Swann Perarnau, and Brice Videau (Argonne National Laboratory (ANL))

ChatHPC: Building the Foundations for a Productive and Trustworthy AI-Assisted HPC Ecosystem. Pedro Valero-Lara, Aaron Young, Mohammad Alaul Haque Monil, Swaroop Pophale, Zheming Jin, Jeffrey S. Vetter, Keita Teranishi, and William F. Godoy (Oak Ridge National Laboratory (ORNL))

CIRE: LLVM Analysis for Floating-Point Rounding Error Affected by Precision and Optimizations. Cayden Lund, Tanmay Tirpankar, and Ganesh Gopalakrishnan (University of Utah)

Classifying Performance Bounds Using Machine Learning. Lewis Littman and Tom Deakin (University of Bristol)

Compute System Simulator: Modeling the Impact of Allocation Policy and Hardware Reliability on HPC Cloud Resource Utilization. Jarrod Leddy and Huseyin Yildiz (Microsoft Corporation)

Configuring Large Language Models for Regional Ocean Model Development. Aidan Janney (National Center for Atmospheric Research (NCAR), University of Colorado Boulder); Giovanni Seijo-Ellis (University of Puerto Rico, Mayaguez; National Center for Atmospheric Research (NCAR)); and Dan Amrhein (National Center for Atmospheric Research (NCAR))

CROSS-HPC System Bayesian Optimization with Adaptive Transfer. Abrar Hossain and Kishwar Ahmed (University of Toledo)

csDF: A Double-Float Arithmetic Library for the Cerebras CS-2. Reo Nagashima, Akeru Nakamura, Kai Murakami, and Ryunosuke Matsuzaki (Meiji University); Daichi Mukunoki (Nagoya University); and Takaaki Miyajima (Meiji University)

CUR-MoE: Portable Mixture-of-Experts with High-Ratio Compression. Ritesh Bhirud (University of Massachusetts Amherst, Massachusetts Institute of Technology (MIT))

CUR-MoE: Portable Mixture-of-Experts with Interpretable High-Ratio Compression. Ritesh Bhirud (University of Massachusetts Amherst, Massachusetts Institute of Technology (MIT))

Detecting Silent Data Corruption in Sparse Matrices Using Hardware Performance Counters. Minseop Choi, Orlando Arias, and Seung Woo Son (University of Massachusetts Lowell)

DiffPro: Joint Timestep and Layer-Wise Precision Optimization for Efficient Diffusion Inference. Farhana Amin (Virginia Tech), Kanchon Gharami (Embry-Riddle Aeronautical University), and Dimitrios S. Nikolopoulos (Virginia Tech)

DiOMP-Offloading: Portable OpenMP Offloading for Distributed Heterogeneous Systems. Baodi Shan (Stony Brook University), Mauricio Araya-Polo (TotalEnergies), and Barbara Chapman (Stony Brook University)

Distributed 3D Gaussian Splatting for High-Resolution Isosurface Visualization. Mengjiao Han (Argonne National Laboratory (ANL)); Andres Sewell (Utah State University); Joseph Insley and Janet Knowles (Argonne National Laboratory (ANL)); Victor A. Mateevitsi and Michael E. Papka (Argonne National Laboratory (ANL), University of Illinois Chicago); Steve Petruzza (Utah State University); and Silvio Rizzi (Argonne National Laboratory (ANL))

Distributed Modular Digital Twin Network for High-Performance and Reliable Data Centers. Yan Chen, Xing Lu, Cary Faulkner, Alex Vlachokostas, Hanlong Wan, and Jeremy Lerond (Pacific Northwest National Laboratory (PNNL))

Divergence Prediction System for CFD Simulations. Takashi Soga (The University of Osaka), Takanori Uchida (Kyushu University), and Susumu Date (The University of Osaka)

Divide, Conquer, and Denoise: Hybrid Parallel Diffusion with Memory-Aware Coarse-to-Fine Inference. Farhana Amin (Virginia Tech), Kanchon Gharami (Embry-Riddle Aeronautical University), and Dimitrios Nikolopoulos (Virginia Tech)

Echoes of Earth: Building an Autonomous Environmental Lab for Acoustic Sensing. Hudson Reynolds (Boston University); Alex Tuecke (Worcester Polytechnic Institute); Mike Sherman (University of Chicago); and Kate Keahey (Argonne National Laboratory (ANL), University of Chicago)

An Efficient GEMM Acceleration Method for LLM Inference with Variable-Length Sequences. Yu Zhang and Lu Lu (South China University of Technology)

Enabling Efficient Runtime Data Analysis to a Crystal Deformation Simulation. Arthur Jaquard (Inria)

Enabling Real-Time, Extreme-Scale Bayesian Inference: FFT-Based GPU-Accelerated Matrix-Vector Products for Block-Triangular Toeplitz Matrices. Sreeram Venkat and Omar Ghattas (The University of Texas at Austin)

Energy-Efficient Multimodal LLM Inference: Stage-Level Characterization and Input-Aware Controls. Mona Moghadampanah, Adib Rezaei Shahmirzadi, and Dimitrios S. Nikolopoulos (Virginia Tech)

Enhancing Usability and Performance in Experimental Environments Management. Zahra Temori (University of Delaware), Paul Marshall (UChicago Department of Computer Science), and Kate Keahey (Argonne National Laboratory (ANL))

European Open Web Index: Large Complex Graph Visualization. Pavlina Smolkova and Katerina Slaninova (IT4Innovations, VSB - Technical University of Ostrava)

Evaluating LiDAR Compression for 3D Semantic Segmentation in Diverse Off-Road Environments on GOOSE Dataset. Adam Niemczura, Max Faykus, Oyinlolu Odetoye, Melissa Smith, Jon Calhoun, and Scott Groel (Clemson University)

Evaluating The Power-Monitoring Capabilities of Aurora. Precious Eyabi (Argonne National Laboratory (ANL))

Evaluating the Usage of Python Libraries on a Production Supercomputer. Thomas Papka (Loyola University Chicago, Argonne National Laboratory (ANL))

Explicit Low-Order Finite-Element Wave Simulation Accelerated with Variable-Precision Computing using INT8 Tensor Cores. Kohei Fujita and Tsuyoshi Ichimura (The University of Tokyo, RIKEN); Muneo Hori (Japan Agency for Marine-Earth Science and Technology); and Lalith Maddegedara (The University of Tokyo)

Exploring Fine-Grained Parallelism in Data-Flow Runtime Systems on Many-Core Systems. Wenyi Wang and Maxime Gonthier (University of Chicago), Haibin Lai (Southern University of Science and Technology), Poornima Nookala (Intel Corporation), Haochen Pan and Ian Foster (University of Chicago), Ioan Raicu (Illinois Institute of Technology), and Kyle Chard (University of Chicago)

Facilitating Mixed Python-Fortran HPC Codes: 4D Drift-Kinetic Simulations with Pyccel. Emily Bourne (Swiss Federal Institute of Technology Lausanne (EPFL), EPFL) and Yaman Güçlü (Max Planck Institute for Plasma Physics, Division of Numerical Methods in Plasma Physics)

Fast Linear Solvers via AI-Tuned Markov Chain Monte Carlo-based Matrix Inversion. Anton Lebedev and Won Kyung Lee (STFC Hartree Centre); Soumyadip Ghosh (IBM Thomas J. Watson Research Center); Olha I. Yaman (STFC Hartree Centre); Vassilis Kalantzis, Yingdong Lu, Tomasz Nowicki, Shashanka Ubaru, and Lior Horesh (IBM Thomas J. Watson Research Center); and Vassil Alexandrov (STFC Hartree Centre)

A Formal Characterization of Non-Monotonicity in Tensor Cores. Paul Jiang (Purdue University) and Vivian Zheng (Stony Brook University)

Forward Error Bounds and Efficient Algorithms for Computing a Tensor Times Matrix Chain in Low Precision on GPUs. Julian Bellavita (Cornell University, Oak Ridge National Laboratory (ORNL)) and Piyush Sao and Ramakrishnan Kannan (Oak Ridge National Laboratory (ORNL))

From Legacy to Portable: An Agentic AI Workflow for Fortran Code Translation and Cross-Architecture Optimization. Sparsh Gupta (Los Alamos National Laboratory (LANL), Franklin W. Olin College of Engineering) and Kamalavasan Kamalakkannan, Maxim Moraru, Galen Shipman, and Patrick Diehl (Los Alamos National Laboratory (LANL))

From Petabytes to Predictions: Harnessing Large-Scale NeuroBlu Mental Health Data and ML To Mitigate Medication Non-Adherence. Alyson Collins, Cathy Sandoval, Maya Seshan, Srishti Srivastava, and Josh McWilliams (University of Southern Indiana)

GATSched: Multi-Objective Graph Attention Networks for Energy-Efficient HPC Job Scheduling. Kyrian Adimora (The University of Kansas)

GNNs on Evolving Graphs: A Benchmark of Incremental Updates and Meta-Learning Approaches. Sriram Srinivasan (Bowie State University), Sanjukta Bhowmick (University of North Texas), and Hamdan Alabsi and Rand Obeidat (Bowie State University)

GPU Kernels for Mixture of Experts. Arthur Feeney (University of California, Irvine); Ying Wai Li (Los Alamos National Laboratory (LANL)); and Aparna Chandramowlishwaran (University of California, Irvine)

Hardware-Aware Quantum Circuit Synthesis. Nathan Jones, Akhilesh Bondapalli, Toby Cox, Ian Lewis, and Rong Ge (Clemson University)

Harmony: Converged Supercomputer Scratch and Archival Filesystems. Jake Carroll (The University of Queensland)

Heterogeneity-Aware Task Allocation for Modern HPC Systems. Sowmya Yellapragada (University of Utah); Jessica Imlau Dagostini (University of California, Santa Cruz); and Kevin Gott and Rebecca Hartman-Baker (Lawrence Berkeley National Laboratory (LBNL))

High Performance Batch SVD using GPUs. Ahmad Abdelfattah (University of Tennessee, Knoxville)

High-Performance Sparse Attention on Tensor Cores: Fused3S and Beyond. Zitong Li (University of California, Irvine)

HydraCache: LLM Inference Prefill Parallelization Through Distributed Cache Blending. Adib Rezaei Shahmirzadi (Virginia Tech), Shayan Shabihi (University of Maryland), Mona Moghadampanah (Virginia Tech), Furong Huang (University of Maryland), and Dimitrios S. Nikolopoulos (Virginia Tech)

The Impact of Maximum Vector Length on Cache Management Techniques in RISC-V Vector Extension. Shunya Nomura (Tohoku University); Jiaheng Liu (RIKEN Center for Computational Science (R-CCS)); Keichi Takahashi (The University of Osaka, Tohoku University); and Hiroyuki Takizawa (Tohoku University)

IncineRate: Multi-Modal FPGA Accelerator for SCNNs. Björn A. Lindqvist and Artur Podobas (KTH Royal Institute of Technology)

Inference-as-a-Service Prototype at NERSC. Colin Thomas (University of Notre Dame); Po-Han Huang (Georgia Institute of Technology); Hilary Utaegbulam (University of Rochester); Johannes Blaschke (ESnet; Lawrence Berkeley National Laboratory (LBNL)); Bruno Coimbra (Fermi National Laboratory); Pengfei Ding, Xiangyang Ju, and Andrew Naylor (ESnet; Lawrence Berkeley National Laboratory (LBNL)); and Michael Wang (Fermi National Laboratory)

Intelligent Surrogates Pay Attention to Data, Improving Multi-Objective HPC Optimization. Ashna Nawar Ahmed (Texas State University, Oak Ridge National Laboratory (ORNL)); Banooqa Banday (Texas State University); Terry Jones (Oak Ridge National Laboratory (ORNL)); and Tanzima Z. Islam (Texas State University)

JACC: Easy CPU/GPU Performance Portability for Scientific Applications in Julia. William Godoy, Pedro Valero-Lara, Philip Fackler, Keita Teranishi, and Jeffrey Vetter (Oak Ridge National Laboratory (ORNL)); Jhonny Gonzalez and Jose Gonzalez (The University of Texas at El Paso, Oak Ridge National Laboratory (ORNL)); and Alexis Huante (The University of Texas at Austin, Oak Ridge National Laboratory (ORNL))

Job Grouping-Based Intelligent Resource Recommendation Framework. Beste Oztop (Boston University); Benjamin Schwaller, Vitus J. Leung, and Jim Brandt (Sandia National Laboratories); and Brian Kulis, Manuel Egele, and Ayse K. Coskun (Boston University)

Julia with Intelligent Runtime for Heterogeneous Computing. Narasinga Rao Miniskar, Pedro Valero-Lara, William Godoy, Keita Teranishi, and Jeffrey S. Vetter (Oak Ridge National Laboratory (ORNL))

A Kokkos-Based Proxy of the Exascale Metagenome Assembler MetaHipMer2: A First Use of Kokkos for Computational Biology. Logan Williams, Gavin Conant, and Michela Becchi (North Carolina State University) and Jan Ciesko and Amy Powell (Sandia National Laboratories)

Learning To Select Scheduling Algorithms in OpenMP. Jonas H. Müller Korndörfer (University of Bern, University of Basel); Ali Mohammed and Ahmed Eleliemy (HPE HPC/AI EMEA Lab); Quentin Guilloteau (Inria); and Reto Krummenacher and Florina Ciorba (University of Basel)

Leveraging Large Language Models for Property Prediction in Polymorphic Organic Semiconductors. Shreya Pagaria (Carnegie Mellon University, Pittsburgh Supercomputing Center) and Mei-Yu Wang, Dana O’Connor, Julian Uran, and Paola Buitrago (Pittsburgh Supercomputing Center)

Local vs. Global FFT Approaches for High-Performance Ultrasound Simulation on Multi-GPU Systems. Oliver Kuník and Jiri Jaros (Faculty of Information Technology, Brno University of Technology)

Luthier: A Dynamic Binary Instrumentation Framework Targeting AMD GPUs. Matin Raayai-Ardakani, Norman Rubin, and David Kaeli (Northeastern University)

Massively Parallel Bayesian Inference Framework for GPU Supercomputers – Application to Estimation of Coseismic Fault Slip –. Kai Nakao, Tsuyoshi Ichimura, and Kohei Fujita (The University of Tokyo)

Massively Parallel GPU Rasterizer for Next-Generation Computational Lithography. Loay Hegazy, Mohamed Taher, and Sherif Hammouda (Siemens EDA)

Memory-Efficient CFD based on MPS: Effective One-Billion-Cell Resolution on a Single Node. Junya Onishi (RIKEN Center for Computational Science (R-CCS)); Ayato Takii (Kobe University, Japan; RIKEN Center for Computational Science (R-CCS)); Sangwon Kim (RIKEN Center for Computational Science (R-CCS)); Younghwa Cho (Hokkaido University, Japan); and Makoto Tsubokura (Kobe University, Japan; RIKEN Center for Computational Science (R-CCS))

Mitigating I/O Bottlenecks in LiDAR Pipelines by Directly Merging Neural Decompression and Semantic Segmentation. Ethan Marquez, Max Faykus, Oyinlolu Odetoye, Melissa Smith, and Jon Calhoun (Clemson University)

Mixed Compute Environments with OpenCHAMI. Sean Gibson, Richard Kim, Samuel Quan, Travis Cotton, and Thomas Mackell (Los Alamos National Laboratory (LANL))

Mojo: Python-Like MLIR-Based GPU Portable Science Kernels. Tatiana Melnichenko (University of Tennessee, Knoxville; Oak Ridge National Laboratory (ORNL))

MPI-SGX: Enabling Confidential Computing for MPI Parallel Applications with Intel SGX Technology. Kota Shimojima (The University of Electro-Communications, RIKEN Center for Computational Science (R-CCS)); Hayato Yamaki and Hiroki Honda (The University of Electro-Communications); Shinichiro Matsuo (Georgetown University); Atsuko Takefusa (National Institute of Informatics, Japan; RIKEN Center for Computational Science (R-CCS)); and Shinobu Miwa (The University of Electro-Communications, RIKEN Center for Computational Science (R-CCS))

Multi-GPU Implementation and Roofline Analysis of a Numerical Global Ocean Model. Takateru Yamagishi (Research Organization for Information Science and Technology); Masao Kurogi and Takao Kawasaki (Japan Agency for Marine-Earth Science and Technology); Yoshimasa Matsumura (National Institute for Environmental Studies); and Hiroyasu Hasumi (Atmosphere and Ocean Research Institute, The University of Tokyo)

Novel Graph Alignment Algorithms for Identifying Non-Determinism in Large-Scale Simulations. Dhroov Pandey (University of North Texas)

Numerical Investigation of Radiation Hydrodynamic Instabilities at Scale with FleCSI-HARD. Måns I. Andersson (KTH Royal Institute of Technology); Isaac C, Bannerman (Rensselaer Polytechnic Institute); Moon B. Hazarika (University of Michigan); Akshit Jariwala (The University of Texas at Austin); Jonathan Mathurin (Florida International University); Madela B. Quashie (Michigan State University); and Julien Loiseau and Hyun Lim (Los Alamos National Laboratory (LANL))

Optimizing and Extending Periodogram Computations for Astronomy. Yuwei Sun (Flatiron Institute, University of Illinois Urbana-Champaign) and Lehman Garrison (Flatiron Institute)

Optimizing Collectives with Large Payloads on GPU-Based Supercomputers. Siddharth Singh (NVIDIA Corporation, University of Maryland); Mahua Singh (IIT Guwahati); and Keshav Pradeep and Abhinav Bhatele (University of Maryland)

Optimizing Task-Driven Offloading in LLVM. Jan Kraus, Joachim Jenke, and Christian Terboven (Chair for High-Performance Computing i12, RWTH Aachen University)

Optimizing the GPU All-Reduce Using Multiple Processes Per GPU. Michael Adams and Amanda Bienz (University of New Mexico)

Orchid: Towards Heterogeneous Batched Eigenvalue Solvers. Matthew Chung (University of California, Riverside; Oak Ridge National Laboratory (ORNL))

Parallel Local Motif Counting on Large-Scale Dynamic Graphs. Ali Khan and Sanjukta Bhowmick (University of North Texas) and Michela Taufer (University of Tennessee, Knoxville)

ParaViz3D: MPI Trace Visualization with 3D Video. Jean-Yves Verhaeghe, Georg Hager, and Ayesha Afzal (Friedrich-Alexander-Universität Erlangen-Nürnberg, Erlangen National High Performance Computing Center)

Performance Engineering of Scientific Applications with MVAPICH and TAU Using Emerging Communication Primitives. Dhabaleswar K. (DK) Panda (The Ohio State University); Sameer Shende (University of Oregon; ParaTools, Inc.); Ahmad Abdelfattah (University of Tennessee, Knoxville); and Yifeng Cui (San Diego Supercomputer Center (SDSC))

PhySiViT: A Physics Simulation Vision Transformer. Jessica Ezemba (Carnegie Mellon University), James Afful (Iowa State University), and Mei-Yu Wang (Pittsburgh Supercomputing Center)

Practical Viability of Translating Legacy Fortran Code to C++ Using Large Language Models. Ren Imai and Masatoshi Kawai (Tohoku University), Keichi Takahashi (The University of Osaka), and Hiroyuki Takizawa (Tohoku University)

Process-Based Predictors of Vulnerability Reintroduction. Samiha Shimmi (Northern Illinois University), Nicholas Synovic (Loyola University Chicago), Mona Rahimi (Northern Illinois University), and George Thiruvathukal (Loyola University Chicago)

Productive Scalable Distributed Task Scheduling Using an MPI-Based Backend for Dagger. Yan Guimarães (University of Brasilia)

A Quantum Solver for Multidimensional Partial Differential Equations: Practical Case Studies. Manu Chaudhary (Illinois State University (ISU)) and Kareem El-Araby, Alvir Nobel, Ishraq Islam, Manish Singh, Sunday Ogundele, Kieran Egan, Sneha Thomas, Vincent Vordtriede, Devon Bontrager, Serom Kim, and Esam El-Araby (University of Kansas)

Range Search on Heterogeneous Systems with Processing-in-Memory Architecture. Tasmia Jannat and Satish Puri (Missouri University of Science and Technology) and Michael Gowanlock (Northern Arizona University,)

Real-Time ML-Based Defense Against Malicious Payload in Reconfigurable Embedded Systems. Rye Stahle-Smith and Rasha Karakchi (University of South Carolina)

A Scalability Study of Quantum Algorithms for Dimensionality Reduction of Multidimensional Data. Kareem El-Araby (University of Kansas); Thom Popovic (Lawrence Berkeley National Laboratory (LBNL)); Alvir Nobel and Sunday Ogundele (University of Kansas); Katherine Klymko, Daan Camps, and Anastasiia Butko (Lawrence Berkeley National Laboratory (LBNL)); and Esam El-Araby (University of Kansas)

Scalable Alternative Route Computation with ACE: A C++17 Library for HPC Traffic Simulations. Paulo Silva, Pavlína Smolková, Kateřina Slaninová, Jan Martinovič, João Barbosa, and Matej Špeťko (IT4Innovations, VSB - Technical University of Ostrava) and Emanuele Vitali (CSC - IT Center for Science)

Scalable Execution Framework for R on Manycore Systems. Xiran Zhang (King Abdullah University of Science and Technology (KAUST)), Javier Conejero (Barcelona Supercomputing Center (BSC)), Sameh Abdulah (King Abdullah University of Science and Technology (KAUST)), Jorge Ejarque (Barcelona Supercomputing Center (BSC)), Ying Sun (King Abdullah University of Science and Technology (KAUST)), Rosa M. Badia (Barcelona Supercomputing Center (BSC)), and David E. Keyes and Marc G. Genton (King Abdullah University of Science and Technology (KAUST))

Scalable Multi-Node Multi-GPU Datalog Engine with Energy-Aware Profiling. Ahmedur Rahman Shovon (Argonne National Laboratory (ANL)) and Sidharth Kumar (University of Illinois Chicago)

Scaling Singular Values Beyond GPU Memory Limits: Out-of-Core, GPU-Accelerated, and Unified Across Data Precision and Hardware. Evelyne Ringoot (Massachusetts Institute of Technology (MIT))

ScODA: An Emerging Pipeline for Evaluating Distributed Database Performance to Support Operational Data Analytics. Nicholas Synovic (Loyola University Chicago); FNU Shilpika, Silvio Rizzi, and Doug Waldron (Argonne National Laboratory (ANL)); George K. Thiruvathukal (Loyola University Chicago); and Michael E. Papka (Argonne National Laboratory (ANL))

Seamless Scaling of Applications Across Programming Models. Reto Krummenacher (University of Basel), Quentin Guilloteau (Inria), Jonas H. Müller Korndörfer (University of Bern), and Florina M. Ciorba (University of Basel)

Shipping HPC Ecosystems Across Platforms: Portable and Composable HPC Clusters as Code. German Felipe Giraldo Villa, Théo Grivel, George Ioannidis, Edita Kizinevic, Carolina Lindqvist, Nicolas Litchinko, Pablo Llopis, Antonio Javier Russo, and Gilles Fourestey (École Polytechnique Fédérale de Lausanne)

Shortcut Mixup Policy: Toward Improving Robustness and Speed in Goal-Conditioned RL. Matthew Hyatt (Loyola University Chicago, Argonne National Laboratory (ANL)); Yassir Atlas, Hal Brynteson, Diego Roa Perdomo, Athena Angara, Mengjiao Han, Joseph Insley, Janet Knowles, Yongho Kim, Victor Mateevitsi, Michael Papka, and Silvio Rizzi (Argonne National Laboratory (ANL)); George Thiruvathukal (Loyola University Chicago, Argonne National Laboratory (ANL)); and Nicola Ferrier (Argonne National Laboratory (ANL))

SRAP: Sender-Side Receiver-Aware Port Selection for High-Speed Multi-Flow TCP. Shingo Hattori and Osamu Tatebe (University of Tsukuba)

Sync-Free GPU Parallelization of Sparse Kernels from Sequential Python Code. Malko-Bani Somo (McMaster University)

Template Task based Multiresolution Analysis in Hybrid Environments. Nilesh Chaturvedi (Institute for Advanced Computational Science, Stony Brook University; Stony Brook University, Department of Applied Mathematics and Statistics); Jospeh Schuchart (Institute for Advanced Computational Science, Stony Brook University); and Robert J. Harrison (Institute for Advanced Computational Science, Stony Brook University; Stony Brook University, Department of Applied Mathematics and Statistics)

Tensor Core Accelerated Fast Multipole Method for GROMACS. Jiamian Huang (Institute of Science Tokyo), Muhammad Umair Sadiq (KTH Royal Institute of Technology), Rio Yokota (Institute of Science Tokyo), and Berk Hess (KTH Royal Institute of Technology)

TidalMark: A Scalable Benchmark for Coastal Water Level Forecasting. Lucas Raicu, Daniel Grzenda, Ian Foster, and Kyle Chard (University of Chicago)

Time-stepping Hamiltonian Simulation for Solving Nonlinear PDEs via a Quantum-Classical Hybrid Approach. Sangwon Kim and Junya Onishi (RIKEN Center for Computational Science (R-CCS)); Ayato Takii (Kobe University, Japan); Younghwa Cho (Hokkaido University, Japan); and Tsubokura Makoto (RIKEN Center for Computational Science (R-CCS); Kobe University, Japan)

A Toolbox for Load Balancing Development and Analysis in WarpX/AMReX Applications. Jessica Imlau Dagostini (University of California, Santa Cruz); Sowmya Yellapragada (University of Utah); and Kevin Gott and Rebecca Hartman-Baker (Lawrence Berkeley National Laboratory (LBNL))

Towards a GPU-accelerated web-based graph rendering framework for large-scale protein networks. Jiaxin Lu and Landon Dyken (University of Illinois Chicago); Shilpika Shilpika and Venkatram Vishwanath (Argonne National Laboratory (ANL)); Michael Papka (University of Illinois Chicago, Argonne National Laboratory (ANL)); and Sidharth Kumar (University of Illinois Chicago)

Towards Application Agnostic HPC Profiling. Hari Teja Jajula (The University of Alabama, Lawrence Berkeley National Laboratory (LBNL)); Dhruva Kulkarni and Brian Austin (Lawrence Berkeley National Laboratory (LBNL)); and Purushotham Bangalore (The University of Alabama)

Understanding Communication Bottlenecks in Multi-Node LLM Inference. Prajwal Singhania (University of Maryland); Siddharth Singh (University of Maryland, NVIDIA Corporation); Lannie Dalton Hough and Ishan Revankar (University of Maryland); Harshitha Menon and Charles Jekel (Lawrence Livermore National Laboratory (LLNL)); and Abhinav Bhatele (University of Maryland)

Understanding GPU Utilization Using LDMS Data on Perlmutter. Onur Cankur (University of Maryland), Brian Austin (Lawrence Berkeley National Laboratory (LBNL)), and Abhinav Bhatele (University of Maryland)

Understanding LLM Behavior on HPC Data via Mechanistic Interpretability. Md Mahbubur Rahman (Iowa State University), Arjun Guha (Northeastern University), and Harshitha Menon (Lawrence Livermore National Laboratory (LLNL))

Unified Performance Modeling Stack for Distributed GPU Applications: Complementing Analytical Insights with Machine Learning. Urvij Saroliya (Technical University of Munich)

Unmasking Performance Variability in GPU Codes on Supercomputers. Cunyang Wei and Keshav Pradeep (University of Maryland, College Park) and Abhinav Bhatele (University of Maryland)

Unraveling Distant Galaxies: Analyzing IFU Data with Parsl and Academy. Daniel Babnigg (University of Chicago)

Using hardware metrics to understand performance of the RAJA Performance Suite kernels in different GPU modes on MI300A. Amr Abouelmagd (Tennessee Tech University) and Stephanie Brink, Michael McKinsey, David Boehme, Jason Burmark, Brian Ryujin, Tom Scogland, and Olga Pearce (Lawrence Livermore National Laboratory (LLNL))

VaultX Merge: Breaking Memory Barriers in Proof-of-Space Plot Generation. Arnav Sirigere, Varvara Bondarenko, and Ioan Raicu (Illinois Institute of Technology)

Wafer-Scale Simulation of Mutator Allele Dynamics in Large Asexual Populations. Matthew Andres Moreno (University of Michigan), Emily Dolson (Michigan State University), and Luis Zaman (University of Michigan)

When Label Propagation Outperforms BFS in Breadth-First Graph Traversal. Kalsuda Lapborisuth and Srinivas Aluru (Georgia Institute of Technology)

WiCAT: Reducing Congestion at Wireless Interfaces in Heterogeneous Architectures. Tarun Sharma (IIIT Delhi)

WONDERS: Integrating WOW, PONDER, and SCALE for Enhanced Scheduling Performance. Fabian Lehmann (Humboldt-Universität zu Berlin); Jonathan Rau, Jonathan Bader, and Odej Kao (Technical University of Berlin); and Ulf Leser (Humboldt-Universität zu Berlin)

SC Technical Program Archives

Research and ACM SRC Posters