Algorithms and Applications of Dynamic Network Analysis using CANDY.
Aashish Pandey (University of North Texas), Arindam Khanda and S.M. Shovan (Missouri University of Science and Technology), Ali Y. Khan (University of North Texas), Boyana Norris (University of Oregon), Sajal K. Das (Missouri University of Science and Technology), and Sanjukta Bhowmick (University of North Texas)
An Approach for Correlating Compiler Optimizations with Runtime Performance.
Befikir Bogale (University of Tennessee, Knoxville); Olga Pearce and Tom Scogland (Lawrence Livermore National Laboratory (LLNL)); and Michela Taufer (University of Tennessee, Knoxville)
Can Lossy Compression Benefit NVMe-based IO?
Darren Ng and Duo Zhang (University of California, Merced); Sheng Di (Argonne National Laboratory (ANL)); Zhaorui Zhang (The Hong Kong Polytechnic University); and Xiaoyi Lu (University of California, Merced)
CATIOS: Time-Resolved I/O-Aware Job Scheduling for HPC Systems.
YuTsen Tseng (Tohoku University, Graduate School of Information Sciences); Masatoshi Kawai (Tohoku University); Keichi Takahashi (University of Osaka); and Hiroyuki Takizawa (Tohoku University)
Configuring Large Language Models for Regional Ocean Model Development.
Aidan Janney (National Center for Atmospheric Research (NCAR), University of Colorado Boulder); Giovanni Seijo-Ellis (University of Puerto Rico, Mayaguez; National Center for Atmospheric Research (NCAR)); and Dan Amrhein (National Center for Atmospheric Research (NCAR))
Distributed 3D Gaussian Splatting for High-Resolution Isosurface Visualization.
Mengjiao Han (Argonne National Laboratory (ANL)); Andres Sewell (Utah State University); Joseph Insley and Janet Knowles (Argonne National Laboratory (ANL)); Victor A. Mateevitsi and Michael E. Papka (Argonne National Laboratory (ANL), University of Illinois Chicago); Steve Petruzza (Utah State University); and Silvio Rizzi (Argonne National Laboratory (ANL))
Exploring Fine-Grained Parallelism in Data-Flow Runtime Systems on Many-Core Systems.
Wenyi Wang and Maxime Gonthier (University of Chicago), Haibin Lai (Southern University of Science and Technology), Poornima Nookala (Intel Corporation), Haochen Pan and Ian Foster (University of Chicago), Ioan Raicu (Illinois Institute of Technology), and Kyle Chard (University of Chicago)
Fast Linear Solvers via AI-Tuned Markov Chain Monte Carlo-based Matrix Inversion.
Anton Lebedev and Won Kyung Lee (STFC Hartree Centre); Soumyadip Ghosh (IBM Thomas J. Watson Research Center); Olha I. Yaman (STFC Hartree Centre); Vassilis Kalantzis, Yingdong Lu, Tomasz Nowicki, Shashanka Ubaru, and Lior Horesh (IBM Thomas J. Watson Research Center); and Vassil Alexandrov (STFC Hartree Centre)
GPU Kernels for Mixture of Experts.
Arthur Feeney (University of California, Irvine); Ying Wai Li (Los Alamos National Laboratory (LANL)); and Aparna Chandramowlishwaran (University of California, Irvine)
Heterogeneity-Aware Task Allocation for Modern HPC Systems.
Sowmya Yellapragada (University of Utah); Jessica Imlau Dagostini (University of California, Santa Cruz); and Kevin Gott and Rebecca Hartman-Baker (Lawrence Berkeley National Laboratory (LBNL))
Inference-as-a-Service Prototype at NERSC.
Colin Thomas (University of Notre Dame); Po-Han Huang (Georgia Institute of Technology); Hilary Utaegbulam (University of Rochester); Johannes Blaschke (ESnet; Lawrence Berkeley National Laboratory (LBNL)); Bruno Coimbra (Fermi National Laboratory); Pengfei Ding, Xiangyang Ju, and Andrew Naylor (ESnet; Lawrence Berkeley National Laboratory (LBNL)); and Michael Wang (Fermi National Laboratory)
JACC: Easy CPU/GPU Performance Portability for Scientific Applications in Julia.
William Godoy, Pedro Valero-Lara, Philip Fackler, Keita Teranishi, and Jeffrey Vetter (Oak Ridge National Laboratory (ORNL)); Jhonny Gonzalez and Jose Gonzalez (The University of Texas at El Paso, Oak Ridge National Laboratory (ORNL)); and Alexis Huante (The University of Texas at Austin, Oak Ridge National Laboratory (ORNL))
Learning To Select Scheduling Algorithms in OpenMP.
Jonas H. Müller Korndörfer (University of Bern, University of Basel); Ali Mohammed and Ahmed Eleliemy (HPE HPC/AI EMEA Lab); Quentin Guilloteau (Inria); and Reto Krummenacher and Florina Ciorba (University of Basel)
Memory-Efficient CFD based on MPS: Effective One-Billion-Cell Resolution on a Single Node.
Junya Onishi (RIKEN Center for Computational Science (R-CCS)); Ayato Takii (Kobe University, Japan; RIKEN Center for Computational Science (R-CCS)); Sangwon Kim (RIKEN Center for Computational Science (R-CCS)); Younghwa Cho (Hokkaido University, Japan); and Makoto Tsubokura (Kobe University, Japan; RIKEN Center for Computational Science (R-CCS))
MPI-SGX: Enabling Confidential Computing for MPI Parallel Applications with Intel SGX Technology.
Kota Shimojima (The University of Electro-Communications, RIKEN Center for Computational Science (R-CCS)); Hayato Yamaki and Hiroki Honda (The University of Electro-Communications); Shinichiro Matsuo (Georgetown University); Atsuko Takefusa (National Institute of Informatics, Japan; RIKEN Center for Computational Science (R-CCS)); and Shinobu Miwa (The University of Electro-Communications, RIKEN Center for Computational Science (R-CCS))
Multi-GPU Implementation and Roofline Analysis of a Numerical Global Ocean Model.
Takateru Yamagishi (Research Organization for Information Science and Technology); Masao Kurogi and Takao Kawasaki (Japan Agency for Marine-Earth Science and Technology); Yoshimasa Matsumura (National Institute for Environmental Studies); and Hiroyasu Hasumi (Atmosphere and Ocean Research Institute, The University of Tokyo)
Numerical Investigation of Radiation Hydrodynamic Instabilities at Scale with FleCSI-HARD.
Måns I. Andersson (KTH Royal Institute of Technology); Isaac C. Bannerman (Rensselaer Polytechnic Institute); Moon B. Hazarika (University of Michigan); Akshit Jariwala (The University of Texas at Austin); Jonathan Mathurin (Florida International University); Madela B. Quashie (Michigan State University); and Julien Loiseau and Hyun Lim (Los Alamos National Laboratory (LANL))
ParaViz3D: MPI Trace Visualization with 3D Video.
Jean-Yves Verhaeghe, Georg Hager, and Ayesha Afzal (Friedrich-Alexander-Universität Erlangen-Nürnberg, Erlangen National High Performance Computing Center)
Process-Based Predictors of Vulnerability Reintroduction.
Samiha Shimmi (Northern Illinois University), Nicholas Synovic (Loyola University Chicago), Mona Rahimi (Northern Illinois University), and George Thiruvathukal (Loyola University Chicago)
A Scalability Study of Quantum Algorithms for Dimensionality Reduction of Multidimensional Data.
Kareem El-Araby (University of Kansas); Thom Popovic (Lawrence Berkeley National Laboratory (LBNL)); Alvir Nobel and Sunday Ogundele (University of Kansas); Katherine Klymko, Daan Camps, and Anastasiia Butko (Lawrence Berkeley National Laboratory (LBNL)); and Esam El-Araby (University of Kansas)
Scalable Execution Framework for R on Manycore Systems.
Xiran Zhang (King Abdullah University of Science and Technology (KAUST)), Javier Conejero (Barcelona Supercomputing Center (BSC)), Sameh Abdulah (King Abdullah University of Science and Technology (KAUST)), Jorge Ejarque (Barcelona Supercomputing Center (BSC)), Ying Sun (King Abdullah University of Science and Technology (KAUST)), Rosa M. Badia (Barcelona Supercomputing Center (BSC)), and David E. Keyes and Marc G. Genton (King Abdullah University of Science and Technology (KAUST))
Shortcut Mixup Policy: Toward Improving Robustness and Speed in Goal-Conditioned RL.
Matthew Hyatt (Loyola University Chicago, Argonne National Laboratory (ANL)); Yassir Atlas, Hal Brynteson, Diego Roa Perdomo, Athena Angara, Mengjiao Han, Joseph Insley, Janet Knowles, Yongho Kim, Victor Mateevitsi, Michael Papka, and Silvio Rizzi (Argonne National Laboratory (ANL)); George Thiruvathukal (Loyola University Chicago, Argonne National Laboratory (ANL)); and Nicola Ferrier (Argonne National Laboratory (ANL))
Template Task based Multiresolution Analysis in Hybrid Environments.
Nilesh Chaturvedi (Institute for Advanced Computational Science, Stony Brook University; Stony Brook University, Department of Applied Mathematics and Statistics); Joseph Schuchart (Institute for Advanced Computational Science, Stony Brook University); and Robert J. Harrison (Institute for Advanced Computational Science, Stony Brook University; Stony Brook University, Department of Applied Mathematics and Statistics)
Tensor Core Accelerated Fast Multipole Method for GROMACS.
Jiamian Huang (Institute of Science Tokyo), Muhammad Umair Sadiq (KTH Royal Institute of Technology), Rio Yokota (Institute of Science Tokyo), and Berk Hess (KTH Royal Institute of Technology)
Towards Application Agnostic HPC Profiling.
Hari Teja Jajula (The University of Alabama, Lawrence Berkeley National Laboratory (LBNL)); Dhruva Kulkarni and Brian Austin (Lawrence Berkeley National Laboratory (LBNL)); and Purushotham Bangalore (The University of Alabama)
Understanding Communication Bottlenecks in Multi-Node LLM Inference.
Prajwal Singhania (University of Maryland); Siddharth Singh (University of Maryland, NVIDIA Corporation); Lannie Dalton Hough and Ishan Revankar (University of Maryland); Harshitha Menon and Charles Jekel (Lawrence Livermore National Laboratory (LLNL)); and Abhinav Bhatele (University of Maryland)