Morning Break - ISAV25
In-Transit Data Transport Strategies for Coupled AI-Simulation Workflow Patterns
From Exploration to Explanation: ML-Driven Causal Discovery for Datacenter Reliability at Scale
A Data-Size Adaptive Approach to I/O of Poorly Load Balanced In Situ Data Extracts
Lightning Talk: Differentiable In-Situ Diagnostics for Particle Accelerator Digital Twins
Back to Workshop Archive Listing
Morning Break - ExHetAI
Enabling Unstructured Sparse Fine-Tuning and Inference for Foundation Models on Wafer-Scale Engine
WAGES: Workload-Aware GPU Sharing System for Energy-Efficient Serverless LLM Serving
OmniFed: A Modular Framework for Configurable Federated Learning from Edge to HPC
Enhancing ChatPORT with CUDA-to-SYCL Kernel Translation Capability
Back to Workshop Archive Listing
International Workshop on RESource DISaggregation in High Performance Computing (RESDIS)
Opening Invited Talk: Photonics-Enabled Systems for the Disaggregated Era of Supercomputing and AI
RESDIS'25 Morning Break
Fast on-demand Memory Mapping for Shared Memory and Disaggregated Systems
TEGRA - Scaling Up Graph Processing with Disaggregated Computing
DoCeph: DPU-Offloaded Messaging in Ceph for Reduced Host CPU Utilization
Back to Workshop Archive Listing
Visionary Speaker - Dr Michael Grieves - Digital Twins State of the Art
NVIDIA Earth-2, a digital twin of our planet for tackling climate change
Morning Break
Back to Workshop Archive Listing
Morning Break - HPC Systems Professionals Workshop (HPCSYSPROS25)
Transparent Global File System Access in Environments with Multiple Authentication Domains
Pushing the Limits of Cold Storage for Research Data With Elm: An Open-Source S3-to-Tape System
Back to Workshop Archive Listing
XLOOP 2025: The 7th Annual Workshop on Extreme-Scale Experiment-in-the-Loop Computing
Accelerating Advanced Light Source Science Through Multi-Facility HPC Workflows
Streaming X-ray Detector Data to Remote Facilities Using EJFAT
The LCLStream Ecosystem for Multi-Institutional Dataset Exploration
Morning Break - XLOOP 2025
X-ray Ptychography at the Edge: Towards Real-Time Feedback for High-Speed Nanoimaging
Speeding Up Phonon Dynamic Structure Factor Calculations in Phonopy with GPU-Accelerated Computing
Adapting scientific streaming inference workflows for a deterministic tensor processing unit
AI Agents for Enabling Autonomous Experiments at ORNL's HPC and Manufacturing User Facilities
Lunch - XLOOP 2025
Back to Workshop Archive Listing
Morning Break - Alternatives To MPI+X (PAW-ATM 2025)
Invited Talk: Alternative Parallel Programming Models: Past, Present, and Is There a Future?
KVMSR+UDWeave: Extreme-Scaling with Fine-grained Parallelism on the UpDown Graph Supercomputer
Comparing Distributed-Memory Programming Frameworks with Radix Sort
DiOMP-Offloading: Toward Portable Distributed Heterogeneous OpenMP
Lunch - Alternatives To MPI+X (PAW-ATM 2025)
Weak Scaling of NVSHMEM Applied To Hashed Distributed Structured Data
Redesigning GROMACS Halo Exchange: Improving Strong Scaling with GPU-initiated NVSHMEM
Afternoon Break - Alternatives To MPI+X (PAW-ATM 2025)
Enhancing HPX with FleCSI: Automatic Detection of Implicit Task Dependencies
KDRSolvers: Scalable, Flexible, Task-Oriented Krylov Solvers
Panel Discussion: The Role of Alternatives to MPI+X Technologies in AI/ML
Back to Workshop Archive Listing
Morning Break - Scalable Algorithms for Large-Scale Heterogeneous Systems (ScalAH'25)
Numerical Properties and Scalability of s-Step Preconditioned Conjugate Gradient Methods
Efficient Embedding Initialization via Dominant Eigenvector Projections
Fast Linear Solvers via AI-Tuned Markov Chain Monte Carlo-based Matrix Inversion
Lunch - Scalable Algorithms for Large-Scale Heterogeneous Systems (ScalAH'25)
Afternoon Break - Scalable Algorithms for Large-Scale Heterogeneous Systems (ScalAH'25)
Invited Talk: When the qubits are ready, will the national labs be?
Post-Variational Quantum Neural Networks on a Hybrid HPC-QC System
High-Performance and Power-Efficient Emulation of Matrix Multiplication using INT8 Matrix Engines
Scalable Hydrodynamics on multiple Field-Programmable Gate Arrays (FPGAs)
Back to Workshop Archive Listing
IA^3 2025 – 15th Workshop on Irregular Applications: Architectures & Algorithms
IA^3 - Invited Talk 1 - Maintaining connected components for unending graph streams
Morning Break - IA^3 2025
eIM: GPU-Accelerated Efficient Influence Maximization in Large-Scale Social Networks
Profiling Application-Specific Properties of Irregular Graph Algorithms on GPUs
Performance-Portable Symbolic Factorization through Common Graph Operations
How effective is matrix reordering for improving performance of sparse matrix-vector multiplication?
Lunch - IA^3 2025
IA^3 - Invited Talk 2: Communicating more—asynchronously—saves time
Afternoon Break - IA^3 2025
An Optimized Generalized Multi-Color Point Implicit Solver for Intel GPUs using OneAPI ESIMD
Architecting Tensor Core-Based Reductions for Irregular Molecular Docking Kernels
Benchmarking and Dissecting the Cerebras Wafer Scale Eingin-2 Architecture
Back to Workshop Archive Listing
Evaluation of Test-Time Compute Constraints on Safety and Skill Large Reasoning Models
Batch Tiling on Attention: Efficient Mixture of Experts Training on Wafer-Scale Processors
Morning Break - Frontiers in Generative AI for HPC Science and Engineering: Foundations, Challenges, and Opportunities
Agentic AI vs ML-based Autotuning: A Comparative Study for Loop Reordering Optimization
GridMind: LLMs-Powered Agents for Power System Analysis and Operations
Frameworks for Large Language Model Serving in HPC Environments
Lunch - Frontiers in Generative AI for HPC Science and Engineering: Foundations, Challenges, and Opportunities
Exploring Distributed Vector Databases Performance on HPC Platforms: A Study with Qdrant
Beyond End-to-End: Understanding the Limits of LLMs in Scientific Problem Solving
Afternoon Break - Frontiers in Generative AI for HPC Science and Engineering: Foundations, Challenges, and Opportunities
BioR5: A Three-Layer Architecture for Biological Reasoning in Scientific AI
LABMATE: Language Model Based Multi-Agent System to Accelerate Catalysis Experiments
ChatEED: An agentic retrieval assistant for accelerator operators
Back to Workshop Archive Listing
Morning Break - ExaMPI25
Accelerating Intra-Node GPU Communication: A Performance Model for Multi-Path Transfers
Lunch - ExaMPI25
Scaling All-to-All Operations Across Emerging Many-Core Supercomputers
On the Integration of Lightweight Tasks with MPI using the C++26 std::execution `Senders' API
Afternoon Break - ExaMPI25
Back to Workshop Archive Listing
EduHPC-25: Workshop on Education for High Performance Computing
An Interactive Agentic HPC Tutor for Lesson Planning, Teaching, and Assessment
Teaching Task-Based Parallel Programming with a Runtime Systems-Aware Perspective
Morning Break - EduHPC-25
GPU Programming for AI Workflow Development on AWS SageMaker: An Instructional Approach
Scalable Softmax for Efficient Attention: Parallel and Distributed Strategies
Flood simulation: An assignment for OpenMP, MPI, and CUDA/HIP/OpenCL
Lunch - EduHPC-25
Afternoon Break - EduHPC-25
From Soil to Software: Experience from a STEM Workshop on Smart Plant Care and Teachable Machines
“Offloading” Undergraduate Research to the Graphics Processing Unit for Acceleration
A GPU-Accelerated RAG-Based Telegram Assistant for Supporting Parallel Processing Students
From CPU to GPU: Transforming STEM Education with Parallelism
Back to Workshop Archive Listing
The 12th Annual International Workshop on Innovating the Network for Data Intensive Science (INDIS)
Morning Break - International Workshop on Innovating the Network for Data Intensive Science (INDIS)
Rapid Quantum Network Simulation Design with a Path to Scalable Execution
To Stream or Not to Stream: Towards A Quantitative Model for Remote HPC Processing Decisions
From Path-Aware to Application-Aware Source Routing using Traffic Classes
LLM-Based Optimization Algorithm Selection for High-Performance Networks Orchestration
Optimizing Network Resilience Using Domain-Specific Hardware Accelerator for Dynamic Programming
Lunch - International Workshop on Innovating the Network for Data Intensive Science (INDIS)
Complex Parsing for In-Network Acceleration of High-Energy Physics Experiments
Afternoon Break - International Workshop on Innovating the Network for Data Intensive Science (INDIS)
The quest for high capacity, long reach, industrial strength networks
eCounter: Inline Per-IP Network Monitoring at Millisecond Resolution via eBPF
Implementing Network-level QoS at HPC Datacenters to Enable Distributed Scientific Workflows
Modular Architecture for High-Performance and Low Overhead Data Transfers
From Edge to HPC: Investigating Cross-Facility Data Streaming Architectures
Back to Workshop Archive Listing
Morning Break - Research Software Engineers in HPC (RSE-HPC-2025)
Lunch - Research Software Engineers in HPC (RSE-HPC-2025)
Afternoon Break - Research Software Engineers in HPC (RSE-HPC-2025)
Back to Workshop Archive Listing
Morning Break - Workshop on Large Scale Quantum-Classical Computing
Synergies between classical and quantum HPC through open-source software
JUNIQ: HPC-QC integration. Current implementation and some thoughts on future implementations
Lunch - Workshop on Large Scale Quantum-Classical Computing
qiskit-addon-sqd-hpc: A C++ template library for sample-based quantum diagonalization (SQD)
Afternoon Break - Workshop on Large Scale Quantum-Classical Computing
What can high-performance computing centers do with quantum computing?
Back to Workshop Archive Listing
CASSE: Targeted Threat Modeling for Data Management Libraries
60 Security Professionals Walk Into A Room: Outbrief from the 3rd HPC Security Technical Exchange
Afternoon Break - Workshop on Cyber Security in High Performance Computing (S-HPC)
Threads of Trouble: Unveiling GPU Software and Hardware Security Flaws
Back to Workshop Archive Listing
A Framework for Mapping the Effective and Sustainable Use of Energy
Run-time Energy-Efficiency Optimization for AI and HPC Workloads
Afternoon Break - Sustainable Supercomputing
Optimizing Microgrid Composition for Sustainable Data Centers
Molten Chloride Small Modular Reactor Performance Characteristics for Data Center Operation
EMLIO: Minimizing I/O Latency and Energy Consumption for Large-Scale AI Training
Back to Workshop Archive Listing
14th International Workshop on Runtime and Operating Systems for Supercomputers (ROSS 2025)
Invited Talk: State-of-the-Art Communication Software for Supercomputers and Its Applications
Afternoon Break - Workshop on Runtime and Operating Systems for Supercomputers (ROSS 2025)
Extending the C++ Execution Control Library to Support Dynamic Parallel Runtime Systems
Reproducible Performance Evaluation of OpenMP and SYCL Workloads under Noise Injection
Invited Talk: Multikernel: Kernel-to-Kernel Isolation with Elastic Resource Management
Back to Workshop Archive Listing
MEMO’25: International Workshop on Memory System, Management and Optimization
Panel: Energy-efficient Memory Technology for maximizing bandwidth and reducing latency
Afternoon Break - MEMO’25: International Workshop on Memory System, Management and Optimization
Umpire: Portable Memory Management for High-Performance Computing Applications
The MALL is Open: Exploring Shared Caches and Latency in AMD CDNA™ 3 GPUs
Hardware-Software Co-Design of Iterative Filter-Update Numerical Methods Using Processing-In-Memory
Back to Workshop Archive Listing
HUST-25: 12th International Workshop on HPC User Support Tools
A Modular, Responsive, and Accessible HPC Dashboard Built upon Open OnDemand
Open Composer: A Web-Based Application for Generating and Managing Batch Jobs on HPC Clusters
Is it an HPC Workflow Assistant? Is it a Framework? It's Drona Workflow Engine
Afternoon Break - HUST-25: 12th International Workshop on HPC User Support Tools
Generating Frequently Asked Questions from Technical Support Tickets using Large Language Models
AskHPC: A ChatBot for High Performance Computing User Support
ModuLair: Streamlining Python Virtual Environment Management for HPC
Job Script Portability Tool: Cat-based Localization Scheme (CLS)
AnvilOps: Increasing Accessibility of Kubernetes with Automated Builds and Deployments
Back to Workshop Archive Listing
Fifth International Symposium on Quantitative Co-Design of Supercomputers
Morning Break - Fifth International Symposium on Quantitative Co-Design of Supercomputers
An AI-driven approach for delivering sustainable supercomputing services
Back to Workshop Archive Listing
The Eleventh Computational Approaches for Cancer Workshop (CAFCW25)
Morning Break - Computational Approaches for Cancer Workshop (CAFCW25)
iSTaRT - in Silico Targeted Radionuclide Therapy: Designing Inhibitor-Chelator Conjugates
An AI Agentic Framework for Understanding Low-Dose Radiation Effects on Human Lung Epithelial Cells
PathLlama: A Language Model for Automated Cancer Surveillance
Stretch Break
PathPCNet: Pathway Principal Component-Based Interpretable Framework for Drug Sensitivity Prediction
Bayesian Inference for Patient-Specific Digital Twins in Oncology
Back to Workshop Archive Listing
Invited talk: Cuzco from Open-source to a High Performance Computing CPU Design
Bridging Simulation and Silicon: A Study of RISC-V Hardware and FireSim Simulation
Simulating Hybrid Analog + RISC-V Systems for HPC Applications
Accelerating Gravitational N-Body Simulations Using the RISC-V-Based Tenstorrent Wormhole
Morning Break - International Workshop on RISC-V for HPC (RISCVHPC)
RISC-V Vectorization Coverage for HPC: A TSVC-Based Analysis
Is RISC-V ready for High Performance Computing? An evaluation of the Sophon SG2044
Assessing a RISC-V Accelerator for Cross-Section Lookup in Chipyard
Dyninst on the RISC-V: Binary Instrumentation in Support of Performance, Debugging, and Other Tools
Back to Workshop Archive Listing
PDSW'25: The 10th International Parallel Data Systems Workshop
Featured Talk: Supporting Science Through Data Management Software
Morning Break - PDSW'25: The 10th International Parallel Data Systems Workshop
LLMTailor: A Layer-wise Tailoring Tool for Efficient Checkpointing of Large Language Models
SlimIO: Lightweight I/O Path Design for Write Isolation in FDP-backed In-Memory Databases
Parallel Data Object Creation: Scalable Metadata Management in Parallel I/O Library
SmartIO: A Lightweight End-to-End Workflow for Runtime I/O Optimization of HPC Systems
Lunch - PDSW'25: The 10th International Parallel Data Systems Workshop
RL4Sys: A Lightweight System-driven RL Framework for Drop-in Integration in System Optimization
Quantifying AWS S3 I/O Performance Boundaries Using the Roofline Model
Afternoon Break - PDSW'25: The 10th International Parallel Data Systems Workshop
Secure In-Storage Execution of VTK Workloads on Modern Parallel NFS Data Servers
Evaluating Usage and Performance of DAOS for a Classic HPC Application
Accelerating Exascale Scientific Discovery via In-Situ and In-Transit Data Analytics in HPC
Panel: Storage Architectures and I/O Optimizations for AI Applications
Back to Workshop Archive Listing
2025 International Workshop on Performance, Portability, and Productivity in HPC
Comparing Cross-Platform Performance via Node-to-Node Scaling Studies
Roofline Analysis of Tightly-Coupled CPU-GPU Superchips: A Study on MI300A and GH200
Morning Break - Workshop on Performance, Portability, and Productivity in HPC
Development of a performance portable distributed FFT interface on top of the Kokkos ecosystem
Bridging Performance Portability and Scalability for Plasma Simulations on Heterogeneous Systems
Performance portable batched linear algebra kernels for transport sweeps using Kokkos
LAMMPS-KOKKOS: Performance Portable Molecular Dynamics Across Exascale Architectures
Lunch - Workshop on Performance, Portability, and Productivity in HPC
Preserving CUDA Syntax for SYCL Portability: A Thin C++ Abstraction without Kernel Migration
Afternoon Break - Workshop on Performance, Portability, and Productivity in HPC
Energy-aware performance portability with OpenMP dynamic variants
Extending RAJA Parallel Programming Abstractions with Just-In-Time Optimization
How does abstraction enable Performance, Portability, and Productivity?
Back to Workshop Archive Listing
Twelfth SC Workshop on Best Practices for HPC Training and Education
Expanding the CyberAmbassadors Program to Include Mentoring for Emerging CI Careers
Morning Break - Best Practices for HPC Training and Education
HPC-ED: Testing Automated Agents to Assess the Quality of Training Resource Metadata
Experience and Outcomes Organizing a Hackathon in the Physical Sciences
Lunch - Best Practices for HPC Training and Education
Afternoon Break - Best Practices for HPC Training and Education
Developing Findable, Accessible, Interoperable and Reusable (FAIR) AI and HPC Training Environments
Closing Remarks- HPC Education and Training: Emerging Opportunities and Challenges
Back to Workshop Archive Listing
LLVM-HPC2025: The Eleventh Workshop on the LLVM Compiler Infrastructure in HPC
Bridging Realities: CUDA-Q's MLIR Pipeline for Quantum-Classical Hybrid Computing
Morning Break - LLVM-HPC2025
Lessons Learned: Template-Heavy C++ in Production HPC Runtime Systems
Implementing OpenMP Offload Support in the AMD Next Generation Fortran Compiler
Lunch - LLVM-HPC2025
ASaP: Automatic Software Prefetching for Sparse Tensor Computations in MLIR
Afternoon Break - LLVM-HPC2025
OpenSHMEM MLIR: A Dialect for Compile-Time Optimization of One-Sided Communications
Scabbard: LLVM Instrumentation-aided Race Checking in CPU/GPU Unified Memory for AMD GPUs
CIRE: LLVM Analysis for Floating-Point Rounding Error Affected by Precision and Optimizations
Back to Workshop Archive Listing
9th International Workshop on Software Correctness for HPC Applications (Correctness '25)
Morning Break - Workshop on Software Correctness for HPC Applications (Correctness '25)
Using Code Coverage to Assess Feature Gaps in MPI Correctness Tool Classification Tests
Coupling Static and Dynamic MPI Correctness Tools to Optimize Accuracy and Overhead
Differential Testing for Sequential to Parallel Transformations
Extending MPI Correctness Benchmarking to the Fortran Language
Lunch - Workshop on Software Correctness for HPC Applications (Correctness '25)
Correct and Efficient HPC Code Generation with LLMs: Challenges and Opportunities
Afternoon Break - Workshop on Software Correctness for HPC Applications (Correctness '25)
Towards an Automated Workflow for Floating-Point Analysis of GPU Kernels
LLM4FP: LLM-Based Program Generation for Triggering Floating-Point Inconsistencies Across Compilers
Exploring Reduced Precision for Deep Learning Activation Functions
Back to Workshop Archive Listing
20th Workshop on Workflows in Support of Large-Scale Science (WORKS25)
RESILIO : A Scalable and Composable Architecture for Tomographic Reconstruction Workflows
Integrating and Characterizing HPC Task Runtime Systems for hybrid AI-HPC workloads
Morning Break - Workflows in Support of Large-Scale Science (WORKS25)
Distinguished Talk: The Evolution of Automation in Science — The Pegasus Perspective
CAMEO: A Co-design Architecture for Multi-objective Energy System Optimization
Adapting Classic Scheduling Heuristics for Online Execution under Uncertainty
Lunch - Workflows in Support of Large-Scale Science (WORKS25)
Concurrency Patterns and Primitives in Modern AI/ML Scientific Applications
Guest Panelist: Beyond DAGs: Adaptive Streaming Workflows & AI-Driven Workflow Design
Guest Panelist: Research Without Borders: Bridging the Knowledge Gap
Panel Discussion: Navigating the Future of Scientific Workflows
Afternoon Break - Workflows in Support of Large-Scale Science (WORKS25)
Overcoming Dynamic I/O Boundaries: a Double-Sided Streaming Methodology with dispel4py and CAPIO
LLM Agents for Interactive Workflow Provenance: Reference Architecture and Evaluation Methodology
Evaluating the Efficacy of LLM-Based Reasoning for Multiobjective HPC Job Scheduling
The (R)evolution of Scientific Workflows in the Agentic AI Era: Towards Autonomous Science
Back to Workshop Archive Listing
The 11th International Workshop on Data Analysis and Reduction for Big Scientific Data
Invited Talk: Globus: Enabling Scalable and Sustainable Research for Data-Intensive Science
Morning Break - Workshop on Data Analysis and Reduction for Big Scientific Data
Design and Implementation of a Custom Hardware Accelerator for SZx Compression in Chipyard
Evaluating Accuracy and Performance Tradeoffs in GPU Accelerated Single Cell RNA-seq Analysis
Data Management System Analysis for Distributed Computing Workloads
Building n-Dimensional Trees for Resolution-Based Progressive Compression
Lunch - Workshop on Data Analysis and Reduction for Big Scientific Data
On the Compressibility of Floating-Point Data in Posit and IEEE-754 Representation
ASCRIBE-XR: Extended Reality for Visualization of Scientific Images
Afternoon Break - Workshop on Data Analysis and Reduction for Big Scientific Data
Lightweight CNN-Based Artifact Reduction for Scientific Error-bounded Lossy Compression
Compression Error Sensitivity Analysis for Different Experts in MoE Model Inference
Characterizing the Performance of Parallel Data-Compression Algorithms across Compilers and GPUs
Integrating Distributed SQL Query Engines with Object-Based Computational Storage
Back to Workshop Archive Listing
Twelfth Workshop on Accelerator Programming and Directives (WACCPD 2025)
Morning Break - Workshop on Accelerator Programming and Directives (WACCPD 2025)
Mojo: MLIR-based Performance-Portable HPC Science Kernels on GPUs for the Python Ecosystem
Towards Efficient Load Balancing BFS on GPUs: One Code for AMD, Intel & Nvidia
A Study of Performance Portability of Low-bit Fused Matrix-Vector Multiplication Kernels in SYCL
Porting a Fortran plasma simulation to Exascale on AMD GPUs using both OpenMP and Kokkos
Lunch - Workshop on Accelerator Programming and Directives (WACCPD 2025)
Bridging FPGA and GPU over PCIe: A Low-Latency Communication Path using AVX-512
Reduction-Aware Directive-Based Programming via Multi-Dimensional Homomorphisms
Afternoon Break - Workshop on Accelerator Programming and Directives (WACCPD 2025)
Scalable Neural Network Training: Distributed Data-Parallel Approaches
Back to Workshop Archive Listing
A Peak Performance Model for All-to-all on Hierarchical Systems and Its Applications
Morning Break - PMBS25
Beyond Guess and Check: Quantifying the Fidelity of Proxy Applications
CGSim: A Simulation Framework for Large Scale Distributed Computing Environment
Implications of Full-System Modeling for Superconducting Architectures
Experiences of Porting Structured and Unstructured Stencil Applications to FPGA using SYCL
Lunch - PMBS25
MoE-Inference-Bench: Performance Evaluation of Mixture of Expert Large Language and Vision Models
Pretraining LLMs at Scale: Tuning Strategies and Performance Portability.
Afternoon Break - PMBS25
Characterizing the Impact of GPU Power Management on an Exascale System
On the Performance and Scalability of Cloud Supercomputers: Insights from Eagle and Reindeer
Back to Workshop Archive Listing
Morning Break - AI4S: 6th Workshop on Artificial Intelligence and Machine Learning for Scientific Applications
LangChain-Parsl: Connect Large Language Model Agents to High Performance Computing Resource
Classification of Three-dimensional Electron Diffraction Data with a Large Language Model
Lunch - AI4S: 6th Workshop on Artificial Intelligence and Machine Learning for Scientific Applications
Afternoon Break - AI4S: 6th Workshop on Artificial Intelligence and Machine Learning for Scientific Applications
FIRST: Federated Inference Resource Scheduling Toolkit for Scientific AI Model Access
Inverse Design for Generating Initial Conditions in Scientific Simulations
Back to Workshop Archive Listing
Containers and New Orchestration Paradigms for Isolated Environments in HPC
The Convergence of Containers and Kubernetes with HPC and Generative AI
Morning Break - CANOPIE-HPC
Experience Deploying Containerized GenAI Services at an HPC Center
Engine-Agnostic Model Hot-Swapping for Cost-Effective LLM Inference
Dynamic Topology-Aware Scheduling in HPC Systems with Topograph
Lunch - CANOPIE-HPC
Afternoon Break - CANOPIE-HPC
Evaluating HPK for Running Cloud-Native Workloads on Slurm Clusters
Back to Workshop Archive Listing
Morning Break - WHPC: Building Community, Building Careers
WHPC - WHPC Distinguished Volunteering Recognition Presented to Cristin Merritt
Lunch - WHPC: Building Community, Building Careers
TUSQ: Tracking, Uncomputation, and Sampling for Noisy Quantum Simulation
Large-Scale Empirical Image Enhancement Studies with diverse HPC systems
Predicting Resources for AI Workloads in HPC: Methods, Challenges, and Opportunities
Overhead Quantification of the Lightweight Distributed Metric Service for High-Performance Computers
WHPC - Networking Breakout
Afternoon Break - WHPC: Building Community, Building Careers
Back to Workshop Archive Listing
Featured Talk - ARMing GPUs: On the Memory Subsystem of Grace Hopper GH200
Afternoon Break - Sixth Workshop on Heterogeneity and Memory Systems (HMEM)
Machine Learning-Guided Memory Optimization for DLRM Inference on Tiered Memory
Performance Analysis of Compute Express Link (CXL) Memory Expansion with Data Interleaving
Summary: Hierarchical Framework for Multi-node Compute eXpress Link Memory Transactions
Performance Characterization of CXL Memory and Its Use Cases
Back to Workshop Archive Listing
7th Workshop on Programming and Performance Visualization Tools (ProTools)
MT4G: A Tool for Reliable Auto-Discovery of NVIDIA and AMD GPU Compute and Memory Topologies
Afternoon Break - Workshop on Programming and Performance Visualization Tools (ProTools)
Extending THAPI with CXI Hardware Counter Sampling for High Resolution NIC Telemetry
Scalable, High-Fidelity Monitoring of Application Communication Patterns in Vernier
Back to Workshop Archive Listing
Afternoon Break - High Performance Python for Science at Scale
A first look at Mojo’s MLIR-based Performance Portable GPU Programming for Python Users
Panel Discussion: Managing a Python Environment for Everyone
Back to Workshop Archive Listing
Morning Break - Workshop on Heterogeneous High-Performance Reconfigurable Computing (H2RC 2025)
A Compute Graph Simulation and Implementation Framework Targeting AMD Versal AI Engines
SNAcc: An Open-Source Framework for Streaming-based Network-to-Storage Accelerators
Connected-Component Labeling Using HLS for High-Energy Particle Physics Instruments
An Approach to Identify Divergences in Hardware Designs for HPC Workloads
Back to Workshop Archive Listing
Seventh International Workshop on Emerging Parallel Distributed Runtime Systems and Middleware
ChemComp: A Compilation Framework for Computing with Chemical Reaction Networks
Morning Break - Workshop on Emerging Parallel Distributed Runtime Systems and Middleware
MPI Communication Performance on AMD MI300A: Microbenchmarks and Applications
Error Analysis of Globally Distributed Workflow Management System
Back to Workshop Archive Listing
Sixth Combined Workshop on Interactive and Urgent High-Performance Computing
Open OnDemand: Connecting Computing Power with Powerful Minds
Morning Break - Sixth Combined Workshop on Interactive and Urgent High-Performance Computing
Implementing support for Interactive and AI workloads in a traditional HPC environment
Back to Workshop Archive Listing
First International Symposium on Artificial Intelligence and Extreme-Scale Workflows
Invited Talk: How Complex AI Workflows Are Driving the Future of HPC Architectures?
Morning Break - Symposium on Artificial Intelligence and Extreme-Scale Workflows
Invited Talk: AI-Ready Scientific Workflows at Scale: Bridging Data, Infrastructure, and Automation
Invited Talk: Intelligent Workflows at Scale: AI-Driven Exploration of Cosmological Simulations
Back to Workshop Archive Listing
A Simulation Framework for Workload Management in Hybrid Quantum-HPC Cloud System
An HPC-Inspired Blueprint for a Technology-Agnostic Quantum Middle Layer
Tackling the Challenges of Adding Pulse-level Support to a Heterogeneous HPCQC Software Stack
Scaling Hybrid Quantum--HPC Applications with the Quantum Framework
Morning Break - Workshop for Software Frameworks and Workload Management on Quantum-HPC Ecosystems
Orchestrating Quantum-HPC Workflows with Distributed Quantum Circuit Cutting
Towards Supporting QIR: Steps for Adopting the Quantum Intermediate Representation
A Practical Quantum Solver for Multidimensional Partial Differential Equations
Invited Talk - The Current and Future of Q-HPC Software Frameworks
Panel Discussion - Challenges in Q-HPC Software Frameworks & Workload Management
Back to Workshop Archive Listing
Experiences Integrating Database Support into the OLCF Test Harness
Morning Break - Workshop on HPC Testing and Evaluation of Systems, Tools, and Software (HPCTESTS 2025)
Testing and Benchmarking Emerging Supercomputers via the MFC Flow Solver
Seeking Cost-Optimal Infrastructure Size for Distributed Filesystems: A Ceph Case Study
Back to Workshop Archive Listing