Poster Type: Research Posters
Authors: German Felipe Giraldo Villa (École Polytechnique Fédérale de Lausanne), Théo Grivel (École Polytechnique Fédérale de Lausanne), George Ioannidis (École Polytechnique Fédérale de Lausanne), Edita Kizinevic (École Polytechnique Fédérale de Lausanne), Carolina Lindqvist (École Polytechnique Fédérale de Lausanne), Nicolas Litchinko (École Polytechnique Fédérale de Lausanne), Pablo Llopis (École Polytechnique Fédérale de Lausanne), Antonio Javier Russo (École Polytechnique Fédérale de Lausanne), Gilles Fourestey (École Polytechnique Fédérale de Lausanne)
Supervisor:
Abstract: High-performance computing (HPC) systems require intricate platform-specific stacks and configurations. This makes it difficult to reproduce the same HPC ecosystem on a different platform, which is a key requirement for geo-redundancy, business continuity, and urgent computing. We present a method for declaratively defining portable HPC ecosystems that can be deployed rapidly and reliably with a high degree of automation. Our model enables infrastructure-layer portability, going beyond existing cloud-native solutions.
We introduce a two-tiered modular abstraction framework consisting of provider-specific lower-level modules, which handle the implementation details, and provider-agnostic high-level modules, which define the core infrastructure logic. The framework is designed around the versatile software-defined clusters (vClusters) developed at CSCS.
To evaluate our approach, we showcase a portable implementation of the Weather and Climate HPC vCluster that runs on the Alps ecosystem, and deploy it on Google Cloud Platform. Our work demonstrates the effectiveness of our declarative approach in migrating HPC systems across heterogeneous platforms.
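The two-tiered abstraction described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation, and the poster does not state which tooling they use. Every name here (`ClusterSpec`, `ProviderModule`, `OnPremProvider`, `CloudProvider`, `plan`) and every resource field is hypothetical and chosen only for illustration. The sketch shows one declarative specification being expanded by a single provider-agnostic function, with two interchangeable provider-specific modules supplying the details.

```python
from dataclasses import dataclass
from typing import Protocol


@dataclass(frozen=True)
class ClusterSpec:
    """Declarative, provider-agnostic cluster description (hypothetical fields)."""
    name: str
    compute_nodes: int
    shared_storage_gb: int


class ProviderModule(Protocol):
    """Lower tier: maps generic requests to provider-specific resources."""

    def node(self, cluster: str, index: int) -> dict: ...

    def storage(self, cluster: str, size_gb: int) -> dict: ...


class OnPremProvider:
    """Hypothetical lower-level module for an on-premises platform."""

    def node(self, cluster: str, index: int) -> dict:
        return {"kind": "onprem_node", "id": f"{cluster}-nid{index:04d}"}

    def storage(self, cluster: str, size_gb: int) -> dict:
        return {"kind": "onprem_filesystem", "id": f"{cluster}-scratch",
                "size_gb": size_gb}


class CloudProvider:
    """Hypothetical lower-level module for a public cloud platform."""

    def node(self, cluster: str, index: int) -> dict:
        return {"kind": "cloud_vm", "id": f"{cluster}-vm-{index}"}

    def storage(self, cluster: str, size_gb: int) -> dict:
        return {"kind": "cloud_file_share", "id": f"{cluster}-share",
                "size_gb": size_gb}


def plan(spec: ClusterSpec, provider: ProviderModule) -> list[dict]:
    """Upper tier: core infrastructure logic, written once for all providers."""
    # Shared storage comes first so that nodes could mount it once created.
    resources = [provider.storage(spec.name, spec.shared_storage_gb)]
    resources += [provider.node(spec.name, i) for i in range(spec.compute_nodes)]
    return resources
```

Calling `plan(spec, OnPremProvider())` and `plan(spec, CloudProvider())` with the same `spec` yields resource lists of the same shape that differ only in their provider-specific details, which is the portability property the abstract describes.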
Best Poster Finalist (BP): no
Poster: PDF
Poster Summary: PDF