The International Conference for High Performance Computing, Networking, Storage, and Analysis

Research and ACM SRC Posters Archive

Mixed Compute Environments with OpenCHAMI


Poster Type: Research Posters

Authors: Sean Gibson (Los Alamos National Laboratory (LANL)), Richard Kim (Los Alamos National Laboratory (LANL)), Samuel Quan (Los Alamos National Laboratory (LANL)), Travis Cotton (Los Alamos National Laboratory (LANL)), Thomas Mackell (Los Alamos National Laboratory (LANL))

Supervisor: Cotton (Oregon State University)

Abstract: There is a growing need to support workloads that don’t follow a traditional HPC workflow. Many of these workloads are developed with Kubernetes as the workload manager (WLM) rather than an HPC-focused one such as Slurm. Mixing the two kinds of workload on one cluster is challenging for two reasons: demand for either type of resource may fluctuate, so statically assigning nodes to Kubernetes or Slurm can leave resources idle; and demand for one WLM may grow over time, so additional resources must be reassigned and moved to it.

To address this demand, we used OpenCHAMI, an open-source system management platform for deploying, managing, and scaling HPC clusters. With OpenCHAMI, we created “spread”: a command-line tool that configures nodes’ workload environments across the cluster. We support fast node booting using kexec and a dynamic set of workload environments to swap between, including Slurm and Kubernetes.
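
The rebalancing idea in the abstract can be illustrated with a small planning sketch. This is not the actual “spread” implementation, and it does not call OpenCHAMI, Slurm, Kubernetes, or kexec; the `Node` type, the `plan_reassignments` function, and the environment names are all hypothetical, chosen only to show how idle nodes might be moved from an over-provisioned workload environment to an under-provisioned one.

```python
from dataclasses import dataclass


@dataclass
class Node:
    """Hypothetical view of a cluster node (not an OpenCHAMI type)."""
    name: str
    env: str    # current workload environment, e.g. "slurm" or "kubernetes"
    busy: bool  # True if the node is running work and should not be rebooted


def plan_reassignments(nodes, wanted):
    """Return {node_name: new_env}, moving idle nodes toward the wanted counts.

    `wanted` maps an environment name to the desired number of nodes.
    Busy nodes are never moved, so the plan may fall short of `wanted`.
    """
    # Count how many nodes each environment currently has.
    counts = {}
    for node in nodes:
        counts[node.env] = counts.get(node.env, 0) + 1

    # Environments that are short of nodes, and by how many.
    deficits = {
        env: want - counts.get(env, 0)
        for env, want in wanted.items()
        if want > counts.get(env, 0)
    }

    plan = {}
    for node in nodes:
        if not deficits:
            break  # every environment has reached its wanted count
        surplus = counts[node.env] - wanted.get(node.env, 0)
        if node.busy or surplus <= 0:
            continue  # skip nodes in use or in environments with no spare nodes
        target = next(iter(deficits))
        plan[node.name] = target
        counts[node.env] -= 1
        counts[target] = counts.get(target, 0) + 1
        deficits[target] -= 1
        if deficits[target] == 0:
            del deficits[target]
    return plan
```

In a real deployment, each entry in the returned plan would correspond to draining the node from its current WLM and rebooting it into the target environment; those steps are deliberately left out here because their details are not given in the abstract.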

Best Poster Finalist (BP): no

