Authors: Tim Wickberg (SchedMD LLC), Danny Auble (SchedMD LLC)
Abstract: Slurm is an open-source workload manager used on many of the TOP500 systems and provides a rich set of features.
An updated Slurm community survey will be distributed ahead of the BoF, and introduced at the start of the session.
Changes made in the Slurm 25.05 and 25.11 releases will be presented, alongside the future roadmap for 26.05 and beyond.
Initial results from the community survey will be discussed. Discussion will focus around how Slurm development should react to changes in Linux distribution lifecycles, Linux cgroup versions, container runtimes, and external tools such as MPI/PMIx.
Remaining time will be used as an open community forum.
Everyone interested in Slurm use and development is encouraged to attend.
Long Description: Slurm is an open-source workload manager used on much of the TOP500 systems and provides a rich set of features.
Slinky is SchedMD's toolkit of components to enable integration of Slurm into Kubernetes, allowing for enhancement of Kubernetes scheduling capabilities for AI/ML workloads.
The meeting will consist of four parts: The Slurm development team will present details about the newly released Slurm 25.11, current status of the Slinky projects, planned changes for the upcoming 26.05 and future releases, and survey the audience on aspects of HPC system design.
Everyone interested in Slurm use and development is encouraged to attend.
Goals for the BoF are to:
- Communicate recent features and functionality introduced in Slurm.
- Discuss how newer technologies, such as Kubernetes, are leading to changes in how HPC systems are designed and deployed, and how projects such as Slinky may add additional options.
- Gather community feedback to steer future development efforts.
- Survey the community to understand the state-of-practice, focused around Linux distribution adoption (and retirement of obsolete versions), Linux subsystem adoption (cgroup v1 vs v2), integration with other projects such as OpenPMIx.
Website: https://slurm.schedmd.com/