Automatizing the creation of specialized high-performance computing containers

With Exascale computing already here, supercomputers are systems every time larger, more complex, and heterogeneous. While expert system administrators can install and deploy applications in the systems correctly, this is something that general users can not usually do. The eFlows4HPC project aims to provide methodologies and tools to enable the use and reuse of application workflows. One of the aspects that the project focuses on is simplifying the application deployment in large and complex systems. The approach uses containers, not generic ones, but containers tailored for each target High-Performance Computing (HPC) system. This paper presents the Container Image Creation service developed in the framework of the project and experimentation based on project applications. We compare the performance of the specialized containers against generic containers and against a native installation. The results show that in almost all cases, the specialized containers outperform the generic ones (up to 2× faster), and in all cases, the performance is the same as with the native installation.

[1]  M. Taufer,et al.  Building Trust in Earth Science Findings through Data Traceability and Results Explainability , 2023, IEEE Transactions on Parallel and Distributed Systems.

[2]  Rosa M. Badia,et al.  Enabling Dynamic and Intelligent Workflows for HPC, Data Analytics, and AI Convergence , 2022, Future generations computer systems.

[3]  D. Kranzlmüller,et al.  Enabling EASEY Deployment of Containerized Applications for Future HPC Systems , 2020, ICCS.

[4]  Rosa M. Badia,et al.  dislib: Large Scale High Performance Machine Learning in Python , 2019, 2019 15th International Conference on eScience (eScience).

[5]  Manuel J. Castro,et al.  Performance Benchmarking of Tsunami-HySEA Model for NTHMP’s Inundation Mapping Activities , 2017, Pure and Applied Geophysics.

[6]  Vanessa Sochat,et al.  Singularity: Scientific containers for mobility of compute , 2017, PloS one.

[7]  Qiang Wang,et al.  The Finite-volumE Sea ice–Ocean Model (FESOM2) , 2016 .

[8]  Jordi Torres,et al.  PyCOMPSs: Parallel computational workflows in Python , 2016, Int. J. High Perform. Comput. Appl..

[9]  Bronis R. de Supinski,et al.  The Spack package manager: bringing order to HPC software chaos , 2015, SC15: International Conference for High Performance Computing, Networking, Storage and Analysis.

[10]  Markus Geimer,et al.  Modern Scientific Software Management Using EasyBuild and Lmod , 2014, 2014 First International Workshop on HPC User Support Tools.

[11]  Dirk Merkel,et al.  Docker: lightweight Linux containers for consistent development and deployment , 2014 .

[12]  Riccardo Rossi,et al.  Migration of a generic multi-physics framework to HPC environments , 2013 .

[13]  Eugenio Oñate,et al.  An Object-oriented Environment for Developing Finite Element Codes for Multi-disciplinary Applications , 2010 .

[14]  S. McMillan,et al.  Making Containers Easier with HPC Container Maker , 2018 .

[15]  F. Al-Shamali,et al.  Author Biographies. , 2015, Journal of social work in disability & rehabilitation.