Containerizing HPC Applications on Heterogeneous Systems for Centralized Resource Management: A Case Study

Recently, the demand for scientific computing on HPC systems has grown in popularity. However, the runtime environment is a standpoint when there are many kinds of different applications with different requirements. Moreover, an HPC system cannot satisfy all of these requirements of environment. This becomes more and more considerable in the case of applications running on heterogeneous systems (e.g., CPU/Intel Xeon Phi based cluster). Generally, two main problems needing to be tackled in HPC systems are runtime environment and workload management. In terms of lightweight virtualization, Docker facilitates the isolation of different applications as well as runtime environments on the same host operating system. In addition, with huge advantages, batch job scheduler plays a vital role in management and operation. In this paper, we adopt an approach by combining containerization and HPC workload management to support the submission of a variety of applications. Practically, we perform the experiments on a heterogeneous cluster with CPU and Intel Xeon Phi coprocessor. The results show that there is a slightly different about the performance of jobs which are submitted by the normal way and containerized way. However, the experimental result highlights that the cost of containerizing HPC applications is negligible, and this can be applied in practice to fulfill user's requirement.

[1]  Aleksandar Jemcov,et al.  OpenFOAM: A C++ Library for Complex Physics Simulations , 2007 .

[2]  Ibm Redbooks,et al.  Workload Management With Loadleveler , 2001 .

[3]  Liqiang Chen Integrating Cloud Computing Services Using Enterprise Service Bus (ESB) , 2012 .

[4]  日経BP社,et al.  Amazon Web Services完全ソリューションガイド , 2016 .

[5]  Yanli Wang,et al.  Quantum ESPRESSO: a modular and open-source software project for quantum simulations of materials , 2009 .

[6]  James Reinders,et al.  Intel Xeon Phi Coprocessor High Performance Programming , 2013 .

[7]  Michael E. Jennings,et al.  Charliecloud: Unprivileged Containers for User-Defined Software Stacks in HPC , 2017 .

[8]  Honbo Zhou,et al.  The EASY - LoadLeveler API Project , 1996, JSSPP.

[9]  J. Hervouet TELEMAC modelling system: an overview , 2000 .

[10]  Douglas Thain,et al.  Integrating Containers into Workflows: A Case Study Using Makeflow, Work Queue, and Docker , 2015, VTDC@HPDC.

[11]  Michael Humphrey Altair's PBS professional update , 2006, SC.

[12]  Richard O. Sinnott,et al.  A performance comparison of container-based technologies for the Cloud , 2017, Future Gener. Comput. Syst..

[13]  Andy B. Yoo,et al.  Approved for Public Release; Further Dissemination Unlimited X-ray Pulse Compression Using Strained Crystals X-ray Pulse Compression Using Strained Crystals , 2002 .

[14]  D. Jacobsen,et al.  Contain This, Unleashing Docker for HPC , 2015 .

[15]  Edward L. Haletky VMware vSphere and Virtual Infrastructure Security: Securing the Virtual Environment , 2009 .

[16]  Georges Da Costa,et al.  2005 IEEE International Symposium on Cluster Computing and the Grid , 2005, CCGRID.

[17]  César A. F. De Rose,et al.  Performance Evaluation of Container-Based Virtualization for High Performance Computing Environments , 2013, 2013 21st Euromicro International Conference on Parallel, Distributed, and Network-Based Processing.

[18]  Andreas Wilke,et al.  Skyport - Container-Based Execution Environment Management for Multi-cloud Scientific Workflows , 2014, 2014 5th International Workshop on Data-Intensive Computing in the Clouds.

[19]  Mohsine Eleuldj,et al.  OpenStack: Toward an Open-source Solution for Cloud Computing , 2012 .

[20]  Gerrit Groenhof,et al.  GROMACS: Fast, flexible, and free , 2005, J. Comput. Chem..

[21]  Ramakrishnan Rajamony,et al.  An updated performance comparison of virtual machines and Linux containers , 2015, 2015 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS).

[22]  Jun Kong,et al.  Comparative Performance Analysis of Intel (R) Xeon Phi (TM), GPU, and CPU: A Case Study from Microscopy Image Analysis , 2014, 2014 IEEE 28th International Parallel and Distributed Processing Symposium.

[23]  Miika Komu,et al.  Hypervisors vs. Lightweight Virtualization: A Performance Comparison , 2015, 2015 IEEE International Conference on Cloud Engineering.