Towards Accurate Simulation of Global Challenges on Data Centers Infrastructures via Coupling of Models and Data Sources

Accurate digital twinning of the global challenges (GC) leads to computationally expensive coupled simulations. These simulations bring together not only different models, but also various sources of massive static and streaming data sets. In this paper, we explore ways to bridge the gap between traditional high performance computing (HPC) and data-centric computation in order to provide efficient technological solutions for accurate policy-making in the domain of GC. GC simulations in HPC environments give rise to a number of technical challenges related to coupling. Being intended to reflect current and upcoming situation for policy-making, GC simulations extensively use recent streaming data coming from external data sources, which requires changing traditional HPC systems operation. Another common challenge stems from the necessity to couple simulations and exchange data across data centers in GC scenarios. By introducing a generalized GC simulation workflow, this paper shows commonality of the technical challenges for various GC and reflects on the approaches to tackle these technical challenges in the HiDALGO project.

[1]  A. Simmons,et al.  Operational global reanalysis : progress , future directions and synergies with NWP , 2018 .

[2]  Andrew Crooks,et al.  The Integration of Agent-Based Modelling and Geographical Information for Geospatial Simulation , 2012 .

[3]  Javier Carnero,et al.  Running Simulations in HPC and Cloud Resources by Implementing Enhanced TOSCA Workflows , 2018, 2018 International Conference on High Performance Computing & Simulation (HPCS).

[4]  D. Randall,et al.  A Multiscale Modeling System: Developments, Applications, and Critical Issues , 2009 .

[5]  Marlon E. Pierce,et al.  Apache Airavata: Design and Directions of a Science Gateway Framework , 2014, 2014 6th International Workshop on Science Gateways.

[6]  Peter V. Coveney,et al.  Survey of Multiscale and Multiphysics Applications and Communities , 2012, Computing in Science & Engineering.

[7]  Derek Groen,et al.  How Policy Decisions Affect Refugee Journeys in South Sudan: A Study Using Automated Ensemble Simulations , 2020, J. Artif. Soc. Soc. Simul..

[8]  E Weinan,et al.  Multiscale modeling , 2019, Scholarpedia.

[9]  Daniel Krajzewicz,et al.  SUMO - Simulation of Urban MObility An Overview , 2011 .

[10]  Peter V. Coveney,et al.  FabSim: Facilitating computational research through automation on large-scale and distributed e-infrastructures , 2015, Comput. Phys. Commun..

[11]  Johan Montagnat,et al.  Scientific workflows: Past, present and future , 2017, Future Gener. Comput. Syst..

[12]  Zhihui Wang,et al.  Multiscale agent-based cancer modeling , 2009, Journal of mathematical biology.

[13]  Yong Chen,et al.  PFault: A General Framework for Analyzing the Reliability of High-Performance Parallel File Systems , 2018, ICS.

[14]  Jeffrey H. Johnson,et al.  Global Systems Science and Policy , 2017 .

[15]  Bastien Chopard,et al.  Multiscale modelling and simulation: a position paper , 2014, Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences.

[16]  Philipp Neumann,et al.  Mastering the scales: a survey on the benefits of multiscale computing software , 2019, Philosophical Transactions of the Royal Society A.

[17]  Peter V. Coveney,et al.  Distributed multiscale computing with MUSCLE 2, the Multiscale Coupling Library and Environment , 2013, J. Comput. Sci..

[18]  Rizos Sakellariou,et al.  A characterization of workflow management systems for extreme-scale applications , 2016, Future Gener. Comput. Syst..

[19]  Frank Leymann,et al.  Portable Cloud Services Using TOSCA , 2012, IEEE Internet Computing.