The ESCAPE project: Energy-efficient Scalable Algorithms for Weather Prediction at Exascale

Abstract. In the simulation of complex multi-scale flows arising in weather and climate modelling, one of the biggest challenges is to satisfy strict service requirements in terms of time to solution and to satisfy budgetary constraints in terms of energy to solution, without compromising the accuracy and stability of the application. These simulations require algorithms that minimise the energy footprint along with the time required to produce a solution, maintain the physically required level of accuracy, are numerically stable, and are resilient in case of hardware failure. The European Centre for Medium-Range Weather Forecasts (ECMWF) led the ESCAPE (Energy-efficient Scalable Algorithms for Weather Prediction at Exascale) project, funded by Horizon 2020 (H2020) under the FET-HPC (Future and Emerging Technologies in High Performance Computing) initiative. The goal of ESCAPE was to develop a sustainable strategy to evolve weather and climate prediction models to next-generation computing technologies. The project partners incorporate the expertise of leading European regional forecasting consortia, university research, experienced high-performance computing centres, and hardware vendors. This paper presents an overview of the ESCAPE strategy: (i) identify domain-specific key algorithmic motifs in weather prediction and climate models (which we term Weather & Climate Dwarfs), (ii) categorise them in terms of computational and communication patterns while (iii) adapting them to different hardware architectures with alternative programming models, (iv) analyse the challenges in optimising, and (v) find alternative algorithms for the same scheme. The participating weather prediction models are the following: IFS (Integrated Forecasting System); ALARO, a combination of AROME (Application de la Recherche a l'Operationnel a Meso-Echelle) and ALADIN (Aire Limitee Adaptation Dynamique Developpement International); and COSMO–EULAG, a combination of COSMO (Consortium for Small-scale Modeling) and EULAG (Eulerian and semi-Lagrangian fluid solver). For many of the weather and climate dwarfs ESCAPE provides prototype implementations on different hardware architectures (mainly Intel Skylake CPUs, NVIDIA GPUs, Intel Xeon Phi, Optalysys optical processor) with different programming models. The spectral transform dwarf represents a detailed example of the co-design cycle of an ESCAPE dwarf. The dwarf concept has proven to be extremely useful for the rapid prototyping of alternative algorithms and their interaction with hardware; e.g. the use of a domain-specific language (DSL). Manual adaptations have led to substantial accelerations of key algorithms in numerical weather prediction (NWP) but are not a general recipe for the performance portability of complex NWP models. Existing DSLs are found to require further evolution but are promising tools for achieving the latter. Measurements of energy and time to solution suggest that a future focus needs to be on exploiting the simultaneous use of all available resources in hybrid CPU–GPU arrangements.

[1]  Peter Bauer,et al.  Atlas : A library for numerical weather prediction and climate modelling , 2017, Comput. Phys. Commun..

[2]  Yongjun Zheng,et al.  Simulation of the performance and scalability of message passing interface (MPI) communications of atmospheric models running on exascale supercomputers , 2018, Geoscientific Model Development.

[3]  Mats Hamrud,et al.  A Fast Spherical Harmonics Transform for Global NWP and Climate Models , 2013 .

[4]  Beau Johnston,et al.  Dwarfs on Accelerators: Enhancing OpenCL Benchmarking for Heterogeneous Computing Architectures , 2018, ICPP Workshops.

[5]  Mats Hamrud,et al.  A Partitioned Global Address Space implementation of the European Centre for Medium Range Weather Forecasts Integrated Forecasting System , 2015, Int. J. High Perform. Comput. Appl..

[6]  Wendy S. Parker,et al.  The future of climate modeling , 2015, Climatic Change.

[7]  James Demmel,et al.  the Parallel Computing Landscape , 2022 .

[8]  Peter Bauer,et al.  The quiet revolution of numerical weather prediction , 2015, Nature.

[9]  Jing Zhang,et al.  OpenCL and the 13 dwarfs: a work in progress , 2012, ICPE '12.

[10]  Eduardo F. D'Azevedo,et al.  MiniApps derived from production HPC applications using multiple programing models , 2018, Int. J. High Perform. Comput. Appl..

[11]  Sophie Valcke,et al.  Crossing the chasm: how to develop weather and climate models for next generation computers? , 2017 .

[12]  Circumventing the pole problem of reduced lat‐lon grids with local schemes. Part II: Validation experiments , 2019, Quarterly Journal of the Royal Meteorological Society.

[13]  Francis X. Giraldo,et al.  Current and Emerging Time-Integration Strategies in Global Numerical Weather and Climate Prediction , 2019 .

[14]  Torsten Hoefler,et al.  Near-global climate simulation at 1 km resolution: establishing a performance baseline on 4888 GPUs with COSMO 5.0 , 2017 .

[15]  Nils Wedi,et al.  Assessing the scales in numerical weather and climate predictions: will exascale be the rescue? , 2019, Philosophical Transactions of the Royal Society A.

[16]  Leonid Oliker,et al.  Hardware/software co‐design of global cloud system resolving models , 2011 .

[17]  Dhabaleswar K. Panda,et al.  High performance implementation of MPI derived datatype communication over InfiniBand , 2004, 18th International Parallel and Distributed Processing Symposium, 2004. Proceedings..

[18]  Kenneth Flamm Measuring Moore's Law: Evidence from Price, Cost, and Quality Indexes , 2018 .

[19]  E. Kaltofen The “Seven Dwarfs” of Symbolic Computation , 2012 .

[20]  Hanna Pawlowska,et al.  University of Warsaw Lagrangian Cloud Model (UWLCM) 1.0: a modern large-eddy simulation tool for warm cloud modeling with Lagrangian microphysics , 2019, Geoscientific Model Development.

[21]  Pierre Bénard,et al.  RK‐IMEX HEVI schemes for fully compressible atmospheric models with advection: analyses and numerical testing , 2017 .

[22]  Timothy D. Wilkinson,et al.  An optical Fourier transform coprocessor with direct phase determination , 2017, Scientific Reports.

[23]  Renate Hagedorn,et al.  Toward a new generation of world climate research and computing facilities , 2010 .

[24]  Wu-chun Feng,et al.  OpenDwarfs: Characterization of Dwarf-Based Benchmarks on Fixed and Reconfigurable Architectures , 2016, J. Signal Process. Syst..

[25]  Robert Pincus,et al.  The CLAW DSL: Abstractions for Performance Portable Weather and Climate Models , 2018, PASC.

[26]  Francis X. Giraldo,et al.  Strong scaling for numerical weather prediction at petascale with the atmospheric model NUMA , 2015, Int. J. High Perform. Comput. Appl..

[28]  A. P. Siebesma,et al.  Weather Forecasting Using GPU-Based Large-Eddy Simulations , 2015 .

[29]  Tim Palmer,et al.  Climate forecasting: Build high-resolution global climate models , 2014, Nature.

[30]  D. Williamson The Evolution of Dynamical Cores for Global Atmospheric Models(125th Anniversary Issue of the Meteorological Society of Japan) , 2007 .

[31]  Torsten Hoefler,et al.  Reflecting on the Goal and Baseline for Exascale Computing: A Roadmap Based on Weather and Climate Simulations , 2019, Computing in Science & Engineering.

[32]  Juri Papay,et al.  Snow White Clouds and the Seven Dwarfs , 2011, 2011 IEEE Third International Conference on Cloud Computing Technology and Science.

[33]  Joanna Szmelter,et al.  FVM 1.0: a nonhydrostatic finite-volume dynamical core for the IFS , 2019, Geoscientific Model Development.