Near-global climate simulation at 1 km resolution: establishing a performance baseline on 4888 GPUs with COSMO 5.0

Abstract. The best hope for reducing long-standing global climate model biases is by increasing resolution to the kilometer scale. Here we present results from an ultrahigh-resolution non-hydrostatic climate model for a near-global setup running on the full Piz Daint supercomputer on 4888 GPUs (graphics processing units). The dynamical core of the model has been completely rewritten using a domain-specific language (DSL) for performance portability across different hardware architectures. Physical parameterizations and diagnostics have been ported using compiler directives. To our knowledge this represents the first complete atmospheric model being run entirely on accelerators on this scale. At a grid spacing of 930 m (1.9 km), we achieve a simulation throughput of 0.043 (0.23) simulated years per day and an energy consumption of 596 MWh per simulated year. Furthermore, we propose a new memory usage efficiency (MUE) metric that considers how efficiently the memory bandwidth – the dominant bottleneck of climate codes – is being used.

[1]  D. Lüthi,et al.  Evaluation of the convection‐resolving climate modeling approach on continental scales , 2017 .

[2]  Joost VandeVondele,et al.  cp2k: atomistic simulations of condensed matter systems , 2014 .

[3]  Michael Sprenger,et al.  The Real-Time Ultrafinescale Forecast Support during the Special Observing Period of the MAP , 2002 .

[4]  P Blake,et al.  A national strategy , 2000 .

[5]  Terry Davies,et al.  Validity of anelastic and other equation sets as inferred from normal‐mode analysis , 2003 .

[6]  Gilles Fourestey,et al.  First Experiences With Validating and Using the Cray Power Management Database Tool , 2014, ArXiv.

[7]  Matt Martineau,et al.  GPU-STREAM v2.0: Benchmarking the Achievable Memory Bandwidth of Many-Core Processors Across Diverse Parallel Programming Models , 2016, ISC Workshops.

[8]  Hong Wang,et al.  A summary of numerical methods for time-dependent advection-dominated partial differential equations , 2001 .

[9]  Tobias Gysi,et al.  Towards a performance portable, architecture agnostic implementation strategy for weather and climate models , 2014, Supercomput. Front. Innov..

[10]  Hiroshi Niino,et al.  Multiple Polar Mesocyclones over the Japan Sea on 11 February 1997 , 2004 .

[11]  B. Ritter,et al.  A comprehensive radiation scheme for numerical weather prediction models with potential applications in climate simulations , 1992 .

[12]  F. Martin Ralph Observations of 250-km-wavelength clear-air-eddies and 750-km-wavelength mesocyclones associated with a synoptic-scale midlatitude cyclone , 1996 .

[13]  D. Lüthi,et al.  Towards European-scale convection-resolving climate simulations with GPUs: a study with COSMO 4.19 , 2016 .

[14]  Kevin A. Reed,et al.  Analytical initial conditions and an analysis of baroclinic instability waves in f ‐ and β‐plane 3D channel models , 2015 .

[15]  Y.-K. Kwok,et al.  Static scheduling algorithms for allocating directed task graphs to multiprocessors , 1999, CSUR.

[16]  Srimanth Gadde Graph partitioning algorithms for minimizing inter-node communication on a distributed system , 2013 .

[17]  H. T. Kung,et al.  I/O complexity: The red-blue pebble game , 1981, STOC '81.

[18]  G. Zängl,et al.  Quantitative precipitation forecasting in the Alps: The advances achieved by the Mesoscale Alpine Programme , 2007 .

[19]  Chris Snyder,et al.  Atmospheric Kinetic Energy Spectra from Global High-Resolution Nonhydrostatic Simulations , 2014 .

[20]  Emil M. Constantinescu,et al.  Implicit-Explicit Formulations of a Three-Dimensional Nonhydrostatic Unified Model of the Atmosphere (NUMA) , 2013, SIAM J. Sci. Comput..

[21]  A. Staniforth,et al.  A new dynamical core for the Met Office's global and regional modelling of the atmosphere , 2005 .

[22]  Torsten Hoefler,et al.  MODESTO: Data-centric Analytic Optimization of Complex Stencil Programs on Heterogeneous Architectures , 2015, ICS.

[23]  Oliver Fuhrer,et al.  Using Compiler Directives to Port Large Scientific Applications to GPUs: An Example from Atmospheric Science , 2014, Parallel Process. Lett..

[24]  Tim Palmer,et al.  Climate forecasting: Build high-resolution global climate models , 2014, Nature.

[25]  Michael T. Goodrich,et al.  Parallel external memory graph algorithms , 2010, 2010 IEEE International Symposium on Parallel & Distributed Processing (IPDPS).

[26]  A. P. Siebesma,et al.  Climate goals and computing the future of clouds , 2017 .

[27]  D. Williamson,et al.  A baroclinic instability test case for atmospheric model dynamical cores , 2006 .

[28]  C. Osuna,et al.  Operational numerical weather prediction on a GPU-accelerated cluster supercomputer , 2016 .

[29]  William C. Skamarock,et al.  Evaluation of Global Atmospheric Solvers Using Extensions of the Jablonowski and Williamson Baroclinic Wave Test Case , 2013 .

[30]  D. Durran Numerical Methods for Fluid Dynamics: With Applications to Geophysics , 2010 .

[31]  C. Bretherton,et al.  Convective self‐aggregation feedbacks in near‐global cloud‐resolving simulations of an aquaplanet , 2015 .

[32]  Pradeep Dubey,et al.  Debunking the 100X GPU vs. CPU myth: an evaluation of throughput computing on CPU and GPU , 2010, ISCA.

[33]  Hirofumi Tomita,et al.  Performance Analysis and Optimization of Nonhydrostatic ICosahedral Atmospheric Model (NICAM) on the K Computer and TSUBAME2.5 , 2016, PASC.

[34]  Chris Hope,et al.  The $10 trillion value of better information about the transient climate response , 2015, Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences.

[35]  Louis J. Wicker,et al.  Time-Splitting Methods for Elastic Models Using Forward Time Schemes , 2002 .

[36]  Giovanni Aloisio,et al.  CPMIP: Measurements of Real Computational Performance of Earth System Models , 2016 .

[37]  D. Durran Numerical Methods for Fluid Dynamics , 2010 .

[38]  John Shalf,et al.  Exascale Computing Technology Challenges , 2010, VECPAR.

[39]  G. Doms,et al.  The Nonhydrostatic Limited-Area Model LM (Lokal-Modell) of DWD: Part I: Scientific Documentation (Ve , 1999 .

[40]  Veronika Eyring,et al.  Overview of the Coupled Model Intercomparison Project Phase 6 (CMIP6) experimental design and organization , 2015 .

[41]  D. Ricard,et al.  Kinetic energy spectra characteristics of two convection‐permitting limited‐area models AROME and Meso‐NH , 2013 .

[42]  Samuel Williams,et al.  Roofline: an insightful visual performance model for multicore architectures , 2009, CACM.

[43]  Chao Yang,et al.  10M-Core Scalable Fully-Implicit Solver for Nonhydrostatic Atmospheric Dynamics , 2016, SC16: International Conference for High Performance Computing, Networking, Storage and Analysis.

[44]  A. Bott A positive definite advection scheme obtained by nonlinear renormalization of the advective fluxes , 1989 .

[45]  Hiroaki Miura,et al.  A Madden-Julian Oscillation Event Realistically Simulated by a Global Cloud-Resolving Model , 2007, Science.

[46]  Jeffrey Scott Vitter,et al.  External memory algorithms and data structures: dealing with massive data , 2001, CSUR.

[47]  H. Yashiro,et al.  Deep moist atmospheric convection in a subkilometer global simulation , 2013 .

[48]  Andreas H. Fink,et al.  Secondary Cyclogenesis along an Occluded Front Leading to Damaging Wind Gusts: Windstorm Kyrill, January 2007 , 2015 .

[49]  A. P. Siebesma,et al.  Clouds, circulation and climate sensitivity , 2015 .

[50]  Andrew A. Chien,et al.  The future of microprocessors , 2011, Commun. ACM.

[51]  C. Schär,et al.  Heavy precipitation in a changing climate: Does short‐term summer precipitation increase faster? , 2015 .

[52]  J. Steppeler,et al.  Meso-gamma scale forecasts using the nonhydrostatic model LM , 2003 .

[53]  M. Baldauf,et al.  Operational Convective-Scale Numerical Weather Prediction with the COSMO Model: Description and Sensitivities , 2011 .

[54]  C. Bretherton,et al.  Clouds and Aerosols , 2013 .

[55]  Tobias Gysi,et al.  STELLA: a domain-specific tool for structured grid methods in weather and climate models , 2015, SC15: International Conference for High Performance Computing, Networking, Storage and Analysis.

[56]  H. Fowler,et al.  Heavier summer downpours with climate change revealed by weather forecast resolution model , 2014 .

[57]  Tao Zhang,et al.  DAG Based Multipath Routing Algorithm for Load Balancing in Machine-to-Machine Networks , 2014, Int. J. Distributed Sens. Networks.