Data‐Driven Medium‐Range Weather Prediction With a Resnet Pretrained on Climate Simulations: A New Model for WeatherBench

Numerical weather prediction has traditionally been based on physical models of the atmosphere. Recently, however, the rise of deep learning has created increased interest in purely data-driven medium-range weather forecasting with first studies exploring the feasibility of such an approach. To accelerate progress in this area, the WeatherBench benchmark challenge was defined. Here, we train a deep residual convolutional neural network (Resnet) to predict geopotential, temperature and precipitation at 5.625 degree resolution up to 5 days ahead. To avoid overfitting and improve forecast skill, we pretrain the model using historical climate model output before fine-tuning on reanalysis data. The resulting forecasts outperform previous submissions to WeatherBench and are comparable in skill to a physical baseline at similar resolution. We also analyze how the neural network creates its predictions and find that, with some exceptions, it is compatible with physical reasoning. Finally, we perform scaling experiments to estimate the potential skill of data-driven approaches at higher resolutions.

[1]  Jonathan A. Weyn,et al.  Can Machines Learn to Predict Weather? Using Deep Learning to Predict Gridded 500‐hPa Geopotential Height From Historical Weather Data , 2019, Journal of Advances in Modeling Earth Systems.

[2]  Stephan Rasp,et al.  Neural networks for post-processing ensemble weather forecasts , 2018, Monthly Weather Review.

[3]  Sebastian Scher,et al.  Generalization properties of neural networks trained on Lorenzsystems , 2019 .

[4]  Peter Bauer,et al.  Challenges and design choices for global weather and climate models based on machine learning , 2018, Geoscientific Model Development.

[5]  S. E. Haupt,et al.  Using Artificial Intelligence to Improve Real-Time Decision-Making for High-Impact Weather , 2017 .

[6]  Olivier Mestre,et al.  Calibrated Ensemble Forecasts Using Quantile Regression Forests and Ensemble Model Output Statistics , 2016 .

[7]  Veronika Eyring,et al.  Overview of the Coupled Model Intercomparison Project Phase 6 (CMIP6) experimental design and organization , 2015 .

[8]  Imme Ebert-Uphoff,et al.  Evaluation, Tuning, and Interpretation of Neural Networks for Working with Images in Meteorological Applications , 2020 .

[9]  M. Rodwell,et al.  Characteristics of occasional poor medium-range weather forecasts for Europe , 2013 .

[10]  Thomas Brox,et al.  U-Net: Convolutional Networks for Biomedical Image Segmentation , 2015, MICCAI.

[11]  B. Hoskins,et al.  On the use and significance of isentropic potential vorticity maps , 2007 .

[12]  J. Thepaut,et al.  The ERA5 global reanalysis , 2020, Quarterly Journal of the Royal Meteorological Society.

[13]  B. Ancell,et al.  Comparing Adjoint- and Ensemble-Sensitivity Analysis with Applications to Observation Targeting , 2007 .

[14]  R. Rotunno,et al.  Mesoscale Predictability of Moist Baroclinic Waves: Convection-Permitting Experiments and Multistage Error Growth Dynamics , 2007 .

[15]  Andrew Gettelman,et al.  The Art and Science of Climate Model Tuning , 2017 .

[16]  Peter Bauer,et al.  The quiet revolution of numerical weather prediction , 2015, Nature.

[17]  Andrew Zisserman,et al.  Deep Inside Convolutional Networks: Visualising Image Classification Models and Saliency Maps , 2013, ICLR.

[18]  Jian Lu,et al.  High Resolution Model Intercomparison Project (HighResMIP v1.0) for CMIP6 , 2016 .

[19]  Sebastian Scher,et al.  Toward Data‐Driven Weather and Climate Forecasting: Approximating a Simple General Circulation Model With Deep Learning , 2018, Geophysical Research Letters.