Can deep learning beat numerical weather prediction?

The recent hype about artificial intelligence has sparked renewed interest in applying the successful deep learning (DL) methods for image recognition, speech recognition, robotics, strategic games and other application areas to the field of meteorology. There is some evidence that better weather forecasts can be produced by introducing big data mining and neural networks into the weather prediction workflow. Here, we discuss the question of whether it is possible to completely replace the current numerical weather models and data assimilation systems with DL approaches. This discussion entails a review of state-of-the-art machine learning concepts and their applicability to weather data with its pertinent statistical properties. We think that it is not inconceivable that numerical weather models may one day become obsolete, but a number of fundamental breakthroughs are needed before this goal comes into reach. This article is part of the theme issue ‘Machine learning for weather and climate modelling’.

[1]  Alexei A. Efros,et al.  Context Encoders: Feature Learning by Inpainting , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[2]  Yoshua Bengio,et al.  Object Recognition with Gradient-Based Learning , 1999, Shape, Contour and Grouping in Computer Vision.

[3]  A. Comrie Comparing Neural Networks and Regression Models for Ozone Forecasting , 1997 .

[4]  Hirofumi Tomita,et al.  A new dynamical framework of nonhydrostatic global model using the icosahedral grid , 2004 .

[5]  Yann LeCun,et al.  Deep multi-scale video prediction beyond mean square error , 2015, ICLR.

[6]  Daniel L. Silver,et al.  Guest editor’s introduction: special issue on inductive transfer learning , 2008, Machine Learning.

[7]  Aida Alvera-Azcárate,et al.  DINCAE 1.0: a convolutional neural network with error estimates to reconstruct sea surface temperature satellite observations , 2020 .

[8]  M. Köhler,et al.  A Dual Mass Flux Framework for Boundary Layer Convection. Part I: Transport , 2009 .

[9]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[10]  Li Fei-Fei,et al.  Perceptual Losses for Real-Time Style Transfer and Super-Resolution , 2016, ECCV.

[11]  M. Schultz,et al.  IntelliO3-ts v1.0: a neural network approach to predict near-surface ozone concentrations in Germany , 2021 .

[12]  Jürgen Schmidhuber,et al.  Recurrent nets that time and count , 2000, Proceedings of the IEEE-INNS-ENNS International Joint Conference on Neural Networks. IJCNN 2000. Neural Computing: New Challenges and Perspectives for the New Millennium.

[13]  Vahid Nourani,et al.  Forecasting Daily Precipitation Using Hybrid Model of Wavelet-Artificial Neural Network and Comparison with Adaptive Neurofuzzy Inference System (Case Study: Verayneh Station, Nahavand) , 2014 .

[14]  Thomas Vandal,et al.  Intercomparison of machine learning methods for statistical downscaling: the case of daily and extreme precipitation , 2017, Theoretical and Applied Climatology.

[15]  Joshua B. Tenenbaum,et al.  Human-level concept learning through probabilistic program induction , 2015, Science.

[16]  A. H. Murphy,et al.  Skill Scores and Correlation Coefficients in Model Verification , 1989 .

[17]  Yann Le Cun,et al.  A Theoretical Framework for Back-Propagation , 1988 .

[18]  William C. Skamarock,et al.  A unified approach to energy conservation and potential vorticity dynamics for arbitrarily-structured C-grids , 2010, J. Comput. Phys..

[19]  Nicolas Thome,et al.  Disentangling Physical Dynamics From Unknown Factors for Unsupervised Video Prediction , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[20]  Bernt Schiele,et al.  Generative Adversarial Text to Image Synthesis , 2016, ICML.

[21]  Diederik P. Kingma,et al.  An Introduction to Variational Autoencoders , 2019, Found. Trends Mach. Learn..

[22]  W S McCulloch,et al.  A logical calculus of the ideas immanent in nervous activity , 1990, The Philosophy of Artificial Intelligence.

[23]  Navdeep Jaitly,et al.  Adversarial Autoencoders , 2015, ArXiv.

[24]  Michael S. Bernstein,et al.  ImageNet Large Scale Visual Recognition Challenge , 2014, International Journal of Computer Vision.

[25]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[26]  Luís Torgo,et al.  SMOTE for Regression , 2013, EPIA.

[27]  Yoshua Bengio,et al.  Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling , 2014, ArXiv.

[28]  Demis Hassabis,et al.  Mastering the game of Go with deep neural networks and tree search , 2016, Nature.

[29]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[30]  Anuj Karpatne,et al.  Physics Guided RNNs for Modeling Dynamical Systems: A Case Study in Simulating Lake Temperature Profiles , 2018, SDM.

[31]  Philip S. Yu,et al.  PredRNN: Recurrent Neural Networks for Predictive Learning using Spatiotemporal LSTMs , 2017, NIPS.

[32]  Guangquan Zhang,et al.  Learning under Concept Drift: A Review , 2019, IEEE Transactions on Knowledge and Data Engineering.

[33]  Charles A. Doswell,et al.  Precipitation Forecasting Using a Neural Network , 1999 .

[34]  N. Roberts,et al.  Scale-Selective Verification of Rainfall Accumulations from High-Resolution Forecasts of Convective Events , 2008 .

[35]  Ziyin Liu,et al.  Neural Networks Fail to Learn Periodic Functions and How to Fix It , 2020, NeurIPS.

[36]  D. Lazer,et al.  The Parable of Google Flu: Traps in Big Data Analysis , 2014, Science.

[37]  Dara Entekhabi,et al.  An ensemble‐based reanalysis approach to land data assimilation , 2005 .

[38]  Reinhard Klein,et al.  Unsupervised Deep Learning of Incompressible Fluid Dynamics , 2020, ArXiv.

[39]  Dumitru Erhan,et al.  Going deeper with convolutions , 2014, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[40]  Stephan Rasp,et al.  Neural networks for post-processing ensemble weather forecasts , 2018, Monthly Weather Review.

[41]  Yoshua Bengio,et al.  Modeling Cloud Reflectance Fields using Conditional Generative Adversarial Networks , 2020 .

[42]  R. Bannister A review of operational methods of variational and ensemble‐variational data assimilation , 2017 .

[43]  A. Arakawa,et al.  A Unified Representation of Deep Moist Convection in Numerical Modeling of the Atmosphere. Part II , 2013 .

[44]  Xiao Xiang Zhu,et al.  Deep Learning in Remote Sensing: A Comprehensive Review and List of Resources , 2017, IEEE Geoscience and Remote Sensing Magazine.

[45]  M. Diamantakis,et al.  An inherently mass‐conserving semi‐implicit semi‐Lagrangian discretization of the deep‐atmosphere global non‐hydrostatic equations , 2014 .

[46]  Marco Baroni,et al.  Generalization without Systematicity: On the Compositional Skills of Sequence-to-Sequence Recurrent Networks , 2017, ICML.

[47]  Yu Zhou,et al.  Polarimetric SAR Image Classification Using Deep Convolutional Neural Networks , 2016, IEEE Geoscience and Remote Sensing Letters.

[48]  Yoshua Bengio,et al.  Learning long-term dependencies with gradient descent is difficult , 1994, IEEE Trans. Neural Networks.

[49]  Marc Pollefeys,et al.  KloudNet: Deep Learning for Sky Image Analysis and Irradiance Forecasting , 2018, GCPR.

[50]  Eric Gilleland,et al.  Intercomparison of Spatial Forecast Verification Methods , 2009 .

[51]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[52]  Max Welling,et al.  Auto-Encoding Variational Bayes , 2013, ICLR.

[53]  R. Dmowska,et al.  International Geophysics Series , 1992 .

[54]  Dit-Yan Yeung,et al.  Towards Bayesian Deep Learning: A Survey , 2016, ArXiv.

[55]  Jennifer G. Dy,et al.  Quantifying Uncertainty in Discrete-Continuous and Skewed Data with Bayesian Deep Learning , 2018, KDD.

[56]  Almut Gassmann Discretization of generalized Coriolis and friction terms on the deformed hexagonal C‐grid , 2018, Quarterly Journal of the Royal Meteorological Society.

[57]  Tali Dekel,et al.  SinGAN: Learning a Generative Model From a Single Natural Image , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[58]  Christiane Jablonowski,et al.  MCore: A non-hydrostatic atmospheric dynamical core utilizing high-order finite-volume methods , 2012, J. Comput. Phys..

[59]  S. Pelland,et al.  Solar and photovoltaic forecasting through post‐processing of the Global Environmental Multiscale numerical weather prediction model , 2013 .

[60]  Peter Bauer,et al.  Challenges and design choices for global weather and climate models based on machine learning , 2018, Geoscientific Model Development.

[61]  Nagiza F. Samatova,et al.  Theory-Guided Data Science: A New Paradigm for Scientific Discovery from Data , 2016, IEEE Transactions on Knowledge and Data Engineering.

[62]  Dit-Yan Yeung,et al.  Towards Bayesian Deep Learning: A Framework and Some Existing Methods , 2016, IEEE Transactions on Knowledge and Data Engineering.

[63]  Eric Horvitz,et al.  A Deep Hybrid Model for Weather Forecasting , 2015, KDD.

[64]  Huiqing Wen,et al.  Deep Learning Based Multistep Solar Forecasting for PV Ramp-Rate Control Using Sky Images , 2021, IEEE Transactions on Industrial Informatics.

[65]  Francesco Ravazzolo,et al.  Forecaster's Dilemma: Extreme Events and Forecast Evaluation , 2015, 1512.09244.

[66]  Zoubin Ghahramani,et al.  Deep Bayesian Active Learning with Image Data , 2017, ICML.

[67]  Bin Yang,et al.  Correlated Time Series Forecasting using Deep Neural Networks: A Summary of Results , 2018, ArXiv.

[68]  Geoffrey E. Hinton,et al.  Learning internal representations by error propagation , 1986 .

[69]  Yarin Gal,et al.  Uncertainty in Deep Learning , 2016 .

[70]  Yoshua Bengio,et al.  Generative Adversarial Nets , 2014, NIPS.

[71]  David Vázquez,et al.  PixelVAE: A Latent Variable Model for Natural Images , 2016, ICLR.

[72]  Jianzhong Zhou,et al.  Probabilistic spatiotemporal wind speed forecasting based on a variational Bayesian deep learning model , 2020 .

[73]  G. Zängl,et al.  The ICON (ICOsahedral Non‐hydrostatic) modelling framework of DWD and MPI‐M: Description of the non‐hydrostatic dynamical core , 2015 .

[74]  Yugang Niu,et al.  Hourly day-ahead solar irradiance prediction using weather forecasts by LSTM , 2018 .

[75]  Alexander Binder,et al.  Unmasking Clever Hans predictors and assessing what machines really learn , 2019, Nature Communications.

[76]  Michael Weniger,et al.  Spatial verification using wavelet transforms: a review , 2016, 1605.03395.

[77]  Constantinos S. Pattichis,et al.  Artificial neural networks in forecasting minimum temperature (weather) , 1991 .

[78]  Paris Perdikaris,et al.  Physics-Constrained Deep Learning for High-dimensional Surrogate Modeling and Uncertainty Quantification without Labeled Data , 2019, J. Comput. Phys..

[79]  S. Serrar,et al.  Fast atmospheric response to a sudden thinning of Arctic sea ice , 2016, Climate Dynamics.

[80]  Zoubin Ghahramani,et al.  Dropout as a Bayesian Approximation: Representing Model Uncertainty in Deep Learning , 2015, ICML.

[81]  Paul Smolensky,et al.  Information processing in dynamical systems: foundations of harmony theory , 1986 .

[82]  Anuj Karpatne,et al.  Physics-guided Neural Networks (PGNN): An Application in Lake Temperature Modeling , 2017, ArXiv.

[83]  Peter Bauer,et al.  The quiet revolution of numerical weather prediction , 2015, Nature.

[84]  Paris Perdikaris,et al.  Physics-informed neural networks: A deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations , 2019, J. Comput. Phys..

[85]  Elizabeth E. Ebert,et al.  Fuzzy verification of high‐resolution gridded forecasts: a review and proposed framework , 2008 .

[86]  L. Magnusson,et al.  Advancements in Hurricane Prediction With NOAA's Next‐Generation Forecast System , 2019, Geophysical Research Letters.

[87]  I. Orlanski A rational subdivision of scales for atmospheric processes , 1975 .

[88]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[89]  Vladimir M. Krasnopolsky,et al.  A neural network technique to improve computational efficiency of numerical oceanic models , 2002 .

[90]  R. Neggers A Dual Mass Flux Framework for Boundary Layer Convection. Part II: Clouds , 2009 .

[91]  Petra Friederichs,et al.  Assessment of wavelet-based spatial verification by means of a stochastic precipitation model (wv_verif v0.1.0) , 2019, Geoscientific Model Development.

[92]  O. Kisi,et al.  Wavelet and neuro-fuzzy conjunction model for precipitation forecasting , 2007 .

[93]  Geoffrey E. Hinton,et al.  Reducing the Dimensionality of Data with Neural Networks , 2006, Science.

[94]  Juanzhen Sun,et al.  Radar Data Assimilation with WRF 4D-Var. Part I: System Development and Preliminary Testing , 2013 .

[95]  Àngela Nebot,et al.  Local Maximum Ozone Concentration Prediction Using Soft Computing Methodologies , 2003 .

[96]  J. Beckers,et al.  DINCAE 1.0: a convolutional neural network with error estimates to reconstruct sea surface temperature satellite observations , 2019, Geoscientific Model Development.

[97]  Tijana Janjić,et al.  Assimilation of Mode-S EHS Aircraft Observations in COSMO-KENDA , 2016 .

[98]  Peter Lynch,et al.  The origins of computer weather prediction and climate modeling , 2008, J. Comput. Phys..

[99]  L. Denby Discovering the Importance of Mesoscale Cloud Organization Through Unsupervised Classification , 2020, Geophysical Research Letters.

[100]  Roland Potthast,et al.  Kilometre‐scale ensemble data assimilation for the COSMO model (KENDA) , 2016 .

[101]  Jun Du,et al.  Satellite Image Prediction Relying on GAN and LSTM Neural Networks , 2019, ICC 2019 - 2019 IEEE International Conference on Communications (ICC).

[102]  Jacek Tabor,et al.  Processing of missing data by neural networks , 2018, NeurIPS.

[103]  Philip H. Ramsey Statistical Methods in the Atmospheric Sciences , 2005 .

[104]  Honglak Lee,et al.  Action-Conditional Video Prediction using Deep Networks in Atari Games , 2015, NIPS.

[105]  B. Brown,et al.  The Setup of the MesoVICT Project , 2018, Bulletin of the American Meteorological Society.

[106]  Soumith Chintala,et al.  Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks , 2015, ICLR.

[107]  Jean-Noël Thépaut,et al.  The Global Observing System , 2010 .

[108]  Sergey Levine,et al.  Stochastic Adversarial Video Prediction , 2018, ArXiv.

[109]  Jürgen Schmidhuber,et al.  Deep learning in neural networks: An overview , 2014, Neural Networks.

[110]  T. Vukicevic,et al.  The Effect of Linearization Errors on 4DVAR Data Assimilation , 1998 .

[111]  Niels Bormann,et al.  Representing Equilibrium and Nonequilibrium Convection in Large-Scale Models , 2014 .

[112]  Barbara G. Brown,et al.  Forecast verification: current status and future directions , 2008 .

[113]  Young Cheol Kwon,et al.  A Mass-Flux Cumulus Parameterization Scheme across Gray-Zone Resolutions , 2017 .

[114]  Chaitanya Manapragada,et al.  Learning Under Concept Drift , 2021 .

[115]  Pietro Perona,et al.  One-shot learning of object categories , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[116]  Yunqian Ma,et al.  Imbalanced Learning: Foundations, Algorithms, and Applications , 2013 .

[117]  Vladimir M. Krasnopolsky,et al.  A Neural Network Nonlinear Multimodel Ensemble to Improve Precipitation Forecasts over Continental US , 2012 .

[118]  Chuan Li,et al.  Precomputed Real-Time Texture Synthesis with Markovian Generative Adversarial Networks , 2016, ECCV.

[119]  Eric P. Xing,et al.  Nonparametric Variational Auto-Encoders for Hierarchical Representation Learning , 2017, 2017 IEEE International Conference on Computer Vision (ICCV).

[120]  Alexei A. Efros,et al.  Image-to-Image Translation with Conditional Adversarial Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[121]  Soukayna Mouatadid,et al.  WeatherBench: A Benchmark Data Set for Data‐Driven Weather Forecasting , 2020, Journal of Advances in Modeling Earth Systems.

[122]  Chong Wang,et al.  Deep Speech 2 : End-to-End Speech Recognition in English and Mandarin , 2015, ICML.

[123]  A. Arakawa,et al.  A Unified Representation of Deep Moist Convection in Numerical Modeling of the Atmosphere. Part I , 2013 .

[124]  Stefan Wermter,et al.  Continual Lifelong Learning with Neural Networks: A Review , 2019, Neural Networks.

[125]  Dit-Yan Yeung,et al.  Convolutional LSTM Network: A Machine Learning Approach for Precipitation Nowcasting , 2015, NIPS.

[126]  Qing-Shan Jia,et al.  Review of wind power forecasting methods: From multi-spatial and temporal perspective , 2017, 2017 36th Chinese Control Conference (CCC).

[127]  P. O'Gorman,et al.  Using Machine Learning to Parameterize Moist Convection: Potential for Modeling of Climate, Climate Change, and Extreme Events , 2018, Journal of Advances in Modeling Earth Systems.

[128]  B. Krishna,et al.  Monthly Rainfall Prediction Using Wavelet Neural Network Analysis , 2013, Water Resources Management.

[129]  Wei Xiong,et al.  Learning to Generate Time-Lapse Videos Using Multi-stage Dynamic Generative Adversarial Networks , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[130]  Andrew Zisserman,et al.  Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[131]  Patrick Gallinari,et al.  Deep learning for physical processes: incorporating prior scientific knowledge , 2017, ICLR.

[132]  Philip S. Yu,et al.  Memory in Memory: A Predictive Neural Network for Learning Higher-Order Non-Stationarity From Spatiotemporal Dynamics , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[133]  N. Bormann,et al.  The growing impact of satellite observations sensitive to humidity, cloud and precipitation , 2017 .

[134]  Yi Lu Murphey,et al.  Weather Recognition Based on Edge Deterioration and Convolutional Neural Networks , 2018, 2018 24th International Conference on Pattern Recognition (ICPR).

[135]  Vladimir M. Krasnopolsky,et al.  Complex hybrid models combining deterministic and machine learning components for numerical climate modeling and weather prediction , 2006, Neural Networks.

[136]  Özgür Kisi,et al.  Precipitation forecasting by using wavelet-support vector machine conjunction model , 2012, Eng. Appl. Artif. Intell..

[137]  Vishal M. Patel,et al.  Image De-Raining Using a Conditional Generative Adversarial Network , 2017, IEEE Transactions on Circuits and Systems for Video Technology.

[138]  R. C. Macridis A review , 1963 .

[139]  Roberto Cipolla,et al.  Concrete Problems for Autonomous Vehicle Safety: Advantages of Bayesian Deep Learning , 2017, IJCAI.

[140]  A. Gastli,et al.  Review of the use of Numerical Weather Prediction (NWP) Models for wind energy assessment , 2010 .

[141]  K. D. Beheng,et al.  A two-moment cloud microphysics parameterization for mixed-phase clouds. Part 1: Model description , 2006 .

[142]  Fuqing Zhang,et al.  Coupling ensemble Kalman filter with four-dimensional variational data assimilation , 2009 .

[143]  Joachim Denzler,et al.  Deep learning and process understanding for data-driven Earth system science , 2019, Nature.

[144]  Douglas W. Nychka,et al.  Interpretable Deep Learning for Spatial Analysis of Severe Hailstorms , 2019, Monthly Weather Review.