Predicting clustered weather patterns: A test case for applications of convolutional neural networks to spatio-temporal climate data

Deep learning techniques such as convolutional neural networks (CNNs) can potentially provide powerful tools for classifying, identifying, and predicting patterns in climate and environmental data. However, because of the inherent complexities of such data, which are often spatio-temporal, chaotic, and non-stationary, the CNN algorithms must be designed/evaluated for each specific dataset and application. Yet CNN, being a supervised technique, requires a large labeled dataset to start. Labeling demands (human) expert time which, combined with the limited number of relevant examples in this area, can discourage using CNNs for new problems. To address these challenges, here we (1) Propose an effective auto-labeling strategy based on using an unsupervised clustering algorithm and evaluating the performance of CNNs in re-identifying and predicting these clusters up to 5 days ahead of time; (2) Use this approach to label thousands of daily large-scale weather patterns over North America in the outputs of a fully-coupled climate model and show the capabilities of CNNs in re-identifying and predicting the 4 clustered regimes up to 5 days ahead of time. The deep CNN trained with 1000 samples or more per cluster has an accuracy of 90% or better for both identification and prediction while prediction accuracy scales weakly with the number of lead days. Accuracy scales monotonically but nonlinearly with the size of the training set, e.g. reaching 94% with 3000 training samples per cluster for identification and 93–76% for prediction at lead day 1–5, outperforming logistic regression, a simpler machine learning algorithm, by  ~ 25%. Effects of architecture and hyperparameters on the performance of CNNs are examined and discussed.

[1]  S. Cessie,et al.  Ridge Estimators in Logistic Regression , 1992 .

[2]  Prabhat,et al.  Application of Deep Convolutional Neural Networks for Detecting Extreme Weather in Climate Datasets , 2016, ArXiv.

[3]  Kaiming He,et al.  Feature Pyramid Networks for Object Detection , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[4]  Elizabeth A. Barnes,et al.  Exploring recent trends in Northern Hemisphere blocking , 2014 .

[5]  Pedram Hassanzadeh,et al.  Evaluating Indices of Blocking Anticyclones in Terms of Their Linear Relations With Surface Hot Extremes , 2019, Geophysical Research Letters.

[6]  Noah D. Brenowitz,et al.  Prognostic Validation of a Neural Network Unified Physics Parameterization , 2018, Geophysical Research Letters.

[7]  Amy H. Butler,et al.  A Census of Atmospheric Variability From Seconds to Decades , 2017 .

[8]  Prabhat,et al.  North American extreme temperature events and related large scale meteorological patterns: a review of statistical methods, dynamics, modeling, and trends , 2016, Climate Dynamics.

[9]  X. Cheng,et al.  Cluster Analysis of the Northern Hemisphere Wintertime 500-hPa Height Field: Spatial Patterns , 1993 .

[10]  J. Holton An introduction to dynamic meteorology , 2004 .

[11]  Bala Rajaratnam,et al.  Contribution of changes in atmospheric circulation patterns to extreme temperature trends , 2015, Nature.

[12]  Tim N. Palmer,et al.  Nonlinear Dynamics and Climate Change: Rossby's Legacy , 1998 .

[13]  Petros Koumoutsakos,et al.  Data-driven forecasting of high-dimensional chaotic systems with long short-term memory networks , 2018, Proceedings of the Royal Society A: Mathematical, Physical and Engineering Sciences.

[14]  Dong Eun Lee,et al.  Multiscale Variability in North American Summer Maximum Temperatures and Modulations from the North Atlantic Simulated by an AGCM , 2018 .

[15]  Surendra Byna,et al.  TECA: Petascale Pattern Recognition for Climate Science , 2015, CAIP.

[16]  G. North,et al.  Empirical Orthogonal Functions: The Medium is the Message , 2009 .

[17]  Jian Sun,et al.  Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification , 2015, 2015 IEEE International Conference on Computer Vision (ICCV).

[18]  Prabhat,et al.  Semi-Supervised Detection of Extreme Weather Events in Large Climate Datasets , 2016, ArXiv.

[19]  Yuan Yu,et al.  TensorFlow: A system for large-scale machine learning , 2016, OSDI.

[20]  K.,et al.  The Community Earth System Model (CESM) large ensemble project: a community resource for studying climate change in the presence of internal climate variability , 2015 .

[21]  Stephan Rasp,et al.  Neural networks for post-processing ensemble weather forecasts , 2018, Monthly Weather Review.

[22]  Brian F. Farrell,et al.  Generalized Stability Theory. Part II: Nonautonomous Operators , 1996 .

[23]  John M. Wallace,et al.  Cluster analysis of Northern Hemisphere wintertime 500-hPa flow regimes during 1920-2014 , 2015 .

[24]  Saleh Nabi,et al.  Reduced-order modeling of fully turbulent buoyancy-driven flows using the Green's function method , 2018, Physical Review Fluids.

[25]  Peter John Huybers,et al.  Long-lead predictions of eastern United States hot days from Pacific sea surface temperatures , 2016 .

[26]  Gregory R. Herman,et al.  Money Doesn't Grow on Trees, but Forecasts Do: Forecasting Extreme Precipitation with Random Forests , 2018 .

[27]  Henk A. Dijkstra,et al.  Using network theory and machine learning to predict El Niño , 2018, Earth System Dynamics.

[28]  Devika Subramanian,et al.  Data-driven prediction of a multi-scale Lorenz 96 chaotic system using a hierarchy of deep learning methods: Reservoir computing, ANN, and RNN-LSTM , 2019, ArXiv.

[29]  Lester Mackey,et al.  S2S reboot: An argument for greater inclusion of machine learning in subseasonal to seasonal forecasts , 2018, WIREs Climate Change.

[30]  Pierre Gentine,et al.  Deep learning to represent subgrid processes in climate models , 2018, Proceedings of the National Academy of Sciences.

[31]  Geoffrey E. Hinton,et al.  Dynamic Routing Between Capsules , 2017, NIPS.

[32]  Yisong Yue,et al.  Long-term Forecasting using Tensor-Train RNNs , 2017, ArXiv.

[33]  Padhraic Smyth,et al.  Multiple Regimes in Northern Hemisphere Height Fields via MixtureModel Clustering* , 1999, Journal of the Atmospheric Sciences.

[34]  Pedram Hassanzadeh,et al.  Persistent anomalies of the extratropical Northern Hemisphere wintertime circulation as an initiator of El Niño/Southern Oscillation events , 2017, Scientific Reports.

[35]  Geoffrey E. Hinton,et al.  Deep Learning , 2015, Nature.

[36]  Max Tegmark,et al.  Why Does Deep and Cheap Learning Work So Well? , 2016, Journal of Statistical Physics.

[37]  Pedram Hassanzadeh,et al.  Blocking variability: Arctic Amplification versus Arctic Oscillation , 2015 .

[38]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[39]  J. Wallace,et al.  The Arctic oscillation signature in the wintertime geopotential height and temperature fields , 1998 .

[40]  Prabhat,et al.  ExtremeWeather: A large-scale climate dataset for semi-supervised detection, localization, and understanding of extreme weather events , 2016, NIPS.

[41]  Dit-Yan Yeung,et al.  Convolutional LSTM Network: A Machine Learning Approach for Precipitation Nowcasting , 2015, NIPS.

[42]  Amir Hossein Souri,et al.  A 15-year climatology of wind pattern impacts on surface ozone in Houston, Texas , 2016 .

[43]  J. Bezdek,et al.  FCM: The fuzzy c-means clustering algorithm , 1984 .

[44]  Tim N. Palmer,et al.  Signature of recent climate change in frequencies of natural atmospheric circulation regimes , 1999, Nature.

[45]  Anuj Karpatne,et al.  Machine Learning for the Geosciences: Challenges and Opportunities , 2017, IEEE Transactions on Knowledge and Data Engineering.

[46]  P. O'Gorman,et al.  Using Machine Learning to Parameterize Moist Convection: Potential for Modeling of Climate, Climate Change, and Extreme Events , 2018, Journal of Advances in Modeling Earth Systems.

[47]  Karthik Kashinath,et al.  Deep Learning for Scientific Inference from Geophysical Data: The Madden-Julian Oscillation as a Test Case , 2019, 1902.04621.

[48]  John Methven,et al.  Blocking and its Response to Climate Change , 2018, Current Climate Change Reports.

[49]  Pedram Hassanzadeh,et al.  The Linear Response Function of an Idealized Atmosphere. Part II: Implications for the Practical Use of the Fluctuation–Dissipation Theorem and the Role of Operator’s Nonnormality , 2016, 1603.08807.

[50]  Aditi Sheshadri,et al.  Propagating Annular Modes: Empirical Orthogonal Functions, Principal Oscillation Patterns, and Time Scales , 2017 .

[51]  Trevor Darrell,et al.  Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation , 2013, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[52]  Da Yang,et al.  Size of the Atmospheric Blocking Events: Scaling Law and Response to Climate Change , 2019, Geophysical Research Letters.

[53]  P. Swapna,et al.  Nonlinearities in the Evolutional Distinctions Between El Niño and La Niña Types , 2017 .

[54]  Tong Zhu,et al.  The impact of circulation patterns on regional transport pathways and air quality over Beijing and its surroundings , 2011 .

[55]  Liu Rui,et al.  Fuzzy c-Means Clustering Algorithm , 2008 .

[56]  Peter Bauer,et al.  Challenges and design choices for global weather and climate models based on machine learning , 2018, Geoscientific Model Development.

[57]  P. Rousseeuw Silhouettes: a graphical aid to the interpretation and validation of cluster analysis , 1987 .

[58]  Andreas Philipp,et al.  Cluster Analysis of North Atlantic-European Circulation Types and Links with Tropical Pacific Sea Surface Temperatures , 2008 .

[59]  Joachim Denzler,et al.  Deep learning and process understanding for data-driven Earth system science , 2019, Nature.

[60]  Pedram Hassanzadeh,et al.  Quantifying the eddy-jet feedback strength of the annular mode in an idealized GCM and reanalysis data , 2016, 1608.04130.

[61]  Yu Kosaka,et al.  Recent global-warming hiatus tied to equatorial Pacific surface cooling , 2013, Nature.

[62]  R. Chattopadhyay,et al.  A new method to compute the principal components from self‐organizing maps: an application to monsoon intraseasonal oscillations , 2013 .

[63]  S. P. Lloyd,et al.  Least squares quantization in PCM , 1982, IEEE Trans. Inf. Theory.

[64]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[65]  Petros,et al.  Generalized Stability Theory . Part I : Autonomous Operators , 2022 .

[66]  R. Chattopadhyay,et al.  A self‐organizing map–based ensemble forecast system for extended range prediction of active/break cycles of Indian summer monsoon , 2013 .

[67]  Yisong Yue,et al.  Long-term Forecasting using Higher Order Tensor RNNs , 2017 .

[68]  E. Tziperman,et al.  Winter Precipitation Forecast in the European and Mediterranean Regions Using Cluster Analysis , 2017 .

[69]  Pedram Hassanzadeh,et al.  Data-driven reduced modelling of turbulent Rayleigh–Bénard convection using DMD-enhanced fluctuation–dissipation theorem , 2018, Journal of Fluid Mechanics.

[70]  Sookyung Kim,et al.  Resolution Reconstruction of Climate Data with Pixel Recursive Model , 2017, 2017 IEEE International Conference on Data Mining Workshops (ICDMW).

[71]  R. Chattopadhyay,et al.  A bias-correction and downscaling technique for operational extended range forecasts based on self organizing map , 2017, Climate Dynamics.

[72]  R. Chattopadhyay,et al.  Objective Identification of Nonlinear Convectively Coupled Phases of Monsoon Intraseasonal Oscillation: Implications for Prediction , 2008 .

[73]  Devika Subramanian,et al.  Data-driven prediction of a multi-scale Lorenz 96 chaotic system using deep learning methods: Reservoir computing, ANN, and RNN-LSTM , 2019 .

[74]  Samy Bengio,et al.  Understanding deep learning requires rethinking generalization , 2016, ICLR.

[75]  K. Mo,et al.  Cluster analysis of multiple planetary flow regimes , 1988 .

[76]  Pierre Gentine,et al.  Could Machine Learning Break the Convection Parameterization Deadlock? , 2018, Geophysical Research Letters.

[77]  Elizabeth A. Barnes,et al.  A methodology for the comparison of blocking climatologies across indices, models and climate scenarios , 2012, Climate Dynamics.

[78]  Andrew Stuart,et al.  Earth System Modeling 2.0: A Blueprint for Models That Learn From Observations and Targeted High‐Resolution Simulations , 2017, 1709.00037.

[79]  Rajib Chattopadhyay,et al.  Can El Niño-Southern Oscillation (ENSO) events modulate intraseasonal oscillations of Indian summer monsoon? , 2011 .