Spatio-temporal Weather Forecasting and Attention Mechanism on Convolutional LSTMs

Numerical weather forecasting on high-resolution physical models consume hours of computations on supercomputers. Application of deep learning and machine learning methods in forecasting revealed new solutions in this area. In this paper, we forecast high-resolution numeric weather data using both input weather data and observations by providing a novel deep learning architecture. We formulate the problem as spatiotemporal prediction. Our model is composed of Convolutional Long-short Term Memory, and Convolutional Neural Network units with encoder-decoder structure. We enhance the shortlong term performance and interpretability with an attention and a context matcher mechanism. We perform experiments on high-scale, real-life, benchmark numerical weather dataset, ERA5 hourly data on pressure levels, and forecast the temperature. The results show significant improvements in capturing both spatial and temporal correlations with attention matrices focusing on different parts of the input series. Our model obtains the best validation and the best test score among the baseline models, including ConvLSTM forecasting network and U-Net. We provide qualitative and quantitative results and show that our model forecasts 10 time steps with 3 hour frequency with an average of 2 degrees error. Our code and the data are publicly available.

[1]  Wang-chun Woo,et al.  Operational Application of Optical Flow Techniques to Radar-Based Rainfall Nowcasting , 2017 .

[2]  Afan Galih Salman,et al.  Weather forecasting using deep learning techniques , 2015, 2015 International Conference on Advanced Computer Science and Information Systems (ICACSIS).

[3]  F. Molteni,et al.  The ECMWF Ensemble Prediction System: Methodology and validation , 1996 .

[4]  Chaker El Amrani,et al.  Sequence to Sequence Weather Forecasting with Long Short-Term Memory Recurrent Neural Networks , 2016 .

[5]  Hugo Larochelle,et al.  Modulating early visual processing by language , 2017, NIPS.

[6]  Sushil J. Louis,et al.  Forecasting the weather of Nevada: A deep learning approach , 2015, 2015 International Joint Conference on Neural Networks (IJCNN).

[7]  Irving P. Krick,et al.  A Physical Basis for Long-Range Weather Forecasts , 1938 .

[8]  Jordan G. Powers,et al.  The Weather Research and Forecasting Model: Overview, System Efforts, and Future Directions , 2017 .

[9]  Jason Hickey,et al.  Machine Learning for Precipitation Nowcasting from Radar Images , 2019, ArXiv.

[10]  Pradeep Hewage,et al.  Deep learning-based effective fine-grained weather forecasting model , 2020, Pattern Analysis and Applications.

[11]  Saifur Rahman,et al.  Day-ahead building-level load forecasts using deep learning vs. traditional time-series techniques , 2019, Applied Energy.

[12]  S. Poornima,et al.  Prediction of Rainfall Using Intensified LSTM Based Recurrent Neural Network with Weighted Linear Units , 2019, Atmosphere.

[13]  Marco Aurélio Stelmar Netto,et al.  DeepDownscale: A Deep Learning Strategy for High-Resolution Weather Forecast , 2018, 2018 IEEE 14th International Conference on e-Science (e-Science).

[14]  J. Thepaut,et al.  The ERA5 global reanalysis , 2020, Quarterly Journal of the Royal Meteorological Society.

[15]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[16]  Hsiang-Fu Yu,et al.  Think Globally, Act Locally: A Deep Neural Network Approach to High-Dimensional Time Series Forecasting , 2019, NeurIPS.

[17]  Vladlen Koltun,et al.  An Empirical Evaluation of Generic Convolutional and Recurrent Networks for Sequence Modeling , 2018, ArXiv.

[18]  Bin Wang,et al.  Deep Uncertainty Quantification: A Machine Learning Approach for Weather Forecasting , 2018, KDD.

[19]  Dit-Yan Yeung,et al.  Deep Learning for Precipitation Nowcasting: A Benchmark and A New Model , 2017, NIPS.

[20]  Afan Galih Salman,et al.  Recurrent gradient descent adaptive learning rate and momentum neural network for rainfall forecasting , 2016, 2016 International Seminar on Application for Technology of Information and Communication (ISemantic).

[21]  Yoshua Bengio,et al.  On the Properties of Neural Machine Translation: Encoder–Decoder Approaches , 2014, SSST@EMNLP.

[22]  Fernando De la Torre,et al.  Spatio-Temporal Matching for Human Pose Estimation in Video , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[23]  Doina Precup,et al.  Hierarchical Spatio-Temporal Probabilistic Graphical Model with Multiple Feature Fusion for Binary Facial Attribute Classification in Real-World Face Videos , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[24]  Heiga Zen,et al.  WaveNet: A Generative Model for Raw Audio , 2016, SSW.

[25]  Jeffrey L. Elman,et al.  Finding Structure in Time , 1990, Cogn. Sci..

[26]  Aaron C. Courville,et al.  FiLM: Visual Reasoning with a General Conditioning Layer , 2017, AAAI.

[27]  Garrison W. Cottrell,et al.  A Dual-Stage Attention-Based Recurrent Neural Network for Time Series Prediction , 2017, IJCAI.

[28]  Chao Dong,et al.  Recovering Realistic Texture in Image Super-Resolution by Deep Spatial Feature Transform , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[29]  M. Déqué,et al.  The ARPEGE/IFS atmosphere model: a contribution to the French community climate modelling , 1994 .

[30]  David W. S. Wong,et al.  An adaptive inverse-distance weighting spatial interpolation technique , 2008, Comput. Geosci..

[31]  Yoshua Bengio,et al.  Learning Phrase Representations using RNN Encoder–Decoder for Statistical Machine Translation , 2014, EMNLP.

[32]  Mohammed Bennamoun,et al.  Attention in Convolutional LSTM for Gesture Recognition , 2018, NeurIPS.

[33]  Dit-Yan Yeung,et al.  Convolutional LSTM Network: A Machine Learning Approach for Precipitation Nowcasting , 2015, NIPS.

[34]  Xiaoling Zhang,et al.  Forecasting Different Types of Convective Weather: A Deep Learning Approach , 2019, Journal of Meteorological Research.

[35]  Stephen J. Maybank,et al.  Learning Human Actions by Combining Global Dynamics and Local Appearance , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[36]  Sa-Kwang Song,et al.  Computer Vision in Precipitation Nowcasting: Applying Image Quality Assessment Metrics for Training Deep Neural Networks , 2019, Atmosphere.

[37]  George Athanasopoulos,et al.  Forecasting: principles and practice , 2013 .

[38]  Gang Wang,et al.  Skeleton-Based Action Recognition Using Spatio-Temporal LSTM Network with Trust Gates , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[39]  Yong Wang,et al.  The ALADIN System and its canonical model configurations AROME CY41T1 and ALARO CY40T1 , 2017 .

[40]  Siamak Mehrkanoon,et al.  Deep multi-stations weather forecasting: explainable recurrent convolutional neural networks , 2020, ArXiv.

[41]  Lars Herre,et al.  Neural networks for GEFCom2017 probabilistic load forecasting , 2019, International Journal of Forecasting.

[42]  Qian Li,et al.  AENN: A GENERATIVE ADVERSARIAL NEURAL NETWORK FOR WEATHER RADAR ECHO EXTRAPOLATION , 2019, The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences.

[43]  Joachim Denzler,et al.  Detecting Regions of Maximal Divergence for Spatio-Temporal Anomaly Detection , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[44]  Ko Nishino,et al.  Tracking Pedestrians Using Local Spatio-Temporal Motion Patterns in Extremely Crowded Scenes , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[45]  Alexander Heye,et al.  Precipitation Nowcasting : Leveraging Deep Recurrent Convolutional Neural Networks , 2017 .

[46]  Soukayna Mouatadid,et al.  WeatherBench: A Benchmark Data Set for Data‐Driven Weather Forecasting , 2020, Journal of Advances in Modeling Earth Systems.

[47]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[48]  Slawek Smyl,et al.  A hybrid method of exponential smoothing and recurrent neural networks for time series forecasting , 2020, International Journal of Forecasting.

[49]  Xianfeng Tang,et al.  Revisiting Spatial-Temporal Similarity: A Deep Learning Framework for Traffic Prediction , 2018, AAAI.

[50]  Hyojin Kim,et al.  Deep-Hurricane-Tracker: Tracking and Forecasting Extreme Climate Events , 2019, 2019 IEEE Winter Conference on Applications of Computer Vision (WACV).

[51]  Quoc V. Le,et al.  Sequence to Sequence Learning with Neural Networks , 2014, NIPS.

[52]  Wenjun Zeng,et al.  An End-to-End Spatio-Temporal Attention Model for Human Action Recognition from Skeleton Data , 2016, AAAI.

[53]  L. J. Eun,et al.  Application of Artificial Neural Networks to Rainfall Forecasting in the Geum River Basin, Korea , 2018 .

[54]  Yu-Lin He,et al.  Deep Neural Network Modeling for Big Data Weather Forecasting , 2015 .