Deep Uncertainty Quantification: A Machine Learning Approach for Weather Forecasting

Weather forecasting is usually solved through numerical weather prediction (NWP), which can sometimes lead to unsatisfactory performance due to inappropriate setting of the initial states. In this paper, we design a data-driven method augmented by an effective information fusion mechanism to learn from historical data that incorporates prior knowledge from NWP. We cast the weather forecasting problem as an end-to-end deep learning problem and solve it by proposing a novel negative log-likelihood error (NLE) loss function. A notable advantage of our proposed method is that it simultaneously implements single-value forecasting and uncertainty quantification, which we refer to as deep uncertainty quantification (DUQ). Efficient deep ensemble strategies are also explored to further improve performance. This new approach was evaluated on a public dataset collected from weather stations in Beijing, China. Experimental results demonstrate that the proposed NLE loss significantly improves generalization compared to mean squared error (MSE) loss and mean absolute error (MAE) loss. Compared with NWP, this approach significantly improves accuracy by 47.76%, which is a state-of-the-art result on this benchmark dataset.

[1]  Lewis F. Richardson,et al.  Weather Prediction by Numerical Process , 1922 .

[2]  A. Weigend,et al.  Estimating the mean and variance of the target probability distribution , 1994, Proceedings of 1994 IEEE International Conference on Neural Networks (ICNN'94).

[3]  Ravi Sankar,et al.  Time Series Prediction Using Support Vector Machines: A Survey , 2009, IEEE Computational Intelligence Magazine.

[4]  Adrian E. Raftery,et al.  Weather Forecasting with Ensemble Methods , 2005, Science.

[5]  Jürgen Schmidhuber,et al.  Applying LSTM to Time Series Predictable through Time-Window Approaches , 2000, ICANN.

[6]  Charles Blundell,et al.  Simple and Scalable Predictive Uncertainty Estimation using Deep Ensembles , 2016, NIPS.

[7]  Xueqi Cheng,et al.  NeuCast: Seasonal Neural Forecast of Power Grid Time Series , 2018, IJCAI.

[8]  A. H. Murphy,et al.  What Is a Good Forecast? An Essay on the Nature of Goodness in Weather Forecasting , 1993 .

[9]  Guriĭ Ivanovich Marchuk,et al.  Numerical Methods in Weather Prediction , 1974 .

[10]  Jeffrey Dean,et al.  Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[11]  Vicente Julián,et al.  Rainfall Prediction: A Deep Learning Approach , 2016, HAIS.

[12]  Nitish Srivastava,et al.  Unsupervised Learning of Video Representations using LSTMs , 2015, ICML.

[13]  Xu Lai,et al.  Comparison between ARIMA and ANN Models Used in Short-Term Wind Speed Forecasting , 2011, 2011 Asia-Pacific Power and Energy Engineering Conference.

[14]  Dit-Yan Yeung,et al.  Convolutional LSTM Network: A Machine Learning Approach for Precipitation Nowcasting , 2015, NIPS.

[15]  Eric Horvitz,et al.  A Deep Hybrid Model for Weather Forecasting , 2015, KDD.

[16]  Licia Capra,et al.  Urban Computing: Concepts, Methodologies, and Applications , 2014, TIST.

[17]  Prashant J. Shenoy,et al.  Predicting solar generation from weather forecasts using machine learning , 2011, 2011 IEEE International Conference on Smart Grid Communications (SmartGridComm).

[18]  S Roberts,et al.  Gaussian processes for time-series modelling , 2013, Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences.

[19]  Yu Zheng,et al.  Deep Distributed Fusion Network for Air Quality Prediction , 2018, KDD.

[20]  Dit-Yan Yeung,et al.  Towards Bayesian Deep Learning: A Framework and Some Existing Methods , 2016, IEEE Transactions on Knowledge and Data Engineering.

[21]  Guangquan Zhang,et al.  Learning under Concept Drift: A Review , 2019, IEEE Transactions on Knowledge and Data Engineering.

[22]  Cyril Voyant,et al.  Numerical Weather Prediction (NWP) and hybrid ARMA/ANN model to predict global radiation , 2012, ArXiv.

[23]  Tianrui Li,et al.  ST-MVL: Filling Missing Values in Geo-Sensory Time Series Data , 2016, IJCAI.

[24]  Amir F. Atiya,et al.  Comprehensive Review of Neural Network-Based Prediction Intervals and New Advances , 2011, IEEE Transactions on Neural Networks.

[25]  David S. Matteson,et al.  Predicting Ambulance Demand: a Spatio-Temporal Kernel Approach , 2015, KDD.

[26]  Tianrui Li,et al.  Predicting Citywide Crowd Flows Using Deep Spatio-Temporal Residual Networks , 2017, Artif. Intell..

[27]  Mohamed Zaki,et al.  High-Quality Prediction Intervals for Deep Learning: A Distribution-Free, Ensembled Approach , 2018, ICML.