Improving Subseasonal Forecasting in the Western U.S. with Machine Learning

Water managers in the western United States (U.S.) rely on longterm forecasts of temperature and precipitation to prepare for droughts and other wet weather extremes. To improve the accuracy of these longterm forecasts, the U.S. Bureau of Reclamation and the National Oceanic and Atmospheric Administration (NOAA) launched the Subseasonal Climate Forecast Rodeo, a year-long real-time forecasting challenge in which participants aimed to skillfully predict temperature and precipitation in the western U.S. two to four weeks and four to six weeks in advance. Here we present and evaluate our machine learning approach to the Rodeo and release our SubseasonalRodeo dataset, collected to train and evaluate our forecasting system. Our system is an ensemble of two nonlinear regression models. The first integrates the diverse collection of meteorological measurements and dynamic model forecasts in the SubseasonalRodeo dataset and prunes irrelevant predictors using a customized multitask feature selection procedure. The second uses only historical measurements of the target variable (temperature or precipitation) and introduces multitask nearest neighbor features into a weighted local linear regression. Each model alone is significantly more accurate than the debiased operational U.S. Climate Forecasting System (CFSv2), and our ensemble skill exceeds that of the top Rodeo competitor for each target variable and forecast horizon. Moreover, over 2011-2018, an ensemble of our regression models and debiased CFSv2 improves debiased CFSv2 skill by 40-50% for temperature and 129-169% for precipitation. We hope that both our dataset and our methods will help to advance the state of the art in subseasonal forecasting.

[1]  E. Lorenz Deterministic nonperiodic flow , 1963 .

[2]  W L Gates,et al.  A New (Revised) Tabulation of the Scripps Topography on a 1 Degree Global Grid , 1975 .

[3]  R. Reynolds,et al.  The NCEP/NCAR 40-Year Reanalysis Project , 1996, Renewable Energy.

[4]  F. Nebeker Calculating the weather : meteorology in the 20th century , 1997 .

[5]  K. Wolter,et al.  Measuring the strength of ENSO events: How does 1997/98 rank? , 1998 .

[6]  A. Barros,et al.  Localized Precipitation Forecasts from a Numerical Weather Prediction Model Using Artificial Neural Networks , 1998 .

[7]  José Manuel Gutiérrez,et al.  Bayesian Networks for Probabilistic Weather Prediction , 2002, ECAI.

[8]  M. Wheeler,et al.  An All-Season Real-Time Multivariate MJO Index: Development of an Index for Monitoring and Prediction , 2004 .

[9]  B. Rudolf,et al.  World Map of the Köppen-Geiger climate classification updated , 2006 .

[10]  John D. Hunter,et al.  Matplotlib: A 2D Graphics Environment , 2007, Computing in Science & Engineering.

[11]  Thomas M. Smith,et al.  Daily High-Resolution-Blended Analyses for Sea Surface Temperature , 2007 .

[12]  T. McMahon,et al.  Updated world map of the Köppen-Geiger climate classification , 2007 .

[13]  Illia Horenko,et al.  Automated Generation of Reduced Stochastic Weather Models I: Simultaneous Dimension and Model Reduction for Time Series Analysis , 2008, Multiscale Model. Simul..

[14]  H. V. D. Dool,et al.  A global monthly land surface air temperature analysis for 1948-present , 2008 .

[15]  Y. Radhika,et al.  Atmospheric Temperature Prediction using Support Vector Machines , 2009 .

[16]  Wes McKinney,et al.  Data Structures for Statistical Computing in Python , 2010, SciPy.

[17]  A. Robertson,et al.  Subseasonal to Seasonal Prediction Project: bridging the gap between weather and climate , 2012 .

[18]  A. Barnston,et al.  Skill of Real-Time Seasonal ENSO Model Predictions During 2002–11: Is Our Capability Increasing? , 2012 .

[19]  John K. Williams,et al.  Enhancing understanding and improving prediction of severe weather through spatiotemporal relational learning , 2013, Machine Learning.

[20]  A. Barnston,et al.  The North American multimodel ensemble: Phase-1 seasonal-to-interannual prediction; phase-2 toward developing intraseasonal prediction , 2014 .

[21]  S. Guikema,et al.  Application of Statistical Models to the Prediction of Seasonal Rainfall Anomalies over the Sahel , 2014 .

[22]  M. Iredell,et al.  The NCEP Climate Forecast System Version 2 , 2014 .

[23]  Eric Horvitz,et al.  A Deep Hybrid Model for Weather Forecasting , 2015, KDD.

[24]  Arun Kumar,et al.  Improving and Promoting Subseasonal to Seasonal Prediction , 2015 .

[25]  Ehud Strobach,et al.  Decadal climate predictions using sequential learning algorithms , 2015, 1509.05285.

[26]  Dit-Yan Yeung,et al.  Convolutional LSTM Network: A Machine Learning Approach for Precipitation Nowcasting , 2015, NIPS.

[27]  Johan A. K. Suykens,et al.  Spatio-temporal feature selection for black-box weather forecasting , 2016, ESANN.

[28]  D. Vimont,et al.  Utilizing the state of ENSO as a means for season‐ahead predictor selection , 2016 .

[29]  Prabhat,et al.  Application of Deep Convolutional Neural Networks for Detecting Extreme Weather in Climate Datasets , 2016, ArXiv.

[30]  Vicente Julián,et al.  Rainfall Prediction: A Deep Learning Approach , 2016, HAIS.

[31]  Borhan Molazem Sanandaji,et al.  Deep Forecast: Deep Learning-based Spatio-Temporal Forecasting , 2017, ArXiv.

[32]  R. Webb,et al.  Sub-Seasonal Climate Forecast Rodeo , 2017 .

[33]  Andrew P. Morse,et al.  Potential applications of subseasonal‐to‐seasonal (S2S) predictions , 2017 .

[34]  E. Tziperman,et al.  Winter Precipitation Forecast in the European and Mediterranean Regions Using Cluster Analysis , 2017 .

[35]  Ke Zhang,et al.  A Short-Term Rainfall Prediction Model Using Multi-task Convolutional Neural Networks , 2017, 2017 IEEE International Conference on Data Mining (ICDM).

[36]  Prabhat,et al.  ExtremeWeather: A large-scale climate dataset for semi-supervised detection, localization, and understanding of extreme weather events , 2016, NIPS.

[37]  T. N. Krishnamurti,et al.  Improvements in Hurricane Intensity Forecasts from a Multimodel Superensemble Utilizing a Generalized Neural Network Technique , 2018 .

[38]  Lester Mackey,et al.  S2S reboot: An argument for greater inclusion of machine learning in subseasonal to seasonal forecasts , 2018, WIREs Climate Change.

[39]  Gregory R. Herman,et al.  “Dendrology” in Numerical Weather Prediction: What Random Forests and Logistic Regression Tell Us about Forecasting Extreme Precipitation , 2018, Monthly Weather Review.

[40]  Kristin M. Calhoun,et al.  Development of a Human–Machine Mix for Forecasting Severe Convective Events , 2018 .