Meteorological drought forecasting based on a statistical model with machine learning techniques in Shaanxi province, China.

BACKGROUND Drought is a major natural disaster that causes severe social and economic losses. The prediction of regional droughts may provide important information for drought preparedness and farm irrigation. The existing drought prediction models are mainly based on a single weather station. Efforts need to be taken to develop a new multistation-based prediction model. OBJECTIVES This study optimizes the predictor selection process and develops a new model to predict droughts using past drought index, meteorological measures and climate signals from 32 stations during 1961 to 2016 in Shaanxi province, China. METHODS We applied and compared two methods, including a cross-correlation function and a distributed lag nonlinear model (DLNM), in selecting the optimal predictors and specifying their lag time. Then, we built a DLNM, an artificial neural network model and an XGBoost model and compared their validations for predicting the Standardized Precipitation Evapotranspiration Index (SPEI) 1-6 months in advance. RESULTS The DLNM was better than the cross-correlation function in predictor selection and lag effect determination. The XGBoost model more accurately predicted SPEI with a lead time of 1-6 months than the DLNM and the artificial neural network, with cross-validation R2 values of 0.68-0.82, 0.72-0.89, 0.81-0.92, and 0.84-0.95 at 3-, 6-, 9- and 12-month time scales, respectively. Moreover, the XGBoost model had the highest prediction accuracy for overall droughts (89%-97%) and for three specific drought categories (i.e., moderate, severe, and extreme) (76%-94%). CONCLUSION This study offers a new modeling strategy for drought predictions based on multistation data. The incorporation of nonlinear and lag effects of predictors into the XGBoost method can significantly improve prediction accuracy of SPEI and drought.

[1]  J. Hao The Interrelationship between Regional Westerly Index and the Water Vapor Budget in Northwest China , 2008 .

[2]  A. Dai Drought under global warming: a review , 2011 .

[3]  G. Blumenstock Drought in the United States Analyzed by Means of the Theory of Probability , 1942 .

[4]  I. Guyon,et al.  The Higgs Machine Learning Challenge , 2015 .

[5]  V. Singh,et al.  Drought Forecasting Using a Hybrid Stochastic and Neural Network Model , 2007 .

[6]  Weihong Qian,et al.  Climatic regime shift and decadal anomalous events in China , 2007 .

[7]  James D. Brown,et al.  The Science of NOAA's Operational Hydrologic Ensemble Forecast Service , 2014 .

[8]  S. Hyakin,et al.  Neural Networks: A Comprehensive Foundation , 1994 .

[9]  Chris Funk,et al.  Intra-seasonal NDVI change projections in semi-arid Africa , 2006 .

[10]  Michelle Livne,et al.  Boosted Tree Model Reforms Multimodal Magnetic Resonance Imaging Infarct Prediction in Acute Stroke , 2018, Stroke.

[11]  Yaning Chen,et al.  Spatial and temporal variability of drought in the arid region of China and its relationships to teleconnection indices , 2015 .

[12]  A. Dai Erratum: Increasing drought under global warming in observations and models , 2013 .

[13]  V. Mishra,et al.  Prediction of vegetation anomalies to improve food security and water management in India , 2015 .

[14]  Chunyu Dong,et al.  Temporal dynamics and spatial patterns of drought and the relation to ENSO: a case study in Northwest China , 2016 .

[15]  K. Stahl,et al.  Impacts of European drought events: insights from an international database of text-based reports , 2015 .

[16]  P. Eliasson,et al.  Drought monitoring and analysis: Standardised Precipitation Evapotranspiration Index (SPEI) and Standardised Precipitation Index (SPI) , 2018, Physics and Chemistry of the Earth, Parts A/B/C.

[17]  Dimitri P. Solomatine,et al.  Meteorological Drought Forecasting Based on Climate Signals Using Artificial Neural Network – A Case Study in Khanhhoa Province Vietnam , 2016 .

[18]  J. K. Mcguire,et al.  THE 1957 DROUGHT IN THE EASTERN UNITED STATES , 1957 .

[19]  Marvin E. Jensen,et al.  Evapotranspiration and irrigation water requirements : a manual , 1990 .

[20]  Xing Yuan,et al.  Understanding and seasonal forecasting of hydrological drought in the Anthropocene , 2016 .

[21]  Sadaf Shamsuddin,et al.  A Novel Multi-Scalar Drought Index for Monitoring Drought: the Standardized Precipitation Temperature Index , 2017, Water Resources Management.

[22]  Chia-Ching Chou,et al.  Risk-based drought early warning system in reservoir operation , 2008 .

[23]  Wang Zhiwei Climate Change in Drought over Northern China during 1950-2000 , 2003 .

[24]  Florinda Matos,et al.  Leveraging intellectual capital through Lewin's Force Field Analysis: the case of software development companies , 2017 .

[25]  Jianqi Sun,et al.  Changes in Drought Characteristics over China Using the Standardized Precipitation Evapotranspiration Index , 2015 .

[26]  Francisco José Climent Diranzo,et al.  Predicting failure in the U.S. banking sector: An extreme gradient boosting approach , 2019, International Review of Economics & Finance.

[27]  Junqiang Yao,et al.  Spatial-temporal variation and impacts of drought in Xinjiang (Northwest China) during 1961–2015 , 2018, PeerJ.

[28]  S. Vicente‐Serrano,et al.  A Multiscalar Drought Index Sensitive to Global Warming: The Standardized Precipitation Evapotranspiration Index , 2009 .

[29]  Ximing Cai,et al.  Assessing the value of seasonal climate forecast information through an end‐to‐end forecasting framework: Application to U.S. 2012 drought in central Illinois , 2014 .

[30]  Antonio Gasparrini,et al.  Distributed Lag Linear and Non-Linear Models in R: The Package dlnm. , 2011, Journal of statistical software.

[31]  Ravinesh C. Deo,et al.  Input selection and data-driven model performance optimization to predict the Standardized Precipitation and Evaporation Index in a drought-prone region , 2018, Atmospheric Research.

[32]  S. Vicente‐Serrano,et al.  Standardized precipitation evapotranspiration index (SPEI) revisited: parameter fitting, evapotranspiration models, tools, datasets and drought monitoring , 2014 .

[33]  S. Morid,et al.  Drought forecasting using artificial neural networks and time series of drought indices , 2007 .

[34]  Tormod Næs,et al.  Understanding the collinearity problem in regression and discriminant analysis , 2001 .

[35]  E. Kang,et al.  Recent and Future Climate Change in Northwest China , 2007 .

[36]  U. Somorowska Changes in Drought Conditions in Poland over the Past 60 Years Evaluated by the Standardized Precipitation-Evapotranspiration Index , 2016, Acta Geophysica.

[37]  V. Singh,et al.  Spatiotemporal behavior of floods and droughts and their impacts on agriculture in China , 2015 .

[38]  Jiancang Xie,et al.  Use of four drought indices for evaluating drought characteristics under climate change in Shaanxi, China: 1951–2012 , 2015, Natural Hazards.

[39]  Q. Duan,et al.  Seasonal drought predictability and forecast skill in the semi-arid endorheic Heihe River basin in northwestern China , 2018, Hydrology and Earth System Sciences.

[40]  Hiroshi Oyama,et al.  Using machine-learning approaches to predict non-participation in a nationwide general health check-up scheme , 2018, Comput. Methods Programs Biomed..

[41]  Jin‐Yi Yu,et al.  Decadal changes of ENSO persistence barrier in SST and ocean heat content indices: 1958–2001 , 2007 .

[42]  B. Lyon,et al.  The Drought and Humanitarian Crisis in Central and Southwest Asia: A Climate Perspective , 2001 .

[43]  J. Friedman Greedy function approximation: A gradient boosting machine. , 2001 .

[44]  A. Dai Characteristics and trends in various forms of the Palmer Drought Severity Index during 1900–2008 , 2011 .

[45]  J. Im,et al.  Meteorological drought forecasting for ungauged areas based on machine learning: Using long-range climate forecast and remote sensing data , 2017 .

[46]  A Gasparrini,et al.  Distributed lag non-linear models , 2010, Statistics in medicine.

[47]  Chris Edwards,et al.  Growing pains for deep learning , 2015, Commun. ACM.

[48]  C. W. Thornthwaite An approach toward a rational classification of climate. , 1948 .

[49]  Dmitry Vedenov,et al.  Information Value of Climate Forecasts for Rainfall Index Insurance for Pasture, Rangeland, and Forage in the Southeast United States , 2013, Journal of Agricultural and Applied Economics.

[50]  Pengli Ma,et al.  The spatial distribution characteristics of a comprehensive drought risk index in southwestern China and underlying causes , 2016, Theoretical and Applied Climatology.

[51]  K. Stahl,et al.  A quantitative analysis to objectively appraise drought indicators and model drought impacts , 2015 .

[52]  Kei-Hoi Cheung,et al.  Predicting urinary tract infections in the emergency department with machine learning , 2018, PloS one.

[53]  S. Wold,et al.  The Collinearity Problem in Linear Regression. The Partial Least Squares (PLS) Approach to Generalized Inverses , 1984 .

[54]  Ali Behrangi,et al.  Probabilistic Seasonal Prediction of Meteorological Drought Using the Bootstrap and Multivariate Information , 2015 .

[55]  Junliang Fan,et al.  Comparison of Support Vector Machine and Extreme Gradient Boosting for predicting daily global solar radiation using temperature and precipitation in humid subtropical climates: A case study in China , 2018 .

[56]  Guoqing Wang,et al.  Agricultural drought prediction using climate indices based on Support Vector Regression in Xiangjiang River basin. , 2018, The Science of the total environment.

[57]  A. Mishra,et al.  Drought forecasting using feed-forward recursive neural network , 2006 .

[58]  Tianqi Chen,et al.  XGBoost: A Scalable Tree Boosting System , 2016, KDD.

[59]  Faisal Saeed,et al.  Bioactive Molecule Prediction Using Extreme Gradient Boosting , 2016, Molecules.

[60]  Vijay P. Singh,et al.  Probabilistic prediction of hydrologic drought using a conditional probability approach based on the meta-Gaussian model , 2016 .

[61]  Ö. F. Durdu Application of linear stochastic models for drought forecasting in the Büyük Menderes river basin, western Turkey , 2010 .

[62]  D. Wilhite,et al.  Monitoring the 1996 Drought Using the Standardized Precipitation Index , 1999 .

[63]  G. Demisse,et al.  Satellite‐based hybrid drought monitoring tool for prediction of vegetation condition in Eastern Africa: A case study for Ethiopia , 2014 .

[64]  V. Singh,et al.  Drought forecasting in eastern Australia using multivariate adaptive regression spline, least square support vector machine and M5Tree model , 2017 .

[65]  Max Kuhn,et al.  caret: Classification and Regression Training , 2015 .