Machine Learning: An Efficient Alternative to the Variable Infiltration Capacity Model for an Accurate Simulation of Runoff Rates

The present study aims to investigate the performance of the artificial intelligence to emulate the conventional physically-based hydrological models. Although these conventional models could accurately depict the underlying physical processes, but they require a lengthy preprocessing phase as well as a tedious calibration time. Therefore, a need to examine the potential efficient alternative for these models is highly felt. This need becomes imperative once we adopt fine temporal and spatial resolutions for our hydrological modeling, leading to a massive number of to-be-analyzed cells. To this end, we propose a learning framework towards an accurate prediction of runoff rates using meteorological variables, and hence, mimicking the Variable Infiltration Capacity (VIC) by a nimble systematized predictive model. We also present a novel strategy to optimally select the most informative subset of data to train our predictive model, out of the pool of accessible data. This strategy would then considerably enhance the performance of our prediction in terms of computation time. We reported our result as the Pearson correlation coefficient between the predicted and actual runoff rates. Our predictive model was able to forecast the runoff rates with the mean correlation coefficient of 0.9007 for the cells within the study basin.

[1]  Eric A. Rosenberg,et al.  A Long-Term Hydrologically Based Dataset of Land Surface Fluxes and States for the Conterminous United States: Update and Extensions* , 2002 .

[2]  David A. Landgrebe,et al.  A survey of decision tree classifier methodology , 1991, IEEE Trans. Syst. Man Cybern..

[3]  Vijay P. Singh,et al.  ANN and Fuzzy Logic Models for Simulating Event-Based Rainfall-Runoff , 2006 .

[4]  Jian Wu,et al.  An Active Learning Approach with Uncertainty, Representativeness, and Diversity , 2014, TheScientificWorldJournal.

[5]  J. Adamowski,et al.  Multi-step streamflow forecasting using data-driven non-linear methods in contrasting climate regimes , 2014 .

[6]  Bernhard Schölkopf,et al.  A tutorial on support vector regression , 2004, Stat. Comput..

[7]  S. Bratton,et al.  Urbanization is a major influence on microplastic ingestion by sunfish in the Brazos River Basin, Central Texas, USA. , 2016, Environmental pollution.

[8]  Farid Melgani,et al.  SVR active learning for product quality control , 2012, 2012 11th International Conference on Information Science, Signal Processing and their Applications (ISSPA).

[9]  MohammadSajjad Khan,et al.  Application of Support Vector Machine in Lake Water Level Prediction , 2006 .

[10]  Xiao Yang,et al.  Prolongation of SMAP to Spatiotemporally Seamless Coverage of Continental U.S. Using a Deep Learning Neural Network , 2017, 1707.06611.

[11]  Gavin Brown,et al.  Ensemble Learning , 2010, Encyclopedia of Machine Learning and Data Mining.

[12]  Thomas G. Dietterich Overfitting and undercomputing in machine learning , 1995, CSUR.

[13]  J. Valdes,et al.  Frequency and Spatial Characteristics of Droughts in the Conchos River Basin, Mexico , 2002 .

[14]  Andy Liaw,et al.  Classification and Regression by randomForest , 2007 .

[15]  Andrew McCallum,et al.  Employing EM and Pool-Based Active Learning for Text Classification , 1998, ICML.

[16]  Christian W. Dawson,et al.  An artificial neural network approach to rainfall-runoff modelling , 1998 .

[17]  Robert K. Nelson,et al.  Interpreting comprehensive two-dimensional gas chromatography using peak topography maps with application to petroleum forensics , 2016, Chemistry Central Journal.

[18]  D. Lettenmaier,et al.  A Long-Term Hydrologically Based Dataset of Land Surface Fluxes and States for the Conterminous United States* , 2002 .

[19]  Faisal Zaman,et al.  A rainfall forecasting method using machine learning models and its application to the Fukuoka city case , 2012, Int. J. Appl. Math. Comput. Sci..