Data-driven modelling: some past experiences and new approaches

Physically based (process) models based on mathematical descriptions of water motion are widely used in river basin management. During the last decade the so-called data-driven models are becoming more and more common. These models rely upon the methods of computational intelligence and machine learning, and thus assume the presence of a considerable amount of data describing the modelled system9s physics (i.e. hydraulic and/or hydrologic phenomena). This paper is a preface to the special issue on Data Driven Modelling and Evolutionary Optimization for River Basin Management, and presents a brief overview of the most popular techniques and some of the experiences of the authors in data-driven modelling relevant to river basin management. It also identifies the current trends and common pitfalls, provides some examples of successful applications and mentions the research challenges.

[1]  R. S. Govindaraju,et al.  Artificial Neural Networks in Hydrology , 2010 .

[2]  Mitja Brilly,et al.  Precipitation Interception Modelling Using Machine Learning Methods – The Dragonja River Basin Case Study , 2009 .

[3]  John L. Kittle,et al.  BASINS: Better Assessment Science Integrating Point and Nonpoint Sources , 2009 .

[4]  Durga Lal Shrestha,et al.  Instance‐based learning compared to other data‐driven methods in hydrological forecasting , 2008 .

[5]  Orazio Giustolisi,et al.  An evolutionary multiobjective strategy for the effective management of groundwater resources , 2008 .

[6]  Robert J. Abrahart,et al.  Neural network modelling of non-linear hydrological relationships , 2007 .

[7]  William H. Press,et al.  Numerical Recipes 3rd Edition: The Art of Scientific Computing , 2007 .

[8]  Dimitri P. Solomatine,et al.  Baseflow separation techniques for modular artificial neural network modelling in flow forecasting , 2007 .

[9]  A. Heppenstall,et al.  Timing error correction procedure applied to neural network rainfall—runoff modelling , 2007 .

[10]  T. Hu,et al.  Rainfall–runoff modeling using principal component analysis and neural network , 2007 .

[11]  Robert J. Abrahart,et al.  Hydroinformatics: computational intelligence and technological developments in water science applications—Editorial , 2007 .

[12]  Orazio Giustolisi,et al.  A multi-model approach to analysis of environmental phenomena , 2007, Environ. Model. Softw..

[13]  Vladan Babovic,et al.  Ensemble modeling approach for rainfall/groundwater balancing , 2007 .

[14]  Simon Li,et al.  Uncertainties in real‐time flood forecasting with neural networks , 2007 .

[15]  Sobri Harun,et al.  Radial Basis Function Modeling of Hourly Streamflow Hydrograph , 2007 .

[16]  M. Kenward,et al.  An Introduction to the Bootstrap , 2007 .

[17]  William H. Press,et al.  Numerical recipes: the art of scientific computing, 3rd Edition , 2007 .

[18]  ENEL-CRIS Savizio Idrologico,et al.  A comparison of parametric and non-parametric methods for runoff forecasting , 2007 .

[19]  Delft Hydraulics,et al.  RIBASIM River Basin Planning and Management Simulation Program , 2006 .

[20]  D. Savić,et al.  A symbolic data-driven technique based on evolutionary polynomial regression , 2006 .

[21]  Durga L. Shrestha,et al.  Experiments with AdaBoost.RT, an Improved Boosting Scheme for Regression , 2006, Neural Computation.

[22]  Taesoon Kim,et al.  Multireservoir system optimization in the Han River basin using multi‐objective genetic algorithms , 2006 .

[23]  P. Gelder,et al.  Forecasting daily streamflow using hybrid ANN models , 2006 .

[24]  A. Preis,et al.  Kinneret watershed analysis tool: a cell-based decision tree model for watershed flow and pollutants predictions. , 2006, Water science and technology : a journal of the International Association on Water Pollution Research.

[25]  Durga L. Shrestha,et al.  Machine learning approaches for estimation of prediction interval for the model output , 2006, Neural Networks.

[26]  Dimitri P. Solomatine,et al.  Modular learning models in forecasting natural phenomena , 2006, Neural Networks.

[27]  Julio J. Valdés,et al.  Computational intelligence in earth sciences and environmental applications: Issues and challenges , 2006, Neural Networks.

[28]  Ashu Jain,et al.  Integrated approach to model decomposed flow hydrograph using artificial neural network and conceptual techniques , 2006 .

[29]  D. P. Solomatine Neural Network Approximation of a Hydrodynamic Model in Optimizing Reservoir Operation , 2006 .

[30]  R. Falconer,et al.  Environmental modelling in river basin management , 2005 .

[31]  Avi Ostfeld,et al.  A hybrid genetic—instance based learning algorithm for CE-QUAL-W2 calibration , 2005 .

[32]  Bernard De Baets,et al.  Comparison of data-driven TakagiSugeno models of rainfalldischarge dynamics , 2005 .

[33]  Holger R. Maier,et al.  Input determination for neural network models in water resources applications. Part 1—background and methodology , 2005 .

[34]  Dimitri P. Solomatine,et al.  Neural networks and M5 model trees in modelling water level-discharge relationship , 2005, Neurocomputing.

[35]  Dimitri P. Solomatine,et al.  M5 Model Trees and Neural Networks: Application to Flood Forecasting in the Upper Reach of the Huai River in China , 2004 .

[36]  Roland K. Price,et al.  Information Theory and Neural Networks for Managing Uncertainty in Flood Routing , 2004 .

[37]  Dawei Han,et al.  Identification of Support Vector Machines for Runoff Modelling , 2004 .

[38]  Misgana K. Muleta,et al.  Joint Application of Artificial Neural Networks and Evolutionary Algorithms to Watershed Management , 2004 .

[39]  V. Guinot,et al.  Treatment of precipitation uncertainty in rainfall-runoff modelling: a fuzzy set approach , 2004 .

[40]  Kuolin Hsu,et al.  Improved streamflow forecasting using self-organizing radial basis function artificial neural networks , 2004 .

[41]  Dimitri P. Solomatine,et al.  FLEXIBLE AND OPTIMAL M5 MODEL TREES WITH APPLICATIONS TO FLOW PREDICTIONS , 2004 .

[42]  A. Brath,et al.  A stochastic approach for assessing the uncertainty of rainfall‐runoff simulations , 2004 .

[43]  Vladan Babovic,et al.  Declarative and Preferential Bias in GP-based Scientific Discovery , 2002, Genetic Programming and Evolvable Machines.

[44]  J. Ross Quinlan,et al.  Induction of Decision Trees , 1986, Machine Learning.

[45]  Mehmet Emre Çek,et al.  Analysis of observed chaotic data , 2004 .

[46]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[47]  Eric Gaume,et al.  Over-parameterisation, a major obstacle to the use of artificial neural networks in hydrology? , 2003 .

[48]  Holger R. Maier,et al.  Data transformation for neural network models in water resources applications , 2003 .

[49]  D. Solomatine,et al.  Model trees as an alternative to neural networks in rainfall—runoff modelling , 2003 .

[50]  S. Jain,et al.  Radial Basis Function Neural Network for Modeling Rating Curves , 2003 .

[51]  Kuolin Hsu,et al.  Self‐organizing linear output map (SOLO): An artificial neural network suitable for hydrologic modeling and analysis , 2002 .

[52]  Vladan Babovic,et al.  Rainfall runoff modelling based on genetic programming , 2002 .

[53]  Shie-Yui Liong,et al.  Practical Inverse Approach for Forecasting Nonlinear Hydrological Time Series , 2002 .

[54]  Shie-Yui Liong,et al.  FLOOD STAGE FORECASTING WITH SUPPORT VECTOR MACHINES 1 , 2002 .

[55]  Holger R. Maier,et al.  Optimal division of data for neural network models in water resources applications , 2002 .

[56]  Asaad Y. Shamseldin,et al.  A non-linear neural network technique for updating of river flow forecasts , 2001 .

[57]  Demetris F. Lekkas,et al.  Improved non-linear transfer function and neural network methods of flow routing for real-time forecasting , 2001 .

[58]  Dimitri P. Solomatine,et al.  Model Induction with Support Vector Machines: Introduction and Applications , 2001 .

[59]  Asaad Y. Shamseldin,et al.  A non-linear combination of the forecasts of rainfall-runoff models by the first-order Takagi–Sugeno fuzzy system , 2001 .

[60]  Vladan Babovic,et al.  Neural networks as routine for error updating of numerical models , 2001 .

[61]  E. Toth,et al.  Comparison of short-term rainfall prediction models for real-time flood forecasting , 2000 .

[62]  David M. Hannah,et al.  Classification of river regimes: a context for hydroecology. , 2000 .

[63]  R. Abrahart,et al.  Comparing neural network and autoregressive moving average techniques for the provision of continuous river flow forecasts in two contrasting catchments , 2000 .

[64]  Ian Witten,et al.  Data Mining , 2000 .

[65]  Stan Openshaw,et al.  A hybrid multi-model approach to river level forecasting , 2000 .

[66]  Dimitri P. Solomatine,et al.  Application of adaptive fuzzy rule-based models for reconstruction of missing precipitation events , 2000 .

[67]  David M. Hannah,et al.  An approach to hydrograph classification , 2000 .

[68]  D. P. Solomatine,et al.  Chaos theory in predicting surge water levels in the North Sea , 2000 .

[69]  M. Keijzer,et al.  Genetic programming as a model induction engine , 2000 .

[70]  A. J. Abebe,et al.  Fuzzy alpha-cut vs . Monte Carlo techniques in assessing uncertainty in model parameters , 2000 .

[71]  Dimitri P. Solomatine,et al.  Application of Data Mining Techniques for Remote Sensing Image Analysis , 2000 .

[72]  A. W. Minns,et al.  The classification of hydrologically homogeneous regions , 1999 .

[73]  Dimitri P. Solomatine,et al.  On the encapsulation of numerical-hydraulic models in artificial neural network , 1999 .

[74]  Dimitri P. Solomatine,et al.  Control of water levels in polder areas using neural networks and fuzzy adaptive systems , 1999 .

[75]  Michael I. Jordan,et al.  Modular and hierarchical learning systems , 1998 .

[76]  Christian W. Dawson,et al.  An artificial neural network approach to rainfall-runoff modelling , 1998 .

[77]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[78]  Bart Kosko,et al.  Fuzzy Engineering , 1996 .

[79]  Geza Pesti,et al.  A fuzzy rule-based approach to drought assessment , 1996 .

[80]  M. J. Hall,et al.  Artificial neural networks as rainfall-runoff models , 1996 .

[81]  Asaad Y. Shamseldin,et al.  A nearest neighbour linear perturbation model for river flow forecasting , 1996 .

[82]  Yoav Freund,et al.  A decision-theoretic generalization of on-line learning and an application to boosting , 1997, EuroCOLT.

[83]  Kuolin Hsu,et al.  Artificial Neural Network Modeling of the Rainfall‐Runoff Process , 1995 .

[84]  Lucien Duckstein,et al.  Fuzzy Rule-Based Modeling with Applications to Geophysical, Biological and Engineering Systems , 1995 .

[85]  Michael A. Arbib,et al.  The handbook of brain theory and neural networks , 1995, A Bradford book.

[86]  Paul J. Werbos,et al.  The roots of backpropagation , 1994 .

[87]  J. R. Quinlan Learning With Continuous Classes , 1992 .

[88]  M. Abbott Hydroinformatics: Information Technology and the Aquatic Environment , 1991 .

[89]  S. Yakowitz,et al.  Nearest‐neighbor methods for nonparametric rainfall‐runoff forecasting , 1987 .

[90]  Douglas A. Haith,et al.  GENERALIZED WATERSHED LOADING FUNCTIONS FOR STREAM FLOW NUTRIENTS , 1987 .

[91]  Zbigniew W. Kundzewicz,et al.  Nonlinear flood routing with multilinear models , 1987 .

[92]  M. H. Diskin,et al.  Application of a Cell Model to the Bellebeek Watershed , 1984 .

[93]  M. H. Diskin,et al.  Determination of an optimal IUH for linear, time invariant systems from multi-storm records , 1975 .

[94]  P. S. Eagleson,et al.  Computation of optimum realizable unit hydrographs , 1966 .

[95]  Lotfi A. Zadeh,et al.  Fuzzy Sets , 1996, Inf. Control..

[96]  M. H. Diskin,et al.  A Basic Study of the Linearity of the Rainfall-Runoff Process in Watersheds , 1964 .

[97]  J. Dooge A general theory of the unit hydrograph , 1959 .

[98]  J. E. Nash,et al.  The form of the instantaneous unit hydrograph , 1957 .

[99]  L. K. Sherman Streamflow from rainfall by the unit-graph method , 1932 .