Product-Units neural networks for catchment runoff forecasting

Abstract In this paper Product-Units neural networks (PUNNs), which probably have never been used within the field of hydrology, are introduced and applied for catchment runoff forecasting in cold climate zones. This type of neural networks, a subclass of higher order neural networks uses product nodes with inputs raised to exponential weights in one layer and well-known summation nodes in another layer. The present paper empirically shows that PUNNs with unbounded weights are difficult to train and do not perform well for catchment runoff forecasting. However, a very good predictive performance may be achieved when the weights are bounded within [−1, 1] interval. Several variants of optimization methods, mostly Differential Evolution-based algorithms, and a few approaches enabling good generalization capabilities of neural networks are compared in order to select the appropriate technique for PUNNs training. PUNNs with parameters bounded within [−1, 1] interval are shown to outperform Multi-Layer Perceptron neural networks and HBV conceptual model for runoff forecasting case study at Annapolis River, Nova Scotia, Canada. Gradient-based Levenberg–Marquardt algorithm and Evolutionary Computation-based Differential Evolution with Global and Local Neighborhood method turn out to be the most successful among the tested training algorithms. Surprisingly, in the case of Product-Units neural networks with weights bounded within [−1, 1] interval using noise injection or early stopping do not improve the results obtained when no method to avoid overfitting is used.

[1]  C. Lee Giles,et al.  Learning with Product Units , 1994, NIPS.

[2]  Liang Gao,et al.  A differential evolution algorithm with self-adapting strategy and control parameters , 2011, Comput. Oper. Res..

[3]  Emile Fiesler,et al.  High-order and multilayer perceptron initialization , 1997, IEEE Trans. Neural Networks.

[4]  César Hervás-Martínez,et al.  Evolutionary product unit based neural networks for regression , 2006, Neural Networks.

[5]  Adam P. Piotrowski,et al.  Comparison of evolutionary computation techniques for noise injected neural network training to estimate longitudinal dispersion coefficients in rivers , 2012, Expert Syst. Appl..

[6]  J. Parajka,et al.  Scale effects in conceptual hydrological modeling , 2009 .

[7]  Juan C. Valle-Lisboa,et al.  Elman topology with sigma-pi units: An application to the modeling of verbal hallucinations in schizophrenia , 2005, Neural Networks.

[8]  Zixiang Xiong,et al.  Noise-injected neural networks show promise for use on small-sample expression data , 2006, BMC Bioinformatics.

[9]  Lutz Prechelt,et al.  Automatic early stopping using cross validation: quantifying the criteria , 1998, Neural Networks.

[10]  Avi Ostfeld,et al.  Data-driven modelling: some past experiences and new approaches , 2008 .

[11]  P. N. Suganthan,et al.  Differential Evolution: A Survey of the State-of-the-Art , 2011, IEEE Transactions on Evolutionary Computation.

[12]  César Hervás-Martínez,et al.  Income prediction in the agrarian sector using product unit neural networks , 2010, Eur. J. Oper. Res..

[13]  Theodore G. Cleveland,et al.  Time of Concentration Estimated Using Watershed Parameters Determined by Automated and Manual Methods , 2008 .

[14]  P. N. Suganthan,et al.  Differential Evolution Algorithm With Strategy Adaptation for Global Numerical Optimization , 2009, IEEE Transactions on Evolutionary Computation.

[15]  David E. Rumelhart,et al.  Product Units: A Computationally Powerful and Biologically Plausible Extension to Backpropagation Networks , 1989, Neural Computation.

[16]  Goutami Chattopadhyay,et al.  Identification of the best architecture of a multilayer perceptron in modeling daily total ozone concentration over Kolkata, India , 2011 .

[17]  David Naso,et al.  Compact Differential Evolution , 2011, IEEE Transactions on Evolutionary Computation.

[18]  Pascal Bouvry,et al.  Improving Classical and Decentralized Differential Evolution With New Mutation Operator and Population Topologies , 2011, IEEE Transactions on Evolutionary Computation.

[19]  Michael G. Epitropakis,et al.  Hardware-friendly Higher-Order Neural Network Training using Distributed Evolutionary Algorithms , 2010, Appl. Soft Comput..

[20]  C. Perrin,et al.  Improvement of a parsimonious model for streamflow simulation , 2003 .

[21]  András Bárdossy,et al.  Calibration of hydrological models on hydrologically unusual events , 2012 .

[22]  Yakov Ben-Haim,et al.  Evaluation of Neural Network Robust Reliability Using Information-Gap Theory , 2006, IEEE Transactions on Neural Networks.

[23]  François Anctil,et al.  Added gains of soil moisture content observations for streamflow predictions using neural networks , 2008 .

[24]  Chaopeng Shen,et al.  A process-based, distributed hydrologic model based on a large-scale method for surface–subsurface coupling , 2010 .

[25]  Ercan Kahya,et al.  Flow forecast by SWAT model and ANN in Pracana basin, Portugal , 2009, Adv. Eng. Softw..

[26]  Adam P. Piotrowski,et al.  The grouping differential evolution algorithm for multi-dimensional optimization problems , 2010 .

[27]  Linda See,et al.  A conceptual and neural network model for real-time flood forecasting of the Tiber River in Rome , 2010 .

[28]  H. Wheater,et al.  Evaluation of precipitation products over complex mountainous terrain: A water resources perspective , 2011 .

[29]  François Anctil,et al.  Evaluation of Neural Network Streamflow Forecasting on 47 Watersheds , 2005 .

[30]  Joydeep Ghosh,et al.  The pi-sigma network: an efficient higher-order neural network for pattern classification and function approximation , 1991, IJCNN-91-Seattle International Joint Conference on Neural Networks.

[31]  Dimitri Solomatine,et al.  Experimental investigation of the predictive capabilities of data driven modeling techniques in hydrology - Part 1: Concepts and methodology , 2009 .

[32]  Dimitri Solomatine,et al.  Experimental investigation of the predictive capabilities of data driven modeling techniques in hydrology - Part 2: Application , 2009 .

[33]  Wei Wu,et al.  Convergence Analysis of Batch Gradient Algorithm for Three Classes of Sigma-Pi Neural Networks , 2007, Neural Processing Letters.

[34]  K. Beven,et al.  A physically based, variable contributing area model of basin hydrology , 1979 .

[35]  S. Hyakin,et al.  Neural Networks: A Comprehensive Foundation , 1994 .

[36]  R. Storn,et al.  Differential Evolution - A simple and efficient adaptive scheme for global optimization over continuous spaces , 2004 .

[37]  L. A. Schmit,et al.  Advances in the integrated approach to structural synthesis. , 1966 .

[38]  Adam P. Piotrowski,et al.  Evaluation of temporal concentration profiles for ungauged rivers following pollution incidents , 2011 .

[39]  Isabelle Guyon,et al.  High-order neural networks: information storage without errors , 1987 .

[40]  Richard Neville,et al.  Third-order generalization: A new approach to categorizing higher-order generalization , 2008, Neurocomputing.

[41]  Yves Grandvalet,et al.  Noise Injection: Theoretical Prospects , 1997, Neural Computation.

[42]  P. Kitanidis,et al.  Real‐time forecasting with a conceptual hydrologic model: 2. Applications and results , 1980 .

[43]  Amit Konar,et al.  Particle Swarm Optimization and Differential Evolution Algorithms: Technical Analysis, Applications and Hybridization Perspectives , 2008, Advances of Computational Intelligence in Industrial Systems.

[44]  Arthur C. Sanderson,et al.  JADE: Adaptive Differential Evolution With Optional External Archive , 2009, IEEE Transactions on Evolutionary Computation.

[45]  Andries Petrus Engelbrecht,et al.  Global optimization algorithms for training product unit neural networks , 2000, Proceedings of the IEEE-INNS-ENNS International Joint Conference on Neural Networks. IJCNN 2000. Neural Computing: New Challenges and Perspectives for the New Millennium.

[46]  Nader Nariman-zadeh,et al.  Evolutionary design of generalized GMDH-type neural network for prediction of concrete compressive strength using UPV , 2010 .

[47]  C. Zhang,et al.  Convergence of BP algorithm for product unit neural networks with exponential weights , 2008, Neurocomputing.

[48]  K. P. Sudheer,et al.  Methods used for the development of neural networks for the prediction of water resource variables in river systems: Current status and future directions , 2010, Environ. Model. Softw..

[49]  Pedro Antonio Gutiérrez,et al.  Combining classification and regression approaches for the quantification of highly overlapping capillary electrophoresis peaks by using evolutionary sigmoidal and product unit neural networks , 2007 .

[50]  Jan Adamowski,et al.  Comparison of Multivariate Regression and Artificial Neural Networks for Peak Urban Water-Demand Forecasting: Evaluation of Different ANN Learning Algorithms , 2010 .

[51]  Sung-Kwun Oh,et al.  Polynomial neural networks architecture: analysis and design , 2003, Comput. Electr. Eng..

[52]  Marshall R. Mayberry,et al.  Learning to Attend: A Connectionist Model of Situated Language Comprehension , 2009, Cogn. Sci..

[53]  Ville Tirronen,et al.  Recent advances in differential evolution: a survey and experimental analysis , 2010, Artificial Intelligence Review.

[54]  Shang-Jeng Tsai,et al.  Efficient Population Utilization Strategy for Particle Swarm Optimizer , 2009, IEEE Trans. Syst. Man Cybern. Part B.

[55]  Kevin Kok Wai Wong,et al.  Classification of adaptive memetic algorithms: a comparative study , 2006, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[56]  R. K. Price,et al.  Combining semi-distributed process-based and data-driven models in flow simulation: A case study of the Meuse river basin , 2009 .

[57]  Geoffrey E. Hinton,et al.  Simplifying Neural Networks by Soft Weight-Sharing , 1992, Neural Computation.

[58]  James F. Frenzel,et al.  Training product unit neural networks with genetic algorithms , 1993, IEEE Expert.

[59]  Adam Kiczko,et al.  Differential Evolution algorithm with Separated Groups for multi-dimensional optimization problems , 2012, Eur. J. Oper. Res..

[60]  V. Singh,et al.  Computer Models of Watershed Hydrology , 1995 .

[61]  Jens Christian Refsgaard,et al.  Validation and Intercomparison of Different Updating Procedures for Real-Time Forecasting , 1997 .

[62]  Burkhard Lenze,et al.  Note on interpolation on the hypercube by means of sigma-pi neural networks , 2004, Neurocomputing.

[63]  C. W. Thornthwaite An Approach Toward a Rational Classification of Climate , 1948 .

[64]  J. Napiorkowski,et al.  Application of Volterra series to modelling of rainfall-runoff systems and flow in open channels , 1986 .

[65]  P. E. O'connell,et al.  An introduction to the European Hydrological System — Systeme Hydrologique Europeen, “SHE”, 2: Structure of a physically-based, distributed modelling system , 1986 .

[66]  Robert J. Marks,et al.  Similarities of error regularization, sigmoid gain scaling, target smoothing, and training with jitter , 1995, IEEE Trans. Neural Networks.

[67]  Rainer Storn,et al.  Differential Evolution – A Simple and Efficient Heuristic for global Optimization over Continuous Spaces , 1997, J. Glob. Optim..

[68]  Peter L. Bartlett,et al.  The Sample Complexity of Pattern Classification with Neural Networks: The Size of the Weights is More Important than the Size of the Network , 1998, IEEE Trans. Inf. Theory.

[69]  Madan M. Gupta,et al.  River-Flow Forecasting Using Higher-Order Neural Networks , 2012 .

[70]  Martin Mandischer A comparison of evolution strategies and backpropagation for neural network training , 2002, Neurocomputing.

[71]  Qingxiang Wu,et al.  Learning under weight constraints in networks of temporal encoding spiking neurons , 2006, Neurocomputing.

[72]  Göran Lindström,et al.  A Simple Automatic Calibration Routine for the HBV Model , 1997 .

[73]  Abir Jaafar Hussain,et al.  Non-stationary and stationary prediction of financial time series using dynamic ridge polynomial neural network , 2009, Neurocomputing.

[74]  P. Coulibaly,et al.  TDNN with logical values for hydrologic modeling in a cold and snowy climate , 2008 .

[75]  Amin Elshorbagy,et al.  On the relevance of using artificial neural networks for estimating soil moisture content , 2008 .

[76]  Michael Schmitt,et al.  On the Complexity of Computing and Learning with Multiplicative Neural Networks , 2002, Neural Computation.

[77]  Colin Giles,et al.  Learning, invariance, and generalization in high-order neural networks. , 1987, Applied optics.

[78]  Geoffrey E. Hinton,et al.  A general framework for parallel distributed processing , 1986 .

[79]  Adam P. Piotrowski,et al.  Optimizing neural networks for river flow forecasting – Evolutionary Computation methods versus the Levenberg–Marquardt approach , 2011 .

[80]  Amit Konar,et al.  Differential Evolution Using a Neighborhood-Based Mutation Operator , 2009, IEEE Transactions on Evolutionary Computation.

[81]  Devendra K. Chaturvedi,et al.  Improved generalized neuron model for short-term load forecasting , 2003, Soft Comput..

[82]  François Anctil,et al.  Impact of the length of observed records on the performance of ANN and of conceptual parsimonious rainfall-runoff forecasting models , 2004, Environ. Model. Softw..

[83]  Riccardo Poli,et al.  Particle swarm optimization , 1995, Swarm Intelligence.

[84]  Adam P. Piotrowski,et al.  Estimation of parameters of the transient storage model by means of multi-layer perceptron neural networks / Estimation des paramètres du modèle de transport TSM au moyen de réseaux de neurones perceptrons multi-couches , 2008 .

[85]  William H. Press,et al.  Numerical Recipes in Fortran 77 , 1992 .

[86]  A. Kalra,et al.  Estimating soil moisture using remote sensing data: A machine learning approach , 2010 .

[87]  Elie Bienenstock,et al.  Neural Networks and the Bias/Variance Dilemma , 1992, Neural Computation.

[88]  Gwo-Fong Lin,et al.  Effective typhoon characteristics and their effects on hourly reservoir inflow forecasting , 2010 .

[89]  V. Singh,et al.  The HBV model. , 1995 .

[90]  Rollin H. Hotchkiss,et al.  Peak Discharge for Small Agricultural Watersheds , 1995 .

[91]  Petri Koistinen,et al.  Using additive noise in back-propagation training , 1992, IEEE Trans. Neural Networks.

[92]  A. G. Ivakhnenko,et al.  Polynomial Theory of Complex Systems , 1971, IEEE Trans. Syst. Man Cybern..

[93]  Henrik Madsen,et al.  Adaptive state updating in real-time river flow forecasting—a combined filtering and error forecasting procedure , 2005 .

[94]  Simon A. Barton Sigmoidal Weight Constraint in a Recurrent Neural Network , 2004 .

[95]  Pedro Antonio Gutiérrez,et al.  Hybridizing logistic regression with product unit and RBF networks for accurate detection and prediction of banking crises , 2010, Omega.

[96]  Zhen Zhu,et al.  Optimized Approximation Algorithm in Neural Networks Without Overfitting , 2008, IEEE Transactions on Neural Networks.

[97]  S. G. Wallis,et al.  Evaluation of 1-D tracer concentration profile by ANN , 2007 .

[98]  N. J. DE VOS,et al.  Multi-objective performance comparison of an artificial neural network and a conceptual rainfall—runoff model , 2007 .

[99]  William H. Press,et al.  Numerical recipes in Fortran 77 : the art of scientificcomputing. , 1992 .

[100]  Eric Gaume,et al.  Over-parameterisation, a major obstacle to the use of artificial neural networks in hydrology? , 2003 .

[101]  Wade T. Crow,et al.  Improving hydrologic predictions of a catchment model via assimilation of surface soil moisture , 2011 .

[102]  R. Storn,et al.  Differential Evolution: A Practical Approach to Global Optimization (Natural Computing Series) , 2005 .

[103]  Klaus-Robert Müller,et al.  Asymptotic statistical theory of overtraining and cross-validation , 1997, IEEE Trans. Neural Networks.

[104]  Ville Tirronen,et al.  Distributed differential evolution with explorative–exploitative population families , 2009, Genetic Programming and Evolvable Machines.

[105]  Orazio Giustolisi,et al.  Improving generalization of artificial neural networks in rainfall–runoff modelling / Amélioration de la généralisation de réseaux de neurones artificiels pour la modélisation pluie-débit , 2005 .

[106]  Adam Kiczko,et al.  Corrigendum to: "Differential evolution algorithm with separated groups for multi-dimensional optimization problems" [Eur. J. Oper. Res. 216 (2012) 33-46] , 2012, Eur. J. Oper. Res..

[107]  P. E. O'connell,et al.  An introduction to the European Hydrological System — Systeme Hydrologique Europeen, “SHE”, 1: History and philosophy of a physically-based, distributed modelling system , 1986 .

[108]  Michael Y. Hu,et al.  Forecasting with artificial neural networks: The state of the art , 1997 .

[109]  Joydeep Ghosh,et al.  Ridge polynomial networks , 1995, IEEE Trans. Neural Networks.

[110]  Dimitris K. Tasoulis,et al.  Enhancing Differential Evolution Utilizing Proximity-Based Mutation Operators , 2011, IEEE Transactions on Evolutionary Computation.

[111]  C. Perrin,et al.  Does a large number of parameters enhance model performance? Comparative assessment of common catchment model structures on 429 catchments , 2001 .

[112]  Lorenzo L. Pesce,et al.  Noise injection for training artificial neural networks: a comparison with weight decay and early stopping. , 2009, Medical physics.

[113]  Günter Blöschl,et al.  Time stability of catchment model parameters: Implications for climate impact analyses , 2011 .