Piezometric level prediction based on data mining techniques

The safety assessment of dams is a complex task that is made possible thanks to a constant monitoring of pertinent parameters. Once collected, the data are processed by statistical analysis models in order to describe the behaviour of the structure. The aim of those models is to detect early signs of abnormal behaviour so as to take corrective actions when required. Because of the uniqueness of each structure, the behavioural models need to adapt to each of these structures, and thus flexibility is required. Simultaneously, generalization capacities are sought, so a trade-off has to be found. This flexibility is even more important when the analysed phenomenon is characterized by nonlinear features. This is notably the case of the piezometric levels (PL) monitored at the rock–concrete interface of arch dams, when this interface opens. In that case, the linear models that are classically used by engineers show poor performances. Consequently, interest naturally grows for the advanced learning algorithms known as machine learning techniques. In this work, the aim was to compare the predictive performances and generalization capacities of six different data mining algorithms that are likely to be used for monitoring purposes in the particular case of the piezometry at the interface of arch dams: artificial neural networks (ANN), support vector machines (SVM), decision tree, k -nearest neighbour, random forest and multiple regression. All six are used to analyse the same time series. The interpretation of those PL permits to understand the phenomenon of the aperture of the interface, which is highly nonlinear, and of great concern in dam safety. The achieved results show that SVM and ANN stand out as the most efficient algorithms, when it comes to analysing nonlinear monitored phenomenon. Through a global sensitivity analysis, the influence of the models’ attributes is measured, showing a high impact of Z (relative trough) in PL prediction.

[1]  Richard Malm,et al.  Proceedings of the 14th ICOLD International Benchmark Workshop on Numerical Analysis of Dams , 2017 .

[2]  Paulo Cortez,et al.  Using sensitivity analysis and visualization techniques to open black box data mining models , 2013, Inf. Sci..

[3]  Arie Ben-David,et al.  Control of properties in injection molding by neural networks , 2001 .

[4]  V. M. S. R. Murthy,et al.  Prediction of throw in bench blasting using neural networks: an approach , 2016, Neural Computing and Applications.

[5]  Yuan Wang,et al.  Extreme learning machine-based surrogate model for analyzing system reliability of soil slopes , 2017 .

[6]  Nenad Grujovic,et al.  Modelling of dam behaviour based on neuro-fuzzy identification , 2012 .

[7]  Nenad Grujovic,et al.  Predicting piezometric water level in dams via artificial neural networks , 2012, Neural Computing and Applications.

[8]  Nenad Grujovic,et al.  Development of support vector regression identification model for prediction of dam structural behaviour , 2014 .

[9]  Joydeep Ghosh,et al.  Data Clustering Algorithms And Applications , 2013 .

[10]  Joaquim Agostinho Barbosa Tinoco,et al.  Stability Condition Identification of Rock and Soil Cutting Slopes Based on Soft Computing , 2018 .

[11]  Jean-Michel Poggi,et al.  Random Forests for Big Data , 2015, Big Data Res..

[12]  Jinbo Bi,et al.  Regression Error Characteristic Curves , 2003, ICML.

[13]  Ashutosh Kumar Singh,et al.  The Elements of Statistical Learning: Data Mining, Inference, and Prediction , 2010 .

[14]  Michael J. A. Berry,et al.  Mastering Data Mining: The Art and Science of Customer Relationship Management , 1999 .

[15]  Dan Cornford,et al.  Optimal design for correlated processes with input-dependent noise , 2014, Comput. Stat. Data Anal..

[16]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[17]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[18]  Ali Akbar Ramezanianpour,et al.  Hybrid support vector regression – Particle swarm optimization for prediction of compressive strength and RCPT of concretes containing metakaolin , 2012 .

[19]  Chih-Chieh Young,et al.  Genetic algorithm and fuzzy neural networks combined with the hydrological modeling system for forecasting watershed runoff discharge , 2015, Neural Computing and Applications.

[20]  Walter Krämer,et al.  Review of Modern applied statistics with S, 4th ed. by W.N. Venables and B.D. Ripley. Springer-Verlag 2002 , 2003 .

[21]  P. Léger,et al.  Hydrostatic, Temperature, Time-Displacement Model for Concrete Dams , 2007 .

[22]  Paulo Cortez,et al.  Data Mining with Neural Networks and Support Vector Machines Using the R/rminer Tool , 2010, ICDM.

[23]  Eugenio Oñate,et al.  Data-Based Models for the Prediction of Dam Behaviour: A Review and Some Methodological Considerations , 2017 .

[24]  Samir Roy,et al.  A neural network-based intelligent cognitive state recognizer for confidence-based e-learning system , 2016, Neural Computing and Applications.

[25]  Yunqian Ma,et al.  Practical selection of SVM parameters and noise estimation for SVM regression , 2004, Neural Networks.

[26]  Eugenio Oñate,et al.  A machine learning based methodology for anomaly detection in dam behaviour , 2017 .

[27]  Geoffrey E. Hinton,et al.  Learning internal representations by error propagation , 1986 .

[28]  Eugenio Oñate,et al.  An empirical comparison of machine learning techniques for dam behaviour modelling , 2015 .

[29]  Wei-Yin Loh,et al.  Classification and regression trees , 2011, WIREs Data Mining Knowl. Discov..

[30]  Heekuck Oh,et al.  Neural Networks for Pattern Recognition , 1993, Adv. Comput..

[31]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[32]  William N. Venables,et al.  Modern Applied Statistics with S , 2010 .

[33]  Jesús Fraile-Ardanuy,et al.  Dam seepage analysis based on artificial neural networks: The hysteresis phenomenon , 2013, The 2013 International Joint Conference on Neural Networks (IJCNN).

[34]  J. Mata,et al.  Interpretation of concrete dam behaviour with artificial neural network and multiple linear regression models , 2011 .

[35]  Braga Farinha Hydromechanical behaviour of concrete dam foundations. In situ tests and numerical modelling , 2010 .

[36]  Bernhard Schölkopf,et al.  A tutorial on support vector regression , 2004, Stat. Comput..