A machine learning-based approach to predict the velocity profiles in small streams

This article addresses the determination of velocity profile in small streams by employing powerful machine learning algorithms that include artificial neural networks (ANNs), support vector machine (SVMs), and k-nearest neighbor algorithms (k-NN). Therefore, this study also aims to present a reliable and low-cost method for predicting velocity profile. The data set used in this study was achieved by field measurements performed by using the acoustic Doppler velocimeter (ADV) between 2005 and 2010, in Central Turkey. The eight observational variables and calculated non-dimensional parameters were used as inputs to the models for predicting the target values, u (point velocity in measured verticals). Performances of prediction methods were determined via 10-fold cross-validation approach. The comparative results revealed that k-NN algorithms outperformed the other two machine learning models, with the R value of 0.98 ± 0.0069 and the MAE value of 0.053 ± 0.0075, while ANNs and SVMs models have the R values of 0.95 ± 0.0085 and 0.89 ± 0.0046, the MAE values of 0.085 ± 0.0077 and 0.099 ± 0.0117, respectively. Importance of the predictor variables for ANNs and SVMs models were also presented by using sensitivity analysis.

[1]  Jiawei Han,et al.  Data Mining: Concepts and Techniques , 2000 .

[2]  N. Şarlak Filyos River Streamflow Reconstruction from Tree‐Ring Chronologies with Nonparametric Approaches , 2014 .

[3]  S. Yakowitz,et al.  Nearest‐neighbor methods for nonparametric rainfall‐runoff forecasting , 1987 .

[4]  Leo Breiman,et al.  Classification and Regression Trees , 1984 .

[5]  Mehmet Ardiclioglu,et al.  Applicabilité des Equations de Distribution de Vitesses dans les Ecoulements en Canal Ouvert à fond rugueux , 2005 .

[6]  Fazal H. Chaudhry,et al.  Experimental Evaluation of 2-D Entropy Model for Open-Channel Flow , 1998 .

[7]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[8]  D. Basak,et al.  Support Vector Regression , 2008 .

[9]  N. Cheng Power-law index for velocity profiles in open channel flows , 2007 .

[10]  Ron Kohavi,et al.  A Study of Cross-Validation and Bootstrap for Accuracy Estimation and Model Selection , 1995, IJCAI.

[11]  J. Ross Quinlan,et al.  Induction of Decision Trees , 1986, Machine Learning.

[12]  Julien Clinton Sprott,et al.  Artificial neural networks: powerful tools for modeling chaotic behavior in the nervous system , 2014, Front. Comput. Neurosci..

[13]  Ramon Cabrera,et al.  Performance of a single-beam pulse-to-pulse coherent Doppler profiler , 1996 .

[14]  Victoria J. Hodge,et al.  A Survey of Outlier Detection Methodologies , 2004, Artificial Intelligence Review.

[15]  S. Fischer,et al.  Ultrasonic device for real-time sewage velocity and suspended particles concentration measurements. , 2009, Water science and technology : a journal of the International Association on Water Pollution Research.

[16]  M. Kırkgöz Turbulent Velocity Profiles for Smooth and Rough Open Channel Flow , 1989 .

[17]  Dimitri P. Solomatine,et al.  Model Induction with Support Vector Machines: Introduction and Applications , 2001 .

[18]  G. Galeati A comparison of parametric and non-parametric methods for runoff forecasting , 1990 .

[19]  Cheng‐lung Chen,et al.  Unified Theory on Power Laws for Flow Resistance , 1991 .

[20]  Y. Lai,et al.  Three-Dimensional Numerical Study of Flows in Open-Channel Junctions , 2002 .

[21]  P. Moin,et al.  Eddies, streams, and convergence zones in turbulent flows , 1988 .

[22]  Stefano Tarantola,et al.  Sensitivity Analysis in Practice: A Guide to Assessing Scientific Models , 2004 .

[23]  N. Aǧiralioǧlu,et al.  Investigation of flow properties in natural streams using the entropy concept , 2012 .

[24]  D. Hosmer,et al.  Applied Logistic Regression , 1991 .

[25]  Vladan Babovic,et al.  Data Mining and Knowledge Discovery in Sediment Transport , 2000 .

[26]  Nallamuthu Rajaratnam,et al.  A simple method for measuring shear stress on rough boundaries , 2000 .

[27]  M. Rosenblatt,et al.  Multivariate k-nearest neighbor density estimates , 1979 .

[28]  Jan Adamowski,et al.  Long-term SPI drought forecasting in the Awash River Basin in Ethiopia using wavelet neural network and wavelet support vector regression models , 2014 .

[29]  Shahab Araghinejad,et al.  A Comparative Assessment of Support Vector Machines, Probabilistic Neural Networks, and K-Nearest Neighbor Algorithms for Water Quality Classification , 2014, Water Resources Management.

[30]  Hubert Chanson,et al.  The hydraulics of open channel flow , 2004 .

[31]  Fi-John Chang,et al.  Modelling combined open channel flow by artificial neural networks , 2005 .

[32]  Simon Haykin,et al.  Neural Networks and Learning Machines , 2010 .

[33]  Hydraulics Of Laminar Open Channel Flow , 2010 .

[34]  D. J. Booker,et al.  Hydraulic modelling of fish habitat in urban rivers during high flows , 2003 .

[35]  Mehmet Ardiclioglu,et al.  Velocity Profiles of Developing and Developed Open Channel Flow , 1997 .

[36]  V. Vapnik Pattern recognition using generalized portrait method , 1963 .

[37]  V. Nikora,et al.  ADV Measurements of Turbulence: Can We Improve Their Interpretation? , 1998 .

[38]  Nicola Fontana,et al.  Experimental Assessment of a 2-D Entropy-Based Model for Velocity Distribution in Open Channel Flow , 2013, Entropy.

[39]  Özgür Kişi,et al.  Determination of Mean Velocity and Discharge in Natural Streams Using Neuro-Fuzzy and Neural Network Approaches , 2014, Water Resources Management.

[40]  Abbas Afshar,et al.  Clustered K Nearest Neighbor Algorithm for Daily Inflow Forecasting , 2011 .

[41]  Alcigeimes B. Celeste,et al.  Avaliação de estratégias de modelagem guiada por dados para previsão de vazão em rio sergipano , 2014 .

[42]  G. W. Davis,et al.  Sensitivity analysis in neural net solutions , 1989, IEEE Trans. Syst. Man Cybern..

[43]  W. H. Graf,et al.  Velocity and Turbulence Distribution in Unsteady Open-Channel Flows , 1996 .

[44]  Steven C. Wheelwright,et al.  Forecasting methods and applications. , 1979 .

[45]  T. Kerh,et al.  Transient Fluid-Structure Interaction in a Control Valve , 1997 .

[46]  Dimitri P. Solomatine,et al.  Model Induction with Support Vector Machines: Introduction and Applications , 2001 .

[47]  A. El-Shafie,et al.  Daily Forecasting of Dam Water Levels: Comparing a Support Vector Machine (SVM) Model With Adaptive Neuro Fuzzy Inference System (ANFIS) , 2013, Water Resources Management.

[48]  Mahesh Pal,et al.  Prediction of the end-depth ratio and discharge in semi-circular and circular shaped channels using support vector machines , 2006 .

[49]  Ozgur Kisi,et al.  Anfis to estimate discharge capacity of rectangular side weir , 2013 .

[50]  Christopher J. C. Burges,et al.  A Tutorial on Support Vector Machines for Pattern Recognition , 1998, Data Mining and Knowledge Discovery.

[51]  Kandula V. N. Sarma,et al.  Velocity Distribution in Smooth Rectangular Open Channels , 1983 .

[52]  Aini Hussain,et al.  Erratum to: Daily Forecasting of Dam Water Levels: Comparing a Support Vector Machine (SVM) Model With Adaptive Neuro Fuzzy Inference System (ANFIS) , 2013, Water Resources Management.

[53]  J. W. Kamphuis,et al.  DETERMINATION OF SAND ROUGHNESS FOR FIXED BEDS , 1974 .

[54]  S. Gunn Support Vector Machines for Classification and Regression , 1998 .

[55]  ENEL-CRIS Savizio Idrologico,et al.  A comparison of parametric and non-parametric methods for runoff forecasting , 2007 .

[56]  K. Ghoshal,et al.  Velocity Distribution in Open Channels: Combination of Log-law and Parabolic-law , 2012 .

[57]  Joe C. Willis Near‐Bed Velocity Distribution , 1985 .

[58]  Ercan Gemici,et al.  Akarsularda Hız Dağılımının Modellenmesi ve ADV Ölçümleri , 2010 .

[59]  B. Everitt The Cambridge Dictionary of Statistics , 1998 .