Estimating longitudinal dispersion coefficient in natural streams using empirical models and machine learning algorithms

The longitudinal dispersion coefficient (LDC) plays an important role in modeling the transport of pollutants and sediment in natural rivers. As a result of transportation processes, the concentration of pollutants changes along the river. Various studies have been conducted to provide simple equations for estimating LDC. In this study, machine learning methods, namely support vector regression, Gaussian process regression, M5 model tree (M5P) and random forest, and multiple linear regression were examined in predicting the LDC in natural streams. Data sets from 60 rivers around the world with different hydraulic and geometric features were gathered to develop models for LDC estimation. Statistical criteria, including correlation coefficient (CC), root mean squared error (RMSE) and mean absolute error (MAE), were used to scrutinize the models. The LDC values estimated by these models were compared with the corresponding results of common empirical models. The Taylor chart was used to evaluate the models and the results showed that among the machine learning models, M5P had superior performance, with CC of 0.823, RMSE of 454.9 and MAE of 380.9. The model of Sahay and Dutta, with CC of 0.795, RMSE of 460.7 and MAE of 306.1, gave more precise results than the other empirical models. The main advantage of M5P models is their ability to provide practical formulae. In conclusion, the results proved that the developed M5P model with simple formulations was superior to other machine learning models and empirical models; therefore, it can be used as a proper tool for estimating the LDC in rivers.

[1]  Bellie Sivakumar,et al.  Forecasting river water temperature time series using a wavelet–neural network hybrid modelling approach , 2019, Journal of Hydrology.

[2]  Mir Jafar Sadegh Safari,et al.  Sediment transport modeling in open channels using neuro-fuzzy and gene expression programming techniques. , 2019, Water science and technology : a journal of the International Association on Water Pollution Research.

[3]  S. Shamshirband,et al.  Support Vector Regression Integrated with Fruit Fly Optimization Algorithm for River Flow Forecasting in Lake Urmia Basin , 2019, Water.

[4]  Shahaboddin Shamshirband,et al.  Estimating Daily Dew Point Temperature Using Machine Learning Algorithms , 2019, Water.

[5]  J. Adamowski,et al.  An ensemble prediction of flood susceptibility using multivariate discriminant analysis, classification and regression trees, and support vector machines. , 2019, The Science of the total environment.

[6]  Shahaboddin Shamshirband,et al.  Prediction of Hydropower Generation Using Grey Wolf Optimization Adaptive Neuro-Fuzzy Inference System , 2019, Energies.

[7]  Sultan Noman Qasem,et al.  Daily global solar radiation modeling using data-driven techniques and empirical equations in a semi-arid climate , 2019, Engineering Applications of Computational Fluid Mechanics.

[8]  Shahaboddin Shamshirband,et al.  Modeling monthly pan evaporation using wavelet support vector regression and wavelet artificial neural networks in arid and humid climates , 2019, Engineering Applications of Computational Fluid Mechanics.

[9]  Megan Mehaffey,et al.  Development of a spatially complete floodplain map of the conterminous United States using random forest , 2018, The Science of the total environment.

[10]  Mohammad Ali Ghorbani,et al.  Forecasting soil temperature at multiple-depth with a hybrid artificial neural network model coupled-hybrid firefly optimizer algorithm , 2018, Information Processing in Agriculture.

[11]  Salim Heddam,et al.  Modeling daily water temperature for rivers: comparison between adaptive neuro-fuzzy inference systems and artificial neural networks models , 2018, Environmental Science and Pollution Research.

[12]  Kwok-wing Chau,et al.  Flood Prediction Using Machine Learning Models: Literature Review , 2018, Water.

[13]  Mohamad Javad Alizadeh,et al.  Toward multi-day-ahead forecasting of suspended sediment concentration using ensemble models , 2017, Environmental Science and Pollution Research.

[14]  J. Ríha,et al.  Longitudinal dispersion in an open channel determined from a tracer study , 2017, Environmental Earth Sciences.

[15]  Mahsa Modiri Gharehveran,et al.  Prediction of the compressive strength of normal and high-performance concretes using M5P model tree algorithm , 2017 .

[16]  Saeed Samadianfard,et al.  Performance evaluation of ANNs and an M5 model tree in Sattarkhan Reservoir inflow prediction , 2017 .

[17]  Wenxin Huai,et al.  Physically sound formula for longitudinal dispersion coefficients of natural rivers , 2017 .

[18]  Wenxin Huai,et al.  Estimating the Longitudinal Dispersion Coefficient in Straight Natural Rivers , 2016 .

[19]  S. Samadianfard,et al.  Intelligent analysis of global warming effects on sea surface temperature in Hormuzgan Coast, Persian Gulf , 2016 .

[20]  Kwok-wing Chau,et al.  A Hybrid Double Feedforward Neural Network for Suspended Sediment Load Estimation , 2016, Water Resources Management.

[21]  Zhiqiang Deng,et al.  How Reliable Are ANN, ANFIS, and SVM Techniques for Predicting Longitudinal Dispersion Coefficient in Natural Rivers? , 2016 .

[22]  Kwok-Wing Chau,et al.  A comparison of various artificial intelligence approaches performance for estimating suspended sediment load of river systems: a case study in United States , 2015, Environmental Monitoring and Assessment.

[23]  W. Huai,et al.  Estimation of longitudinal dispersion coefficient in rivers , 2014 .

[24]  Minghao Yin,et al.  Differential Evolution for Prediction of Longitudinal Dispersion Coefficients in Natural Streams , 2013, Water Resources Management.

[25]  Aman Mohammad Kalteh,et al.  Monthly river flow forecasting using artificial neural network and support vector regression models coupled with wavelet transform , 2013, Comput. Geosci..

[26]  Adam P. Piotrowski,et al.  A comparison of methods to avoid overfitting in neural networks training in the case of catchment runoff modelling , 2013 .

[27]  Bachir Achour,et al.  An evaluation of ANN methods for estimating the lengths of hydraulic jumps in U-shaped channel , 2013 .

[28]  Amir Etemad-Shahidi,et al.  Predicting Longitudinal Dispersion Coefficient in Natural Streams Using M5′ Model Tree , 2012 .

[29]  Rajeev Ranjan Sahay,et al.  Prediction of longitudinal dispersion coefficients in natural rivers using artificial neural network , 2011 .

[30]  Hazi Mohammad Azamathulla,et al.  Genetic Programming for Predicting Longitudinal Dispersion Coefficients in Streams , 2011 .

[31]  Som Dutta,et al.  Prediction of longitudinal dispersion coefficients in natural rivers using genetic algorithm , 2009 .

[32]  Mohammad Mehdi Ebadzadeh,et al.  An expert system for predicting longitudinal dispersion coefficient in natural streams by using ANFIS , 2009, Expert Syst. Appl..

[33]  Carl E. Rasmussen,et al.  Gaussian processes for machine learning , 2005, Adaptive computation and machine learning.

[34]  Kellie J. Archer,et al.  Empirical characterization of random forest variable importance measures , 2008, Comput. Stat. Data Anal..

[35]  Ping-Feng Pai,et al.  A recurrent support vector regression model in rainfall forecasting , 2007 .

[36]  Vijay P. Singh,et al.  Predicting Longitudinal Dispersion Coefficient in Natural Streams by Artificial Neural Network , 2005 .

[37]  G. Jirka Mixing and dispersion in rivers , 2004 .

[38]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[39]  Roger A Falconer,et al.  Longitudinal dispersion coefficients in natural channels. , 2002, Water research.

[40]  K. Taylor Summarizing multiple aspects of model performance in a single diagram , 2001 .

[41]  K. Chau Transverse mixing coefficient measurements in an open rectangular channel , 2000 .

[42]  Il Won Seo,et al.  Predicting Longitudinal Dispersion Coefficient in Natural Streams , 1998 .

[43]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[44]  Ian H. Witten,et al.  Induction of model trees for predicting continuous classes , 1996 .

[45]  Julia Badal Graf Measured and predicted velocity and longitudinal dispersion at steady and unsteady flow, Colorado River, Glen Canyon Dam to lake mead , 1995 .

[46]  Vladimir Naumovich Vapni The Nature of Statistical Learning Theory , 1995 .

[47]  J. R. Quinlan Learning With Continuous Classes , 1992 .

[48]  Roy A. Walters,et al.  Simulation of solute transport in a mountain pool‐and‐riffle stream: A transient storage model , 1983 .

[49]  H. Fischer Mixing in Inland and Coastal Waters , 1979 .

[50]  E. J. List,et al.  Shear Flow Dispersion , 1979 .

[51]  Thomas N. Keefer,et al.  Simple Method for Predicting Dispersion in Streams , 1974 .

[52]  George V. Sabol,et al.  Empirical data on longitudinal dispersion in rivers , 1974 .

[53]  H. B. Fisher,et al.  Dispersion Predictions in Natural Streams , 1968 .

[54]  Geoffrey Ingram Taylor,et al.  The dispersion of matter in turbulent flow through a pipe , 1954, Proceedings of the Royal Society of London. Series A. Mathematical and Physical Sciences.