Estimating the aeration coefficient and air demand in bottom outlet conduits of dams using GEP and decision tree methods

Abstract Accurate estimates of air demand are critical when addressing the cavitation phenomenon in bottom outlet conduits of dams. In the present study, this accuracy was evaluated for air demand estimation models using gene expression programming (GEP), classification tree methods [including boosted regression trees (BRT) and random forest (RF) algorithms], and empirical relationships. Using k -fold cross validation and drawing on data from 6 dams, two different air-demand-related parameters were estimated (aerator air discharge demand and aeration coefficient) and the estimation accuracy was assessed using the Nash-Sutcliffe (NS) efficiency and other statistics. Outperforming the other models, the GEP model performed well in estimating aerator air demand (NS =0.674) and the aeration coefficient (NS =0.489). Moreover, on average, the GEP model showed 45% and 12% improvements in aeration coefficient estimation accuracy over empirical relationships and decision tree methods, respectively. Finally, in the function finding phase, the derived mathematical models were presented as non-linear equations to estimate the air demand of aerator and aeration coefficients using a non-linear relation derived from the GEP model.

[1]  Ling Li,et al.  Numerical Simulation of Dissolved Oxygen Supersaturation Flow over the Three Gorges Dam Spillway , 2010 .

[2]  Özgür Kisi,et al.  Performance of radial basis and LM-feed forward artificial neural networks for predicting daily watershed runoff , 2013, Appl. Soft Comput..

[3]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[4]  Vladan Babovic,et al.  Rainfall runoff modelling based on genetic programming , 2002 .

[5]  J Elith,et al.  A working guide to boosted regression trees. , 2008, The Journal of animal ecology.

[6]  Cândida Ferreira,et al.  Gene Expression Programming: A New Adaptive Algorithm for Solving Problems , 2001, Complex Syst..

[7]  Shaban Shataeea,et al.  Plot-level Forest Volume Estimation Using Airborne Laser Scanner and TM Data, Comparison of Boosting and Random Forest Tree Regression Algorithms , 2011 .

[8]  Vladan Babovic,et al.  Rainfall‐Runoff Modeling Based on Genetic Programming , 2006 .

[9]  M. Keijzer,et al.  Genetic programming as a model induction engine , 2000 .

[10]  R. Woods,et al.  Comparing and combining physically-based and empirically-based approaches for estimating the hydrology of ungauged catchments , 2014 .

[11]  G. De’ath Boosted trees for ecological modeling and prediction. , 2007, Ecology.

[13]  B. Huwe,et al.  Uncertainty in the spatial prediction of soil texture: Comparison of regression tree and Random Forest models , 2012 .

[15]  Stefano Malavasi,et al.  Improvements and Validation of the Numerical Prediction of the Incipient Cavitation Index , 2015 .

[16]  Daniel Westreich,et al.  Propensity score estimation: neural networks, support vector machines, decision trees (CART), and meta-classifiers as alternatives to logistic regression. , 2010, Journal of clinical epidemiology.

[17]  Heitor Silvério Lopes,et al.  EGIPSYS: AN ENHANCED GENE EXPRESSION PROGRAMMING APPROACH FOR SYMBOLIC REGRESSION PROBLEMS † , 2004 .

[18]  O. Kisi,et al.  Suspended sediment modeling using genetic programming and soft computing techniques , 2012 .

[19]  Jianming Zhang,et al.  Three-Dimensional Numerical Simulation of Aerated Flows Downstream Sudden Fall Aerator Expansion-In a Tunnel , 2011 .

[20]  Simon J. Pittman,et al.  Using Lidar Bathymetry and Boosted Regression Trees to Predict the Diversity and Abundance of Fish and Corals , 2009 .

[21]  Miklas Scholz,et al.  Computing Air Demand Using the Takagi-Sugeno Model for Dam Outlets , 2013 .

[22]  Samia Boukir,et al.  Relevance of airborne lidar and multispectral image data for urban scene classification using Random Forests , 2011 .

[23]  S. Vincenzi,et al.  Application of a Random Forest algorithm to predict spatial distribution of the potential yield of Ruditapes philippinarum in the Venice lagoon, Italy , 2011 .

[24]  Hari R. Sharma,et al.  Air-Entrainment in High Head Gated Conduits , 1976 .

[25]  Henry T. Falvey,et al.  Prevention of Cavitation on Chutes and Spillways , 1983 .

[26]  Amir Hossein Zaji,et al.  Gene expression programming to predict the discharge coefficient in rectangular side weirs , 2015, Appl. Soft Comput..

[27]  Meiling Liu,et al.  Multivariable integration method for estimating sea surface salinity in coastal waters from in situ data and remotely sensed data using random forest algorithm , 2015, Comput. Geosci..

[28]  Cândida Ferreira,et al.  Gene Expression Programming and the Evolution of Computer Programs , 2004 .

[29]  B. Guyton,et al.  Air Demand In Gated Outlet Works , 1953 .

[30]  Mario Nicodemi,et al.  Performance of genetic programming to extract the trend in noisy data series , 2006 .