Meteorological data mining and hybrid data-intelligence models for reference evaporation simulation: A case study in Iraq

Abstract To model an agriculture process for any region, it is significantly essential to accurately simulate the reference evaporation (ETo) from the available regional meteorological parameters. Nine models, including five data mining algorithms and four adaptive neuro-fuzzy inference systems (ANFISs), were tested for their ability to predict ETo at meteorological stations in Baghdad and Mosul (Iraq). Various weather parameters (e.g., wind speed, sunshine hours, rainfall, maximum and minimum temperature and relative humidity) were recorded and employed as explanatory variables in the models. Pearson correlation analysis showed ETo to have the closest correlation with sunshine hours, maximum and minimum temperatures, and relative humidity. The modeling performance was assessed using the statistical measures of coefficient of determination (R2), root mean square error (RMSE), mean absolute error (MAE), Nash-Sutcliffe efficiency (NSE), percentage of bias (PBIAS), and the ratio of RMSE to the standard deviation of observations (RSR). Investigations on the modeling accuracy with different input parameter combinations showed that, despite the different structures of the models, no single input combination showed a consistent modeling outcome. Fewer variables were necessary to achieve the same high predictive power for the models developed for the Baghdad station than for those developed for the Mosul station. For both stations, the ANFIS-GA model generally showed the greatest predictive power whereas the random tree algorithm showed the poorest. Moreover, hybrid models showed a higher predictive power than the individual models.

[1]  N. Nagelkerke,et al.  A note on a general definition of the coefficient of determination , 1991 .

[2]  Wei-Yin Loh,et al.  Classification and regression trees , 2011, WIREs Data Mining Knowl. Discov..

[3]  Zaher Mundher Yaseen,et al.  Rainfall Pattern Forecasting Using Novel Hybrid Intelligent Model Based ANFIS-FFA , 2017, Water Resources Management.

[4]  Zaher Mundher Yaseen,et al.  Estimation the Physical Variables of Rainwater Harvesting System Using Integrated GIS-Based Remote Sensing Approach , 2016, Water Resources Management.

[5]  Dieu Tien Bui,et al.  Meta optimization of an adaptive neuro-fuzzy inference system with grey wolf optimizer and biogeography-based optimization algorithms for spatial prediction of landslide susceptibility , 2019, CATENA.

[6]  C. Almeida,et al.  Application of Response Surface Methodology and Machine Learning Combined with Data Simulation to Metal Determination of Freshwater Sediment , 2017, Water, Air, & Soil Pollution.

[7]  Akbar Karimi,et al.  Rules for Optimal Operation of Reservoir-River-Groundwater Systems Considering Water Quality Targets: Application of M5P Model , 2013, Water Resources Management.

[8]  G. Stanhill,et al.  Is the Class A evaporation pan still the most practical and accurate meteorological method for determining irrigation water requirements , 2002 .

[9]  Z. Yaseen,et al.  Reference evapotranspiration prediction using hybridized fuzzy model with firefly algorithm: Regional case study in Burkina Faso , 2018, Agricultural Water Management.

[10]  Zaher Mundher Yaseen,et al.  Genetic programming in water resources engineering: A state-of-the-art review , 2018, Journal of Hydrology.

[11]  Nadhir Al-Ansari,et al.  Implementation of Univariate Paradigm for Streamflow Simulation Using Hybrid Data-Driven Model: Case Study in Tropical Region , 2019, IEEE Access.

[12]  Zaher Mundher Yaseen,et al.  Precipitation pattern modeling using cross-station perception: regional investigation , 2018, Environmental Earth Sciences.

[13]  H. L. Penman Natural evaporation from open water, bare soil and grass , 1948, Proceedings of the Royal Society of London. Series A. Mathematical and Physical Sciences.

[14]  Christos Faloutsos,et al.  Mining and Forecasting of Big Time-Series Data , 2015, 2019 IEEE International Conference on Pervasive Computing and Communications Workshops (PerCom Workshops).

[15]  Goldberg,et al.  Genetic algorithms , 1993, Robust Control Systems with Genetic Algorithms.

[16]  M. Yazdi,et al.  Early Pennsylvanian, Bashkirian, echinoderms from eastern Iran, a potential transitional fauna between Laurentia/Avalonia and the Paleotethys, and a Permian cromyocrinid from central Iran , 2011 .

[17]  Ahmed El-Shafie,et al.  Utilizing RBF-NN and ANFIS Methods for Multi-Lead ahead Prediction Model of Evaporation from Reservoir , 2016, Water Resources Management.

[18]  Omid Bozorg-Haddad,et al.  Soil moisture simulation using hybrid artificial intelligent model: Hybridization of adaptive neuro fuzzy inference system with grey wolf optimizer algorithm , 2019, Journal of Hydrology.

[19]  Shahaboddin Shamshirband,et al.  Assessing the suitability of hybridizing the Cuckoo optimization algorithm with ANN and ANFIS techniques to predict daily evaporation , 2016, Environmental Earth Sciences.

[20]  J. Evans,et al.  Evaluating regional climate models for simulating sub-daily rainfall extremes , 2016, Climate Dynamics.

[21]  Hamid Reza Pourghasemi,et al.  Performance evaluation of GIS-based new ensemble data mining techniques of adaptive neuro-fuzzy inference system (ANFIS) with genetic algorithm (GA), differential evolution (DE), and particle swarm optimization (PSO) for landslide spatial modelling , 2017 .

[22]  Shahaboddin Shamshirband,et al.  Modeling monthly pan evaporation using wavelet support vector regression and wavelet artificial neural networks in arid and humid climates , 2019, Engineering Applications of Computational Fluid Mechanics.

[23]  C. Priestley,et al.  On the Assessment of Surface Heat Flux and Evaporation Using Large-Scale Parameters , 1972 .

[24]  Zhi-Hua Zhou,et al.  Classification Under Streaming Emerging New Classes: A Solution Using Completely-Random Trees , 2016, IEEE Transactions on Knowledge and Data Engineering.

[25]  B. Pham,et al.  A comparative assessment of decision trees algorithms for flash flood susceptibility modeling at Haraz watershed, northern Iran. , 2018, The Science of the total environment.

[26]  D. Solomatine,et al.  Model trees as an alternative to neural networks in rainfall—runoff modelling , 2003 .

[27]  David E. Goldberg,et al.  Genetic algorithms and Machine Learning , 1988, Machine Learning.

[28]  J. Nash,et al.  River flow forecasting through conceptual models part I — A discussion of principles☆ , 1970 .

[29]  Chyon-Hwa Yeh,et al.  Classification and regression trees (CART) , 1991 .

[30]  Z. Yaseen,et al.  Novel Hybrid Data-Intelligence Model for Forecasting Monthly Rainfall with Uncertainty Analysis , 2019, Water.

[31]  Amir Etemad-Shahidi,et al.  Design of rubble-mound breakwaters using M5 ′ machine learning method , 2009 .

[32]  D. Bui,et al.  Spatial prediction of groundwater spring potential mapping based on an adaptive neuro-fuzzy inference system and metaheuristic optimization , 2018, Hydrology and Earth System Sciences.

[33]  Zaher Mundher Yaseen,et al.  Determination of compound channel apparent shear stress: application of novel data mining models , 2019, Journal of Hydroinformatics.

[34]  Zekai Şen,et al.  Precipitation projections under GCMs perspective and Turkish Water Foundation (TWF) statistical downscaling model procedures , 2018, Theoretical and Applied Climatology.

[35]  H. Md. Azamathulla,et al.  Assessment of M5′ model tree and classification and regression trees for prediction of scour depth below free overfall spillways , 2012, Neural Computing and Applications.

[36]  Andy Fourie,et al.  Cemented paste backfill for mineral tailings management: Review and future perspectives , 2019 .

[37]  Ahmad Sharafati,et al.  The potential of novel data mining models for global solar radiation prediction , 2019, International Journal of Environmental Science and Technology.

[38]  J. Niemeyer,et al.  Contextual classification of lidar data and building object detection in urban areas , 2014 .

[39]  Soroosh Sorooshian,et al.  Status of Automatic Calibration for Hydrologic Models: Comparison with Multilevel Expert Calibration , 1999 .

[40]  Rainer Storn,et al.  Differential Evolution – A Simple and Efficient Heuristic for global Optimization over Continuous Spaces , 1997, J. Glob. Optim..

[41]  M. S. Moran,et al.  Combining the Penman-Monteith equation with measurements of surface temperature and reflectance to estimate evaporation rates of semiarid grassland , 1996 .

[42]  Zaher Mundher Yaseen,et al.  The implementation of univariable scheme-based air temperature for solar radiation prediction: New development of dynamic evolving neural-fuzzy inference system model , 2019, Applied Energy.

[43]  Zaher Mundher Yaseen,et al.  Quantifying hourly suspended sediment load using data mining models: Case study of a glacierized Andean catchment in Chile , 2018, Journal of Hydrology.

[44]  Hamid Darabi,et al.  River suspended sediment modelling using the CART model: A comparative study of machine learning techniques. , 2018, The Science of the total environment.

[45]  D. Legates,et al.  Evaluating the use of “goodness‐of‐fit” Measures in hydrologic and hydroclimatic model validation , 1999 .

[46]  Jyh-Shing Roger Jang,et al.  ANFIS: adaptive-network-based fuzzy inference system , 1993, IEEE Trans. Syst. Man Cybern..

[47]  Zaher Mundher Yaseen,et al.  An enhanced extreme learning machine model for river flow forecasting: State-of-the-art, practical applications in water resource engineering area and future research direction , 2019, Journal of Hydrology.

[48]  K. Taylor Summarizing multiple aspects of model performance in a single diagram , 2001 .

[49]  Himan Shahabi,et al.  Hybrid artificial intelligence models based on a neuro-fuzzy system and metaheuristic optimization algorithms for spatial prediction of wildfire probability , 2019, Agricultural and Forest Meteorology.

[50]  J. Arnold,et al.  HYDROLOGICAL MODELING OF THE IROQUOIS RIVER WATERSHED USING HSPF AND SWAT 1 , 2005 .

[51]  Himan Shahabi,et al.  Flood susceptibility assessment using integration of adaptive network-based fuzzy inference system (ANFIS) and biogeography-based optimization (BBO) and BAT algorithms (BA) , 2019 .

[52]  K. S. Yap,et al.  Extreme Learning Machines: A new approach for prediction of reference evapotranspiration , 2015 .

[53]  V. Nekkanti,et al.  Media milling process optimization for manufacture of drug nanoparticles using design of experiments (DOE) , 2015, Drug Development and Industrial Pharmacy.

[54]  Dillip K. Ghose,et al.  Prediction and optimization of runoff via ANFIS and GA , 2013 .

[55]  Sama Azadi,et al.  Evaluating the ability of artificial neural network and PCA-M5P models in predicting leachate COD load in landfills. , 2016, Waste management.

[56]  Babita Majhi,et al.  Improved prediction of daily pan evaporation using Deep-LSTM model , 2019, Neural Computing and Applications.

[57]  Zaher Mundher Yaseen,et al.  Hybrid Adaptive Neuro-Fuzzy Models for Water Quality Index Estimation , 2018, Water Resources Management.

[58]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques, 3rd Edition , 1999 .

[59]  Biswajeet Pradhan,et al.  Landslide susceptibility assessment using a novel hybrid model of statistical bivariate methods (FR and WOE) and adaptive neuro-fuzzy inference system (ANFIS) at southern Zagros Mountains in Iran , 2017, Environmental Earth Sciences.

[60]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[61]  Zaher Mundher Yaseen,et al.  Application of soft computing based hybrid models in hydrological variables modeling: a comprehensive review , 2017, Theoretical and Applied Climatology.

[62]  Nadhir Al-Ansari,et al.  Present Conditions and Future Challenges of Water Resources Problems in Iraq , 2014 .

[63]  I. Eames,et al.  The evaporation coefficient of water: a review , 1997 .

[64]  Zaher Mundher Yaseen,et al.  Temperature-based modeling of reference evapotranspiration using several artificial intelligence models: application of different modeling scenarios , 2018, Theoretical and Applied Climatology.

[65]  Albert Gan,et al.  Prediction of Lane Clearance Time of Freeway Incidents Using the M5P Tree Algorithm , 2011, IEEE Transactions on Intelligent Transportation Systems.

[66]  Paresh Chandra Deka,et al.  Support vector machine applications in the field of hydrology: A review , 2014, Appl. Soft Comput..

[67]  Chuen-Tsai Sun,et al.  Neuro-fuzzy modeling and control , 1995, Proc. IEEE.

[68]  J. Lelieveld,et al.  Climate change and impacts in the Eastern Mediterranean and the Middle East , 2012, Climatic Change.

[69]  Aniruddha Ghosh,et al.  A framework for mapping tree species combining hyperspectral and LiDAR data: Role of selected classifiers and sensor across three spatial scales , 2014, Int. J. Appl. Earth Obs. Geoinformation.

[70]  Nadhir Al-Ansari,et al.  Water resources problems of Iraq : Climate change adaptation and mitigation , 2018 .

[71]  Kasım Koçak,et al.  SVR-based prediction of evaporation combined with chaotic approach , 2014 .

[72]  Mohammad Ali Ghorbani,et al.  Estimating daily pan evaporation from climatic data of the State of Illinois, USA using adaptive neuro-fuzzy inference system (ANFIS) and artificial neural network (ANN) , 2011 .

[73]  R. Storn,et al.  Differential Evolution: A Practical Approach to Global Optimization (Natural Computing Series) , 2005 .

[74]  Servet Soyguder,et al.  An expert system for the humidity and temperature control in HVAC systems using ANFIS and optimization with Fuzzy Modeling Approach , 2009 .

[75]  Adriana Bruggeman,et al.  Impact of climate change on the water resources of the eastern Mediterranean and Middle East region: Modeled 21st century changes and implications , 2011 .

[76]  Ernani Sartori,et al.  A critical review on equations employed for the calculation of the evaporation rate from free water surfaces , 2000 .

[77]  Sushila Maheshkar,et al.  Verification of static signature pattern based on random subspace, REP tree and bagging , 2017, Multimedia Tools and Applications.

[78]  T. Chai,et al.  Root mean square error (RMSE) or mean absolute error (MAE)? – Arguments against avoiding RMSE in the literature , 2014 .