Groundwater Augmentation through the Site Selection of Floodwater Spreading Using a Data Mining Approach (Case study: Mashhad Plain, Iran)

It is a well-known fact that sustainable development goals are difficult to achieve without a proper water resources management strategy. This study tries to implement some state-of-the-art statistical and data mining models i.e., weights-of-evidence (WoE), boosted regression trees (BRT), and classification and regression tree (CART) to identify suitable areas for artificial recharge through floodwater spreading (FWS). At first, suitable areas for the FWS project were identified in a basin in north-eastern Iran based on the national guidelines and a literature survey. Using the same methodology, an identical number of FWS unsuitable areas were also determined. Afterward, a set of different FWS conditioning factors were selected for modeling FWS suitability. The models were applied using 70% of the suitable and unsuitable locations and validated with the rest of the input data (i.e., 30%). Finally, a receiver operating characteristics (ROC) curve was plotted to compare the produced FWS suitability maps. The findings depicted acceptable performance of the BRT, CART, and WoE for FWS suitability mapping with an area under the ROC curves of 92, 87.5, and 81.6%, respectively. Among the considered variables, transmissivity, distance from rivers, aquifer thickness, and electrical conductivity were determined as the most important contributors in the modeling. FWS suitability maps produced by the proposed method in this study could be used as a guideline for water resource managers to control flood damage and obtain new sources of groundwater. This methodology could be easily replicated to produce FWS suitability maps in other regions with similar hydrogeological conditions.

[1]  C. Uvo,et al.  An extended modeling approach to assess climate change impacts on groundwater recharge and adaptation in arid areas , 2014 .

[2]  Bahareh Kalantar,et al.  Groundwater potential mapping using C5.0, random forest, and multivariate adaptive regression spline models in GIS , 2018, Environmental Monitoring and Assessment.

[3]  Seyed Amir Naghibi,et al.  GIS-based landslide spatial modeling in Ganzhou City, China , 2016, Arabian Journal of Geosciences.

[4]  B. Pradhan,et al.  A comparative study of logistic model tree, random forest, and classification and regression tree models for spatial prediction of landslide susceptibility , 2017 .

[5]  Arash Malekian,et al.  Application of GIS techniques to determine areas most suitable for artificial groundwater recharge in a coastal aquifer in southern Iran , 2007 .

[6]  Wei Chen,et al.  Land Subsidence Susceptibility Mapping in South Korea Using Machine Learning Algorithms , 2018, Sensors.

[7]  Hyun-Joo Oh,et al.  Assessment of ground subsidence using GIS and the weights-of-evidence model , 2010 .

[8]  Seyed Amir Naghibi,et al.  A comparative study of landslide susceptibility maps produced using support vector machine with different kernel functions and entropy data mining models in China , 2018, Bulletin of Engineering Geology and the Environment.

[9]  L. Ayalew,et al.  The application of GIS-based logistic regression for landslide susceptibility mapping in the Kakuda-Yahiko Mountains, Central Japan , 2005 .

[10]  Rahim Ali Abbaspour,et al.  Using combined AHP–genetic algorithm in artificial groundwater recharge site selection of Gareh Bygone Plain, Iran , 2014, Environmental Earth Sciences.

[11]  Hamid Reza Pourghasemi,et al.  A comparison between ten advanced and soft computing models for groundwater qanat potential assessment in Iran using R and GIS , 2018, Theoretical and Applied Climatology.

[12]  E. Carranza Weights of Evidence Modeling of Mineral Potential: A Case Study Using Small Number of Prospects, Abra, Philippines , 2004 .

[13]  Seyed Amir Naghibi,et al.  A Comparative Assessment Between Three Machine Learning Models and Their Performance Comparison by Bivariate and Multivariate Statistical Methods in Groundwater Potential Mapping , 2015, Water Resources Management.

[14]  Biswajeet Pradhan,et al.  Land use change modeling and the effect of compact city paradigms: integration of GIS-based cellular automata and weights-of-evidence techniques , 2018, Environmental Earth Sciences.

[15]  S. H. Tabatabaei,et al.  Application of digital techniques to identify aquifer artificial recharge sites in GIS environment , 2013, Int. J. Digit. Earth.

[16]  S. Venkateswaran,et al.  Delineation of Artificial Recharge Zones Using Geospatial Techniques in Sarabanga Sub Basin Cauvery River, Tamil Nadu☆ , 2015 .

[17]  B. Pradhan,et al.  Application of weights-of-evidence and certainty factor models and their comparison in landslide susceptibility mapping at Haraz watershed, Iran , 2013, Arabian Journal of Geosciences.

[18]  Seyed Amir Naghibi,et al.  GIS-based groundwater potential mapping using boosted regression tree, classification and regression tree, and random forest machine learning models in Iran , 2015, Environmental Monitoring and Assessment.

[19]  Hamid Reza Pourghasemi,et al.  Assessment of a spatial multi-criteria evaluation to site selection underground dams in the Alborz Province, Iran , 2016 .

[20]  B. Pradhan,et al.  Landslide susceptibility mapping at Golestan Province, Iran: A comparison between frequency ratio, Dempster-Shafer, and weights-of-evidence models , 2012 .

[21]  Mohamad Hamzeh,et al.  Determining the most suitable areas for artificial groundwater recharge via an integrated PROMETHEE II-AHP method in GIS environment (case study: Garabaygan Basin, Iran) , 2012, Environmental Monitoring and Assessment.

[22]  Wei-Yin Loh,et al.  Classification and regression trees , 2011, WIREs Data Mining Knowl. Discov..

[23]  N. Chandrasekar,et al.  Delineation of groundwater potential zones in Theni district, Tamil Nadu, using remote sensing, GIS and MIF techniques , 2012 .

[24]  Bahareh Kalantar,et al.  Groundwater potential mapping using a novel data-mining ensemble model , 2018, Hydrogeology Journal.

[25]  C. Sutton Classification and Regression Trees, Bagging, and Boosting , 2005 .

[26]  H. Yasuda,et al.  Using Floodwater for Artificial Recharge and Spate Irrigation , 2017 .

[27]  Ali Asghar Alesheikh,et al.  Land assessment for flood spreading site selection using geospatial information system , 2008 .

[28]  R. Berndtsson,et al.  Traditional irrigation techniques in MENA with a focus on Tunisia , 2016 .

[29]  Magnus Persson,et al.  Natural vs. artificial groundwater recharge, quantification through inverse modeling , 2012 .

[30]  P. A. Shary,et al.  Fundamental quantitative methods of land surface analysis , 2002 .

[31]  F. Moreira,et al.  Modeling and mapping wildfire ignition risk in Portugal , 2009 .

[32]  J Elith,et al.  A working guide to boosted regression trees. , 2008, The Journal of animal ecology.

[33]  Omid Rahmati,et al.  Application of Dempster-Shafer theory, spatial analysis and remote sensing for groundwater potentiality and nitrate pollution analysis in the semi-arid region of Khuzestan, Iran. , 2016, The Science of the total environment.

[34]  R. Berndtsson,et al.  Artificial recharge by floodwater spreading estimated by water balances and groundwater modelling in arid Iran , 2015 .

[35]  Zohre Sadat Pourtaghi,et al.  Landslide susceptibility mapping using random forest, boosted regression tree, classification and regression tree, and general linear models and comparison of their performance at Wadi Tayyah Basin, Asir Region, Saudi Arabia , 2015, Landslides.

[36]  Seyed Amir Naghibi,et al.  GIS-based Groundwater Spring Potential Mapping Using Data Mining Boosted Regression Tree and Probabilistic Frequency Ratio Models in Iran , 2017 .

[37]  Mustafa Neamah Jebur,et al.  Flood susceptibility mapping using a novel ensemble weights-of-evidence and support vector machine models in GIS , 2014 .

[38]  W. Cornelis,et al.  Artificial recharge efficiency assessment by soil water balance and modelling approaches in a multi-layered vadose zone in a dry region , 2018, Hydrological Sciences Journal.

[39]  Biswajeet Pradhan,et al.  A 100‐year maximum flood susceptibility mapping using integrated hydrological and hydrodynamic models: Kelantan River Corridor, Malaysia , 2011 .

[40]  Seyed Amir Naghibi,et al.  Evaluation of four supervised learning methods for groundwater spring potential mapping in Khalkhal region (Iran) using GIS-based features , 2017, Hydrogeology Journal.

[41]  Ali Noroozi,et al.  Integrating GIS and DSS for identification of suitable areas for artificial recharge, case study Meimeh Basin, Isfahan, Iran , 2005 .

[42]  B. Pradhan,et al.  Application of frequency ratio, statistical index, and weights-of-evidence models and their comparison in landslide susceptibility mapping in Central Nepal Himalaya , 2014, Arabian Journal of Geosciences.

[43]  Afshin Honarbakhsh,et al.  Application of Spatial Analysis Techniques to Select the Most Suitable Areas for Flood Spreading , 2013, Water Resources Management.

[44]  Aabha Sargaonkar,et al.  Identifying potential sites for artificial groundwater recharge in sub-watershed of River Kanhan, India , 2011 .

[45]  Seyed Amir Naghibi,et al.  Prioritization of landslide conditioning factors and its spatial modeling in Shangnan County, China using GIS-based data mining algorithms , 2018, Bulletin of Engineering Geology and the Environment.

[46]  Saowanee Wijitkosum,et al.  A Review of the Analytical Hierarchy Process (AHP): An Approach to Water Resource Management in Thailand , 2015 .

[47]  M. K. Zarkesh,et al.  Decision Support System for Floodwater Spreading Site Selection in Iran , 2002 .

[48]  Biswajeet Pradhan,et al.  Sustainable Brownfields Land Use Change Modeling Using GIS-based Weights-of-Evidence Approach , 2016 .

[49]  F. Bação,et al.  MODELLING AND MAPPING THE OCCURRENCE OF WILDFIRE IGNITIONS IN PORTUGAL , 2007 .

[50]  Pradeep Garg,et al.  Remote Sensing and GIS Based Groundwater Potential & Recharge Zones Mapping Using Multi-Criteria Decision Making Technique , 2015, Water Resources Management.

[51]  Omid Rahmati,et al.  Flood hazard zoning in Yasooj region, Iran, using GIS and multi-criteria decision analysis , 2016 .

[52]  Yousef Nazzal,et al.  Identification of potential artificial groundwater recharge zones in Northwestern Saudi Arabia using GIS and Boolean logic , 2015 .

[53]  B. Pradhan,et al.  A novel machine learning-based approach for the risk assessment of nitrate groundwater contamination. , 2018, The Science of the total environment.

[54]  Stephen J. Carver,et al.  Integrating multi-criteria evaluation with geographical information systems , 1991, Int. J. Geogr. Inf. Sci..

[55]  Hamid Reza Pourghasemi,et al.  Assessment and comparison of combined bivariate and AHP models with logistic regression for landslide susceptibility mapping in the Chaharmahal-e-Bakhtiari Province, Iran , 2016, Arabian Journal of Geosciences.

[56]  C. Uvo,et al.  An extended modeling approach to assess climate change impacts on groundwater recharge and adaptation in arid areas , 2014 .

[57]  R. Berndtsson,et al.  SPATE IRRIGATION OF BARLEY THROUGH FLOODWATER HARVESTING IN THE GAREH‐BYGONE PLAIN, IRAN , 2014 .

[58]  I. P. Senanayake,et al.  An approach to delineate groundwater recharge potential sites in Ambalantota, Sri Lanka using GIS techniques , 2016 .

[59]  Bart Muys,et al.  Predicting forest site productivity in temperate lowland from forest floor, soil and litterfall characteristics using boosted regression trees , 2011, Plant and Soil.

[60]  Shane Abeare Comparisons of boosted regression tree, GLM and GAM performance in the standardization of yellowfin tuna catch-rate data from the Gulf of Mexico lonline [sic] fishery , 2009 .

[61]  Saro Lee,et al.  Application of a weights-of-evidence method and GIS to regional groundwater productivity potential mapping. , 2012, Journal of environmental management.

[62]  Tri Dev Acharya,et al.  Landslide susceptibility mapping using J48 Decision Tree with AdaBoost, Bagging and Rotation Forest ensembles in the Guangchang area (China) , 2018 .

[63]  B. Pradhan,et al.  A comparative assessment of prediction capabilities of Dempster–Shafer and Weights-of-evidence models in landslide susceptibility mapping using GIS , 2013 .

[64]  Hamid Reza Pourghasemi,et al.  A comparative assessment between linear and quadratic discriminant analyses (LDA-QDA) with frequency ratio and weights-of-evidence models for forest fire susceptibility mapping in China , 2017, Arabian Journal of Geosciences.

[65]  H. Pourghasemi,et al.  Application of GIS-based data driven random forest and maximum entropy models for groundwater potential mapping: A case study at Mehran Region, Iran , 2016 .