Mapping Groundwater Potential Using a Novel Hybrid Intelligence Approach

Identifying areas with high groundwater potential is important for groundwater resources management. The main objective of this study is to propose a novel classifier ensemble method, namely Random Forest Classifier based on Random Subspace Ensemble (RS-RF), for groundwater potential mapping (GWPM) in Qorveh-Dehgolan plain, Kurdistan province, Iran. A total of 12 conditioning factors (slope, aspect, elevation, curvature, stream power index (SPI), topographic wetness index (TWI), rainfall, lithology, land use, normalized difference vegetation index (NDVI), fault density, and river density) were selected for groundwater modeling. The least square support vector machine (LSSVM) feature selection method with a 10-fold cross-validation technique was used to validate the predictive capability of these conditioning factors for training the models. The performance of the RS-RF model was validated using the area under receiver operating characteristic curve (AUROC), success and prediction rate curves, kappa index, and several statistical index-based measures. In addition, Friedman and Wilcoxon signed-rank tests were used to assess statistically significant level among the new model with the state-of-the-art soft computing benchmark models, such as random forest (RF), logistic regression (LR) and naïve Bayes (NB). Results showed that the new hybrid model of RS-RF had a very high predictive capability for groundwater potential mapping and exhibited the best performance among other benchmark models (LR, RF, and NB). Results of the present study might be useful to water managers to make proper decisions on the optimal use of groundwater resources for future planning in the critical study area.

[1]  M. Friedman The Use of Ranks to Avoid the Assumption of Normality Implicit in the Analysis of Variance , 1937 .

[2]  A. Simpson,et al.  What is the best index of detectability? , 1973, Psychological Bulletin.

[3]  J A Swets,et al.  Measuring the accuracy of diagnostic systems. , 1988, Science.

[4]  Centor Rm,et al.  Receiver Operating Characteristics (ROC) Curve Area Analysis Using the ROC ANALYZER. , 1989 .

[5]  D. Dubois,et al.  ROUGH FUZZY SETS AND FUZZY ROUGH SETS , 1990 .

[6]  I. Moore,et al.  Digital terrain modelling: A review of hydrological, geomorphological, and biological applications , 1991 .

[7]  I. Moore,et al.  Length-slope factors for the Revised Universal Soil Loss Equation: simplified method of estimation , 1992 .

[8]  Chang-Jo Chung,et al.  The representation of geoscience information for data integration , 1993 .

[9]  Igor Kononenko,et al.  Estimating Attributes: Analysis and Extensions of RELIEF , 1994, ECML.

[10]  M. Alavi TECTONICS OF THE ZAGROS OROGENIC BELT OF IRAN - NEW DATA AND INTERPRETATIONS , 1994 .

[11]  J. Ross Quinlan,et al.  Improved Use of Continuous Attributes in C4.5 , 1996, J. Artif. Intell. Res..

[12]  Tin Kam Ho,et al.  The Random Subspace Method for Constructing Decision Forests , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[13]  P. Sree Devi,et al.  Hydrogeomorphological and groundwater prospects of the Pageru river basin by using remote sensing data , 2001 .

[14]  S. Walter,et al.  Properties of the summary receiver operating characteristic (SROC) curve for diagnostic test data , 2002, Statistics in medicine.

[15]  Johan A. K. Suykens,et al.  Least Squares Support Vector Machines , 2002 .

[16]  Robert P. W. Duin,et al.  Bagging, Boosting and the Random Subspace Method for Linear Classifiers , 2002, Pattern Analysis & Applications.

[17]  T. Mark Beasley,et al.  Comparison of Aligned Friedman Rank and Parametric Methods for Testing Interactions in Split-Plot Designs , 2002, Comput. Stat. Data Anal..

[18]  Isabelle Guyon,et al.  An Introduction to Variable and Feature Selection , 2003, J. Mach. Learn. Res..

[19]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[20]  Pengfei Zhao,et al.  Robust Precise Eye Location by Adaboost and SVM Techniques , 2005, ISNN.

[21]  Lior Rokach,et al.  Data Mining And Knowledge Discovery Handbook , 2005 .

[22]  Lior Rokach,et al.  Ensemble Methods for Classifiers , 2005, The Data Mining and Knowledge Discovery Handbook.

[23]  Limin Wang,et al.  Combining decision tree and Naive Bayes for classification , 2006, Knowl. Based Syst..

[24]  V. Chowdary,et al.  Groundwater management and development by integrated remote sensing and geographic information systems: prospects and constraints , 2007 .

[25]  G. Gopinath,et al.  Application of remote sensing and GIS for the demarcation of groundwater potential zones of a river basin in Kerala, southwest coast of India , 2007 .

[26]  V. M. Chowdary,et al.  Delineation of groundwater recharge zones and identification of artificial recharge sites in West Medinipur district, West Bengal, using RS, GIS and MCDM techniques , 2009 .

[27]  Mapping potential areas for groundwater storage in Wadi Aurnah Basin, western Arabian Peninsula, using remote sensing and geographic information system techniques , 2010 .

[28]  I. Dar,et al.  Remote sensing technology and geographic information system modeling: An integrated approach towards the mapping of groundwater potential zones in Hardrock terrain, Mamundiyar basin , 2010 .

[29]  Ismail Chenini,et al.  Groundwater Recharge Zone Mapping Using GIS-Based Multi-criteria Analysis: A Case Study in Central Tunisia (Maknassy Basin) , 2010 .

[30]  Tara G Martin,et al.  A guide to eliciting and using expert knowledge in Bayesian ecological models. , 2010, Ecology letters.

[31]  M. A. Saud Mapping potential areas for groundwater storage in Wadi Aurnah Basin, western Arabian Peninsula, using remote sensing and geographic information system techniques , 2010 .

[32]  B. Pradhan,et al.  A knowledge-driven GIS modeling technique for groundwater potential mapping at the Upper Langat Basin, Malaysia , 2013, Arabian Journal of Geosciences.

[33]  Saro Lee,et al.  GIS mapping of regional probabilistic groundwater potential in the area of Pohang City, Korea , 2011 .

[34]  Biswajeet Pradhan,et al.  Spatial prediction of landslide hazards in Hoa Binh province (Vietnam): a comparative assessment of , 2012 .

[35]  C. Petus,et al.  Using MODIS Normalized Difference Vegetation Index to monitor seasonal and inter-annual dynamics of wetland vegetation in the Great Artesian Basin: a baseline for assessment of future changes in a unique ecosystem , 2012 .

[36]  Cristina Aguilar,et al.  NDVI as an indicator for changes in water availability to woody vegetation , 2012 .

[37]  Biswajeet Pradhan,et al.  Application of probabilistic-based frequency ratio model in groundwater potential mapping using remote sensing data and GIS , 2014, Arabian Journal of Geosciences.

[38]  B. Pradhan,et al.  Estimating groundwater vulnerability to pollution using a modified DRASTIC model in the Kerman agricultural area, Iran , 2014, Environmental Earth Sciences.

[39]  Mustafa Neamah Jebur,et al.  Spatial prediction of flood susceptible areas using rule based decision tree (DT) and a novel ensemble bivariate and multivariate statistical models in GIS , 2013 .

[40]  Antonio Criminisi,et al.  Decision Forests for Computer Vision and Medical Image Analysis , 2013, Advances in Computer Vision and Pattern Recognition.

[41]  Biswajeet Pradhan,et al.  A comparative study on the predictive ability of the decision tree, support vector machine and neuro-fuzzy models in landslide susceptibility mapping using GIS , 2013, Comput. Geosci..

[42]  Zhigang Zeng,et al.  Ensemble of extreme learning machine for landslide displacement prediction based on time series analysis , 2013, Neural Computing and Applications.

[43]  Alan Mair,et al.  Logistic regression modeling to assess groundwater vulnerability to contamination in Hawaii, USA. , 2013, Journal of contaminant hydrology.

[44]  Mikhail Kanevski,et al.  Machine Learning Feature Selection Methods for Landslide Susceptibility Mapping , 2013, Mathematical Geosciences.

[45]  K. Chapi,et al.  Spatiotemporal patterns of stable isotopes and hydrochemistry in springs and river flow of the upper Karkheh River Basin, Iran , 2014, Isotopes in environmental and health studies.

[46]  H. Pourghasemi,et al.  Groundwater potential mapping at Kurdistan region of Iran using analytic hierarchy process and GIS , 2015, Arabian Journal of Geosciences.

[47]  Li Zhang,et al.  Hybrid decision tree and naïve Bayes classifiers for multi-class classification tasks , 2014, Expert Syst. Appl..

[48]  Hamid Reza Pourghasemi,et al.  Groundwater qanat potential mapping using frequency ratio and Shannon’s entropy models in the Moghan watershed, Iran , 2015, Earth Science Informatics.

[49]  Biswajeet Pradhan,et al.  A novel ensemble decision tree-based CHi-squared Automatic Interaction Detection (CHAID) and multivariate logistic regression models in landslide susceptibility mapping , 2014, Landslides.

[50]  B. Pradhan,et al.  Application of GIS based data driven evidential belief function model to predict groundwater potential zonation , 2014 .

[51]  H. Shahabi,et al.  Remote sensing and GIS-based landslide susceptibility mapping using frequency ratio, logistic regression, and fuzzy logic methods at the central Zab basin, Iran , 2015, Environmental Earth Sciences.

[52]  Arvind Pandey,et al.  Delineation of groundwater potential zone in hard rock terrain of India using remote sensing, geographical information system (GIS) and analytic hierarchy process (AHP) techniques , 2015 .

[53]  Bahram Gharabaghi,et al.  Spatial-Temporal Dynamics of Runoff Generation Areas in a Small Agricultural Watershed in Southern Ontario , 2015 .

[54]  V. Rodriguez-Galiano,et al.  Machine learning predictive models for mineral prospectivity: an evaluation of neural networks, random forest, regression trees and support vector machines , 2015 .

[55]  Baihua Fu,et al.  Riparian vegetation NDVI dynamics and its relationship with climate, surface water and groundwater , 2015 .

[56]  B. Pradhan,et al.  A novel hybrid evidential belief function-based fuzzy logic model in spatial prediction of rainfall-induced shallow landslides in the Lang Son city area (Vietnam) , 2015 .

[57]  Omid Rahmati,et al.  Spatial analysis of groundwater potential using weights-of-evidence and evidential belief function models and remote sensing , 2015, Arabian Journal of Geosciences.

[58]  Konstantinos Voudouris,et al.  A GIS/Remote Sensing-based methodology for groundwater potentiality assessment in Tirnavos area, Greece , 2015 .

[59]  M. Wang,et al.  Susceptibility Analysis of Earthquake-Induced Landslide Using Random Forest Method , 2015 .

[60]  H. Pourghasemi,et al.  Application of GIS-based data driven random forest and maximum entropy models for groundwater potential mapping: A case study at Mehran Region, Iran , 2016 .

[61]  K. Solaimani,et al.  Rock fall susceptibility assessment along a mountainous road: an evaluation of bivariate statistic, analytical hierarchy process and frequency ratio , 2017, Environmental Earth Sciences.

[62]  D. Bui,et al.  Shallow landslide susceptibility assessment using a novel hybrid intelligence approach , 2017, Environmental Earth Sciences.

[63]  Dieu Tien Bui,et al.  Hybrid integration of Multilayer Perceptron Neural Networks and machine learning ensembles for landslide susceptibility assessment at Himalayan area (India) using GIS , 2017 .

[64]  Omid Rahmati,et al.  Applicability of generalized additive model in groundwater potential modelling and comparison its performance by bivariate statistical methods , 2017 .

[65]  Bahareh Kalantar,et al.  Groundwater potential mapping using C5.0, random forest, and multivariate adaptive regression spline models in GIS , 2018, Environmental Monitoring and Assessment.

[66]  Biswajeet Pradhan,et al.  Groundwater spring potential modelling: Comprising the capability and robustness of three different modeling approaches , 2018, Journal of Hydrology.