Application of Probabilistic and Machine Learning Models for Groundwater Potentiality Mapping in Damghan Sedimentary Plain, Iran

Groundwater is one of the most important natural resources, as it regulates the earth’s hydrological system. The Damghan sedimentary plain area, located in the region of a semi-arid climate of Iran, has very critical conditions of groundwater due to massive pressure on it and is in need of robust models for identifying the groundwater potential zones (GWPZ). The main goal of the current research is to prepare a groundwater potentiality map (GWPM) considering the probabilistic, machine learning, data mining, and multi-criteria decision analysis (MCDA) approaches. For this purpose, 80 wells collected from the Iranian groundwater resource department and field investigation with global positioning system (GPS), have been selected randomly and considered as the groundwater inventory datasets. Out of 80 wells, 56 (70%) wells have been brought into play for modeling and 24 (30%) for validation purposes. Elevation, slope, aspect, convergence index (CI), rainfall, drainage density (Dd), distance to river, distance to fault, distance to road, lithology, soil type, land use/land cover (LU/LC), normalized difference vegetation index (NDVI), topographic wetness index (TWI), topographic position index (TPI), and stream power index (SPI) have been used for modeling purpose. The area under the receiver operating characteristic (AUROC), sensitivity (SE), specificity (SP), accuracy (AC), mean absolute error (MAE), and root mean square error (RMSE) are used for checking the goodness-of-fit and prediction accuracy of approaches to compare their performance. In addition, the influence of groundwater determining factors (GWDFs) on groundwater occurrence was evaluated by performing a sensitivity analysis model. The GWPMs, produced by technique for order preference by similarity to ideal solution (TOPSIS), random forest (RF), binary logistic regression (BLR), weight of evidence (WoE) and support vector machine (SVM) have been classified into four categories, i.e., low, medium, high and very high groundwater potentiality with the help of the natural break classification methods in the GIS environment. The very high groundwater potentiality class is covered 15.09% for TOPSIS, 15.46% for WoE, 25.26% for RF, 15.47% for BLR, and 18.74% for SVM of the entire plain area. Based on sensitivity analysis, distance from river, and drainage density represent significantly effects on the groundwater occurrence. validation results show that the BLR model with best prediction accuracy and goodness-of-fit outperforms the other five models. Although, all models have very good performance in modeling of groundwater potential. Results of seed cell area index model that used for checking accuracy classification of models show that all models have suitable performance. Therefore, these are promising models that can be applied for the GWPZs identification, which will help for some needful action of these areas.

[1]  Biswajeet Pradhan,et al.  Application of probabilistic-based frequency ratio model in groundwater potential mapping using remote sensing data and GIS , 2014, Arabian Journal of Geosciences.

[2]  Hamid Reza Pourghasemi,et al.  A comparison between ten advanced and soft computing models for groundwater qanat potential assessment in Iran using R and GIS , 2018, Theoretical and Applied Climatology.

[3]  Ziyue Chen,et al.  A new indicator of ecosystem water use efficiency based on surface soil moisture retrieved from remote sensing , 2017 .

[4]  P. Kuhnert,et al.  Incorporating uncertainty in gully erosion calculations using the random forests modelling approach , 2009 .

[5]  Qiuhong Tang,et al.  Groundwater recharge and discharge in a hyperarid alluvial plain (Akesu, Taklimakan Desert, China) , 2007 .

[6]  Hamid Reza Pourghasemi,et al.  Spatial Modelling of Gully Erosion Using GIS and R Programing: A Comparison among Three Data Mining Algorithms , 2018, Applied Sciences.

[7]  S. Anbazhagan,et al.  Modeling groundwater probability index in Ponnaiyar River basin of South India using analytic hierarchy process , 2016, Modeling Earth Systems and Environment.

[8]  Ayele Almaw Fenta,et al.  Spatial analysis of groundwater potential using remote sensing and GIS-based multi-criteria evaluation in Raya Valley, northern Ethiopia , 2015, Hydrogeology Journal.

[9]  E. Rotigliano,et al.  Using topographical attributes to evaluate gully erosion proneness (susceptibility) in two mediterranean basins: advantages and limitations , 2015, Natural Hazards.

[10]  Iuliana Armaş,et al.  Weights of evidence method for landslide susceptibility mapping. Prahova Subcarpathians, Romania , 2012, Natural Hazards.

[11]  Netra R. Regmi,et al.  Modeling susceptibility to landslides using the weight of evidence approach: Western Colorado, USA , 2010 .

[12]  Sunil Saha,et al.  Landslide susceptibility mapping using knowledge driven statistical models in Darjeeling District, West Bengal, India , 2019, Geoenvironmental Disasters.

[13]  Raphael Huser,et al.  Point process-based modeling of multiple debris flow landslides using INLA: an application to the 2009 Messina disaster , 2017, Stochastic Environmental Research and Risk Assessment.

[14]  H. Pourghasemi,et al.  GIS-based gully erosion susceptibility mapping: a comparison among three data-driven models and AHP knowledge-based technique , 2018, Environmental Earth Sciences.

[15]  Peter A. Vanrolleghem,et al.  Uncertainty in the environmental modelling process - A framework and guidance , 2007, Environ. Model. Softw..

[16]  Zeshui Xu,et al.  Efficiency evaluation of sustainable water management using the HF-TODIM method , 2019, Int. Trans. Oper. Res..

[17]  Robert P. W. Duin,et al.  Uniform Object Generation for Optimizing One-class Classifiers , 2002, J. Mach. Learn. Res..

[18]  A. Tavili,et al.  Monitoring and assessment of seasonal land cover changes using remote sensing: a 30-year (1987–2016) case study of Hamoun Wetland, Iran , 2018, Environmental Monitoring and Assessment.

[19]  Hamid Reza Pourghasemi,et al.  Application of analytical hierarchy process, frequency ratio, and certainty factor models for groundwater potential mapping using GIS , 2015, Earth Science Informatics.

[20]  Kellie J. Archer,et al.  Empirical characterization of random forest variable importance measures , 2008, Comput. Stat. Data Anal..

[21]  Assefa M. Melesse,et al.  A Comprehensive Review on Water Quality Parameters Estimation Using Remote Sensing Techniques , 2016, Sensors.

[22]  Vijaya Babu Vommi,et al.  TOPSIS with statistical distances: A new approach to MADM , 2017 .

[23]  B. Pradhan,et al.  Landslide susceptibility mapping at Golestan Province, Iran: A comparison between frequency ratio, Dempster-Shafer, and weights-of-evidence models , 2012 .

[24]  Binh Thai Pham,et al.  Artificial Intelligence Approaches for Prediction of Compressive Strength of Geopolymer Concrete , 2019, Materials.

[25]  Carlos Henrique Grohmann,et al.  Comparison of roving-window and search-window techniques for characterising landscape morphometry , 2009, Comput. Geosci..

[26]  D. Bui,et al.  Landslide susceptibility analysis in the Hoa Binh province of Vietnam using statistical index and logistic regression , 2011 .

[27]  N Thilagavathi,et al.  Mapping of groundwater potential zones in Salem Chalk Hills, Tamil Nadu, India, using remote sensing and GIS techniques , 2015, Environmental Monitoring and Assessment.

[28]  Hamid Reza Pourghasemi,et al.  GIS-based bivariate statistical techniques for groundwater potential analysis (an example of Iran) , 2017, Journal of Earth System Science.

[29]  Biswajeet Pradhan,et al.  A comparative study on the predictive ability of the decision tree, support vector machine and neuro-fuzzy models in landslide susceptibility mapping using GIS , 2013, Comput. Geosci..

[30]  I. Moore,et al.  Digital terrain modelling: A review of hydrological, geomorphological, and biological applications , 1991 .

[31]  Elif Sertel,et al.  Accuracy Assessment of Different Digital Surface Models , 2018, ISPRS Int. J. Geo Inf..

[32]  Sunil Saha,et al.  Groundwater potential mapping using analytical hierarchical process: a study on Md. Bazar Block of Birbhum District, West Bengal , 2017, Spatial Information Research.

[33]  Lal Samarakoon,et al.  Landslide susceptibility mapping using logistic regression model (a case study in Badulla District, Sri Lanka) , 2018 .

[34]  Saro Lee,et al.  GIS mapping of regional probabilistic groundwater potential in the area of Pohang City, Korea , 2011 .

[35]  Igor Linkov,et al.  Untangling drivers of species distributions: Global sensitivity and uncertainty analyses of MaxEnt , 2014, Environ. Model. Softw..

[36]  B. Pradhan,et al.  A comparative study of logistic model tree, random forest, and classification and regression tree models for spatial prediction of landslide susceptibility , 2017 .

[37]  Bahareh Kalantar,et al.  Groundwater potential mapping using C5.0, random forest, and multivariate adaptive regression spline models in GIS , 2018, Environmental Monitoring and Assessment.

[38]  Kazem Nosrati,et al.  Assessment of groundwater quality using multivariate statistical techniques in Hashtgerd Plain, Iran , 2011, Environmental Earth Sciences.

[39]  Biswajeet Pradhan,et al.  Groundwater spring potential modelling: Comprising the capability and robustness of three different modeling approaches , 2018, Journal of Hydrology.

[40]  Omid Rahmati,et al.  Use of a maximum entropy model to identify the key factors that influence groundwater availability on the Gonabad Plain, Iran , 2018, Environmental Earth Sciences.

[41]  K. Beven,et al.  A physically based, variable contributing area model of basin hydrology , 1979 .

[42]  Ralph Rosenbauer,et al.  Evaluating the Quality and Accuracy of TanDEM-X Digital Elevation Models at Archaeological Sites in the Cilician Plain, Turkey , 2014, Remote. Sens..

[43]  Samee Ullah Khan,et al.  Spatial sensitivity analysis of multi-criteria weights in GIS-based land suitability evaluation , 2010, Environ. Model. Softw..

[44]  L. Tham,et al.  Landslide susceptibility mapping based on Support Vector Machine: A case study on natural slopes of Hong Kong, China , 2008 .

[45]  Jie Dou,et al.  Handling high predictor dimensionality in slope-unit-based landslide susceptibility models through LASSO-penalized Generalized Linear Model , 2017, Environ. Model. Softw..

[46]  Achim Zeileis,et al.  BMC Bioinformatics BioMed Central Methodology article Conditional variable importance for random forests , 2008 .

[47]  H. Pourghasemi,et al.  Groundwater potential mapping at Kurdistan region of Iran using analytic hierarchy process and GIS , 2015, Arabian Journal of Geosciences.

[48]  Wei Chen,et al.  GIS-based groundwater potential analysis using novel ensemble weights-of-evidence with logistic regression and functional tree models. , 2018, The Science of the total environment.

[49]  Mikhail Kanevski,et al.  Machine Learning Feature Selection Methods for Landslide Susceptibility Mapping , 2013, Mathematical Geosciences.

[50]  Alireza Arabameri,et al.  GIS-based groundwater potential mapping in Shahroud plain, Iran. A comparison among statistical (bivariate and multivariate), data mining and MCDM approaches. , 2019, The Science of the total environment.

[51]  Hamid Reza Pourghasemi,et al.  Applying different scenarios for landslide spatial modeling using computational intelligence methods , 2017, Environmental Earth Sciences.

[52]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[53]  Binh Thai Pham,et al.  A Novel Classifier Based on Composite Hyper-cubes on Iterated Random Projections for Assessment of Landslide Susceptibility , 2018, Journal of the Geological Society of India.

[54]  V. Doyuran,et al.  A comparison of the GIS based landslide susceptibility assessment methods: multivariate versus bivariate , 2004 .

[55]  Binh Thai Pham,et al.  Prediction and Sensitivity Analysis of Bubble Dissolution Time in 3D Selective Laser Sintering Using Ensemble Decision Trees , 2019, Materials.

[56]  Thomas Blaschke,et al.  Object based image analysis for remote sensing , 2010 .

[57]  E. Rotigliano,et al.  Improving transferability strategies for debris flow susceptibility assessment: Application to the Saponara and Itala catchments (Messina, Italy) , 2017 .

[58]  M. Jha,et al.  Cost-effective Approaches for Sustainable Groundwater Management in Alluvial Aquifer Systems , 2008 .

[59]  Mustafa Neamah Jebur,et al.  Spatial prediction of flood susceptible areas using rule based decision tree (DT) and a novel ensemble bivariate and multivariate statistical models in GIS , 2013 .

[60]  Alexis J. Comber,et al.  Random forest classification of salt marsh vegetation habitats using quad-polarimetric airborne SAR, elevation and optical RS data , 2014 .

[61]  M. Conforti,et al.  Geomorphology and GIS analysis for mapping gully erosion susceptibility in the Turbolo stream catchment (Northern Calabria, Italy) , 2011 .

[62]  Binh Thai Pham,et al.  Prediction of Compressive Strength of Geopolymer Concrete Using Entirely Steel Slag Aggregates: Novel Hybrid Artificial Intelligence Approaches , 2019, Applied Sciences.

[63]  Stefano Tarantola,et al.  Trends in sensitivity analysis practice in the last decade. , 2016, The Science of the total environment.

[64]  N. Park Using maximum entropy modeling for landslide susceptibility mapping with multiple geoenvironmental data sets , 2015, Environmental Earth Sciences.

[65]  Dieu Tien Bui,et al.  A novel artificial intelligence approach based on Multi-layer Perceptron Neural Network and Biogeography-based Optimization for predicting coefficient of consolidation of soil , 2019, CATENA.

[66]  Stefano Tarantola,et al.  Uncertainty and sensitivity analysis: tools for GIS-based model implementation , 2001, Int. J. Geogr. Inf. Sci..

[67]  Tavi Murray,et al.  DEM quality assessment for quantification of glacier surface change , 2007, Annals of Glaciology.

[68]  Bahareh Kalantar,et al.  Groundwater potential mapping using a novel data-mining ensemble model , 2018, Hydrogeology Journal.

[69]  Pradeep Garg,et al.  Remote Sensing and GIS Based Groundwater Potential & Recharge Zones Mapping Using Multi-Criteria Decision Making Technique , 2015, Water Resources Management.

[70]  A. Nonomura,et al.  GIS-based weights-of-evidence modelling of rainfall-induced landslides in small catchments for landslide susceptibility mapping , 2008 .

[71]  B. Pradhan,et al.  Groundwater spring potential mapping using bivariate statistical model and GIS in the Taleghan Watershed, Iran , 2015, Arabian Journal of Geosciences.

[72]  Omid Rahmati,et al.  Spatial analysis of groundwater potential using weights-of-evidence and evidential belief function models and remote sensing , 2015, Arabian Journal of Geosciences.

[73]  Weldon A. Lodwick,et al.  Attribute error and sensitivity analysis of map operations in geographical informations systems: suitability analysis , 1990, Int. J. Geogr. Inf. Sci..