Optimization of geochemical anomaly detection using a novel genetic K-means clustering (GKMC) algorithm

Abstract Due to the complicated characteristics of regional geochemical data from stream sediments as a result of the complexity of geological features, detection of multi-elemental geochemical footprints of mineral deposits of interest is a challenging task. As a way to address this, a hybrid genetic algorithm-based technique, namely genetic K-means clustering (GKMC) algorithm, is proposed here for optimum delineation of multi-elemental patterns (both anomaly and background) in stream sediment geochemical data. To do so, factor analysis and sample catchment basin modeling were coupled with GKMC and traditional K-means clustering (TKMC) methods for identification of anomalous multi-elemental geochemical footprints of deposits of porphyry copper and skarn copper in the 1:100,000 scale Varzaghan map sheet, northwest Iran. Based on higher prediction rate, it can be inferred that the model derived by GKMC is superior to the one derived by TKMC. In addition, the strong anomaly classes of the GKMC and TKMC models predict, respectively, ~83% and ~66% of the porphyry/skarn Cu deposits in ~22% and ~36% of the study district. Thus, the geochemical anomaly targets derived by the GKMC method are more reliable than those generated by the TKMC method. This revealed that the GKMC algorithm is an efficient and robust tool for recognizing multi-element geochemical anomalies for mineral exploration.

[1]  Qiuming Cheng,et al.  Application of singularity mapping technique to identify local anomalies using stream sediment geochemical data, a case study from Gangdese, Tibet, western China , 2009 .

[2]  Filipa Luz,et al.  Cu- and Zn-Soil Anomalies in the NE Border of the South Portuguese Zone (Iberian Variscides, Portugal) Identified by Multifractal and Geostatistical Analyses , 2014, Natural Resources Research.

[3]  M. Narasimha Murty,et al.  Genetic K-means algorithm , 1999, IEEE Trans. Syst. Man Cybern. Part B.

[4]  P. Filzmoser,et al.  Principal component analysis for compositional data with outliers , 2009 .

[5]  E. Carranza,et al.  Data-Driven Predictive Modeling of Mineral Prospectivity Using Random Forests: A Case Study in Catanduanes Island (Philippines) , 2016, Natural Resources Research.

[6]  Q. Cheng,et al.  Integrated Spatial and Spectrum Method for Geochemical Anomaly Separation , 2000 .

[7]  Ravi P. Agarwal,et al.  Multiple solutions for the Dirichlet second-order boundary value problem of nonsingular type , 1999 .

[8]  Lalit M. Patnaik,et al.  Genetic algorithms: a survey , 1994, Computer.

[9]  Y. Dilek,et al.  Geochemistry and tectonics of Cenozoic volcanism in the Lesser Caucasus (Azerbaijan) and the peri-Arabian region: collision-induced mantle dynamics and its magmatic fingerprint , 2010 .

[10]  J. Walshe,et al.  Giant Porphyry Deposits: Characteristics, Distribution, and Tectonic Controls , 2005 .

[11]  Clemens Reimann,et al.  Factor analysis applied to regional geochemical data: problems and possibilities , 2002 .

[12]  Peter Filzmoser,et al.  Exploratory factor analysis revisited: How robust methods support the detection of hidden multivariate data structures in IS research , 2010, Inf. Manag..

[13]  T. Velmurugan,et al.  Computational Complexity between K-Means and K-Medoids Clustering Algorithms for Normal and Uniform Distributions of Data Points , 2010 .

[14]  M. Parsa,et al.  Decomposition of anomaly patterns of multi-element geochemical signatures in Ahar area, NW Iran: a comparison of U-spatial statistics and fractal models , 2016, Arabian Journal of Geosciences.

[15]  A. Clare,et al.  A comparison of unsupervised neural networks and k-means clustering in the analysis of multi-element stream sediment data , 2001, Geochemistry: Exploration, Environment, Analysis.

[16]  Graeme F. Bonham-Carter,et al.  Measuring the Performance of Mineral-Potential Maps , 2005 .

[17]  Yue Liu,et al.  A new method for geochemical anomaly separation based on the distribution patterns of singularity indices , 2017, Comput. Geosci..

[18]  Q. Cheng Mapping singularities with stream sediment geochemical data for prediction of undiscovered mineral deposits in Gejiu, Yunnan Province, China , 2007 .

[19]  E. Carranza,et al.  Mapping of single- and multi-element geochemical indicators based on catchment basin analysis: Application of fractal method and unsupervised clustering models , 2019, Journal of Geochemical Exploration.

[20]  Abbas Maghsoudi,et al.  An Improved Data-Driven Multiple Criteria Decision-Making Procedure for Spatial Modeling of Mineral Prospectivity: Adaption of Prediction–Area Plot and Logistic Functions , 2019, Natural Resources Research.

[21]  John Aitchison,et al.  The Statistical Analysis of Compositional Data , 1986 .

[22]  Mohammad Habibi Parsa,et al.  A Receiver Operating Characteristics-Based Geochemical Data Fusion Technique for Targeting Undiscovered Mineral Deposits , 2017, Natural Resources Research.

[23]  Sriparna Saha,et al.  A generalized automatic clustering algorithm in a multiobjective framework , 2013, Appl. Soft Comput..

[24]  G. Mateu-Figueras,et al.  Isometric Logratio Transformations for Compositional Data Analysis , 2003 .

[25]  Michael J. Thompson,et al.  Duplicate analysis in geochemical practice. Part I. Theoretical approach and estimation of analytical reproducibility , 1976 .

[26]  Heping Pan,et al.  Performance of the synergetic wavelet transform and modified K-means clustering in lithology classification using nuclear log , 2016 .

[27]  T Watson Layne,et al.  A Genetic Algorithm Approach to Cluster Analysis , 1998 .

[28]  Javier Ruiz del Solar,et al.  Visualization of Volcanic Rock Geochemical Data and Classification with Artificial Neural Networks , 2006 .

[29]  Anil K. Jain Data clustering: 50 years beyond K-means , 2010, Pattern Recognit. Lett..

[30]  Noel C. White,et al.  Geology of the post-collisional porphyry copper-molybdenum deposit at Qulong, Tibet , 2009 .

[31]  A. Maghsoudi,et al.  Comparison of U-spatial statistics and C–A fractal models for delineating anomaly patterns of porphyry-type Cu geochemical signatures in the Varzaghan district, NW Iran , 2018 .

[32]  A. Maghsoudi,et al.  Combination of multifractal geostatistical interpolation and spectrum–area (S–A) fractal model for Cu–Au geochemical prospects in Feizabad district, NE Iran , 2019, Arabian Journal of Geosciences.

[33]  P.F.M. van Gaans,et al.  The application of fuzzy c-means cluster analysis and non-linear mapping to geochemical datasets: examples from Portugal , 1988 .

[34]  Emmanuel John M. Carranza,et al.  A catchment basin approach to the analysis of reconnaissance geochemical - geological data from Albay province, Philippines , 1997 .

[35]  Emmanuel John M. Carranza,et al.  Performance evaluation of RBF- and SVM-based machine learning algorithms for predictive mineral prospectivity modeling: integration of S-A multifractal model and mineralization controls , 2019, Earth Science Informatics.

[36]  A. Maghsoudi,et al.  Prospectivity modeling of porphyry copper deposits: recognition of efficient mono- and multi-element geochemical signatures in the Varzaghan district, NW Iran , 2018, Acta Geochimica.

[37]  A. Maghsoudi,et al.  Incorporation of principal component analysis, geostatistical interpolation approaches and frequency-space-based models for portraying the Cu-Au geochemical prospects in the Feizabad district, NW Iran , 2019, Geochemistry.

[38]  E. Carranza,et al.  Geochemical mineralization probability index (GMPI): A new approach to generate enhanced stream sediment geochemical evidential map for increasing probability of success in mineral potential mapping , 2012 .

[39]  H. Jamali,et al.  Metallogeny and tectonic evolution of the Cenozoic Ahar–Arasbaran volcanic belt, northern Iran , 2010 .

[40]  Ujjwal Maulik,et al.  Genetic algorithm-based clustering technique , 2000, Pattern Recognit..

[41]  A. Maghsoudi,et al.  A hybrid AHP-VIKOR approach for prospectivity modeling of porphyry Cu deposits in the Varzaghan District, NW Iran , 2018, Arabian Journal of Geosciences.

[42]  Jian Wang,et al.  Deep learning and its application in geochemical mapping , 2019, Earth-Science Reviews.

[43]  P. Filzmoser,et al.  The bivariate statistical analysis of environmental (compositional) data. , 2010, The Science of the total environment.

[44]  Limin Zhou,et al.  Temporal–spatial distribution and tectonic setting of porphyry copper deposits in Iran: Constraints from zircon U–Pb and molybdenite Re–Os geochronology , 2015 .

[45]  Wei Wu,et al.  Separation of geochemical anomalies from the sample data of unknown distribution population using Gaussian mixture model , 2019, Comput. Geosci..

[46]  E. Carranza Analysis and mapping of geochemical anomalies using logratio-transformed stream sediment data with c , 2011 .

[47]  P. Rousseeuw Silhouettes: a graphical aid to the interpretation and validation of cluster analysis , 1987 .