A stacking methodology of machine learning for 3D geological modeling with geological-geophysical datasets, Laochang Sn camp, Gejiu (China)

Abstract A stacking method was followed for the classification and prediction of rock types and for 3D geological modeling in the Laochang Sn camp, Geijiu (China). The sequence of the methodology is as follows. Initially, multi-source datasets were compiled GOCAD software, including a borehole dataset with rock types and 1:50,000 scale gravity (residual density) and magnetic susceptibility inversion interpretation using 3D-grid model. Then, the datasets were subjected to first-level classifiers using support vector machine, gradient boosting decision tree, neural network, random forest, and XGBoost. This was followed by subjecting the datasets to second-level classifier using XGBoost. Next, features (XYZ coordinates, residual density and magnetic susceptibility) were input for training and testing (70% of data for training, 30% for testing). Then, upon completion of training the model, the machine learning metrics were used to evaluate comprehensively the performance of the model, and the 3D geological model was visualized. Finally, the 3D geological modeling was validated by comparison with a robust geological dataset with mining channel dataset. The research results show the stacking method of machine learning can be used to construct effectively and quickly a 3D multi-type rocks model using geological and geophysical datasets. The obtained areas under the ROC curves were 0.93 for the granites, 0.98 for the basalts, and 0.94 for "other" rocks. The methodology can be extended to multi-source datasets of geoscience big data and even to different metallogenic models or geological settings.

[1]  Stephen V. Stehman,et al.  Selecting and interpreting measures of thematic classification accuracy , 1997 .

[2]  S. Bergman The kernel function and conformal mapping , 1950 .

[3]  W S McCulloch,et al.  A logical calculus of the ideas immanent in nervous activity , 1990, The Philosophy of Artificial Intelligence.

[4]  Mirko van der Baan,et al.  Neural networks in geophysical applications , 2000 .

[5]  Vladimir Vapnik,et al.  Support-vector networks , 2004, Machine Learning.

[6]  Padhraic Smyth,et al.  Linearly Combining Density Estimators via Stacking , 1999, Machine Learning.

[7]  Geophysical Models for the Cu-Dominated VHMS Mineralization in Katta District, Western Ethiopia , 2013, Natural Resources Research.

[8]  G. McMechan,et al.  Vector-based elastic reverse time migration , 2015 .

[9]  Ian H. Witten,et al.  Issues in Stacked Generalization , 2011, J. Artif. Intell. Res..

[10]  Tianqi Chen,et al.  XGBoost: A Scalable Tree Boosting System , 2016, KDD.

[11]  Yoav Freund,et al.  A decision-theoretic generalization of on-line learning and an application to boosting , 1997, EuroCOLT.

[12]  Maarit Middleton,et al.  Boosting for Mineral Prospectivity Modeling: A New GIS Toolbox , 2019, Natural Resources Research.

[13]  A. Journel Nonparametric estimation of spatial distributions , 1983 .

[14]  Gongwen Wang,et al.  3D geological modeling based on gravitational and magnetic data inversion in the Luanchuan ore region, Henan Province, China , 2012 .

[15]  Leo Breiman,et al.  Classification and Regression Trees , 1984 .

[16]  Yao Yevenyo Ziggah,et al.  A Novel Hybrid Technique of Integrating Gradient-Boosted Machine and Clustering Algorithms for Lithology Classification , 2019, Natural Resources Research.

[17]  Dong Li,et al.  Mineral potential targeting and resource assessment based on 3D geological modeling in Luanchuan region, China , 2011, Comput. Geosci..

[18]  E. Grunsky,et al.  3D GIS as a support for mineral discovery , 2011 .

[19]  Nello Cristianini,et al.  An Introduction to Support Vector Machines and Other Kernel-based Learning Methods , 2000 .

[20]  Gongwen Wang,et al.  Integration of multi-source and multi-scale datasets for 3D structural modeling for subsurface exploration targeting, Luanchuan Mo-polymetallic district, China , 2017 .

[21]  Charles L. Bérubé,et al.  Predicting rock type and detecting hydrothermal alteration using machine learning and petrophysical properties of the Canadian Malartic ore and host rocks, Pontiac Subprovince, Québec, Canada , 2018 .

[22]  David M. W. Powers,et al.  Evaluation: from precision, recall and F-measure to ROC, informedness, markedness and correlation , 2011, ArXiv.

[23]  Simon W. Houlding,et al.  3D Geoscience Modeling: Computer Techniques for Geological Characterization , 1995 .

[24]  E. Carranza,et al.  3D geological modeling for prediction of subsurface Mo targets in the Luanchuan district, China , 2015 .

[25]  Gongwen Wang,et al.  GeoCube: A 3D mineral resources quantitative prediction and assessment system , 2016, Comput. Geosci..

[26]  Claudio Moraga,et al.  The Influence of the Sigmoid Function Parameters on the Speed of Backpropagation Learning , 1995, IWANN.

[27]  E. Carranza,et al.  3D Mineral Prospectivity Mapping with Random Forests: A Case Study of Tongling, Anhui, China , 2019, Natural Resources Research.

[28]  Zhi-Hua Zhou,et al.  Ensemble Methods: Foundations and Algorithms , 2012 .

[29]  Nan Li,et al.  3D Geological Modeling for Mineral System Approach to GIS-Based Prospectivity Analysis: Case Study of an MVT Pb–Zn Deposit , 2018, Natural Resources Research.

[30]  Binxu Zhai,et al.  Development of a stacked ensemble model for forecasting and analyzing daily average PM2.5 concentrations in Beijing, China. , 2018, The Science of the total environment.

[31]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[32]  David H. Wolpert,et al.  Stacked generalization , 1992, Neural Networks.

[33]  Gongwen Wang,et al.  Interactive 3D Modeling by Integration of Geoscience Datasets for Exploration Targeting in Luanchuan Mo Polymetallic District, China , 2018, Natural Resources Research.

[34]  Jianping Chen,et al.  Three-Dimensional Mineral Potential Mapping for Reducing Multiplicity and Uncertainty: Kaerqueka Polymetallic Deposit, QingHai Province, China , 2019, Natural Resources Research.

[35]  J. Chilès,et al.  Geological modelling from field data and geological knowledge. Part I. Modelling method coupling 3D potential-field interpolation and geological rules , 2008 .

[36]  E. Carranza,et al.  Multi-scale Numerical Simulation and 3D Modeling for Deep Mineral Exploration in the Jiaojia Gold District, China , 2019, Natural Resources Research.

[37]  Sisi Zlatanova,et al.  Topological models and frameworks for 3D spatial objects , 2004, Comput. Geosci..

[38]  V. Rodriguez-Galiano,et al.  Machine learning predictive models for mineral prospectivity: an evaluation of neural networks, random forest, regression trees and support vector machines , 2015 .

[39]  Olivier Kaufmann,et al.  3D geological modelling from boreholes, cross-sections and geological maps, application over former natural gas storages in coal mines , 2008, Comput. Geosci..

[40]  Liang Yang,et al.  Assessing and visualizing uncertainty of 3D geological surfaces using level sets with stochastic motion , 2019, Comput. Geosci..

[41]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[42]  G. Caumon,et al.  Surface-Based 3D Modeling of Geological Structures , 2009 .

[43]  J. Mallet Elements of Mathematical Sedimentary Geology: the GeoChron Model , 2014 .

[44]  Mao Jingwen,et al.  Age and geochemistry of granites in Gejiu area, Yunnan province, SW China: Constraints on their petrogenesis and tectonic setting , 2010 .

[45]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[46]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[47]  Guillaume Caumon,et al.  Towards Stochastic Time-Varying Geological Modeling , 2010 .

[48]  Nitesh V. Chawla,et al.  SMOTE: Synthetic Minority Over-sampling Technique , 2002, J. Artif. Intell. Res..

[49]  R. Zuo Machine Learning of Mineralization-Related Geochemical Anomalies: A Review of Potential Methods , 2017, Natural Resources Research.

[50]  M. Jessell Three-dimensional geological modelling of potential-field data , 2001 .

[51]  W. Brent Lindquist,et al.  Image Thresholding by Indicator Kriging , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[52]  Frédéric Nguyen,et al.  1D geological imaging of the subsurface from geophysical data with Bayesian Evidential Learning , 2019, Comput. Geosci..

[53]  Binbin Zhao,et al.  Application of Bi-dimensional empirical mode decomposition (BEMD) modeling for extracting gravity anomaly indicating the ore-controlling geological architectures and granites in the Gejiu tin-copper polymetallic ore field, southwestern China , 2017 .

[54]  J. Friedman Greedy function approximation: A gradient boosting machine. , 2001 .

[55]  J. Friedman Stochastic gradient boosting , 2002 .

[56]  Pejman Tahmasebi,et al.  Hybrid geological modeling: Combining machine learning and multiple-point statistics , 2020, Comput. Geosci..

[57]  Jeffrey M. Dick,et al.  Exploring Spatially Non-stationary Relationships in the Determinants of Mineralization in 3D Geological Space , 2019, Natural Resources Research.

[58]  Anand Singh Triangular Grid-Based Fuzzy Cross-Update Inversion of Gravity Data: Case Studies from Mineral Exploration , 2019, Natural Resources Research.

[59]  Chaofeng Li,et al.  Identifying organic-rich Marcellus Shale lithofacies by support vector machine classifier in the Appalachian basin , 2014, Comput. Geosci..