The application of machine learning methods to aggregate geochemistry predicts quarry source location: An example from Ireland

Abstract Attempts using geochemical data to classify quarry sources which provided reactive rock aggregate, composed of Carboniferous aged pyritic mudrocks and limestones, which has caused structural damage to over 12, 500 homes across Ireland have not yet succeeded. In this paper, a possible solution to this problem is found by performing machine learning models, such as Logistic regression and Random Forest, upon a geochemical dataset obtained through the scanning electron microscope energy-dispersive X-ray spectroscopy (SEM-EDS) and Laser ablation-quadrupole-inductively couple plasma mass spectrometry (LA-Q-ICPMS) of pyrite and Isotope ratio mass spectrometry (IRMS) of bulk rock aggregate, to predict quarry source location. When comparing the classification scores, the LA-Q-ICPMS dataset achieved the highest average classification score of 55.38% for Random Forest and 67.73% for Logistic regression based on 10-fold cross validation testing. As a result, this dataset was then used to classify a set of known unknown samples and achieved average classification accuracies of 40.30% for random forest and 66.80% for logistic regression, based on a systematic train-test procedure. There is scope to enhance these classification scores to an accuracy of 100% by combining the geochemical datasets together. However, due to the difficulty in linking pyrites analysed by SEM-EDS to those analysed by LA-Q-ICPMS, and relating a bulk rock analytical technique (IRMS) to mineral geochemistry (SEM-EDS, LA-Q-ICPMS), median values have to be used when combining IRMS (Fe, S) and SEM-EDS (TS and δ34S) datasets with LA-Q-ICPMS data. Therefore, if these combined datasets were used as part of an applied quarry classification system, statistically meaningful mean values taken from a near normally distributed dataset would have to be used in order to accurately represent the quarry composition.

[1]  R. Goodhue,et al.  Discriminating aggregate sources with in situ mineral chemistry: an Irish example , 2019, Quarterly Journal of Engineering Geology and Hydrogeology.

[2]  S. Kuhn,et al.  Distinguishing Ore Deposit Type and Barren Sedimentary Pyrite Using Laser Ablation-Inductively Coupled Plasma-Mass Spectrometry Trace Element Data and Statistical Analysis of Large Data Sets , 2019, Economic Geology.

[3]  Nicholas W M Ritchie,et al.  Performing elemental microanalysis with high accuracy and high precision by scanning electron microscopy/silicon drift detector energy-dispersive X-ray spectrometry (SEM/SDD-EDS) , 2014, Journal of Materials Science.

[4]  W. Ridley,et al.  Development of sulfide calibration standards for the laser ablation inductively-coupled plasma mass spectrometry technique , 2002 .

[5]  M. Böttcher,et al.  Trace metals in Holocene coastal peats and their relation to pyrite formation (NW Germany) , 2002 .

[6]  R. Reyment Compositional data analysis , 1989 .

[7]  Isik Yilmaz,et al.  Landslide susceptibility mapping using frequency ratio, logistic regression, artificial neural networks and their comparison: A case study from Kat landslides (Tokat - Turkey) , 2009, Comput. Geosci..

[8]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[9]  T. Lyons,et al.  Trace Element Content of Sedimentary Pyrite in Black Shales , 2015 .

[10]  A. Stepanov,et al.  Whole rock and discrete pyrite geochemistry as complementary tracers of ancient ocean chemistry: An example from the Neoproterozoic Doushantuo Formation, China , 2017 .

[11]  Kazuhide Sawada,et al.  Landslide susceptibility analysis with logistic regression model based on FCM sampling strategy , 2013, Comput. Geosci..

[12]  I. Somerville,et al.  Stratigraphy and sedimentology of lower carboniferous (Dinantian) boreholes from West Co. Meath, Ireland , 1990 .

[13]  R. Large,et al.  Geochemistry of shale and sedimentary pyrite as a proxy for gold fertility in the Selwyn basin area, Yukon , 2018, Mineralium Deposita.

[14]  Renguang Zuo,et al.  GIS-based rare events logistic regression for mineral prospectivity mapping , 2018, Comput. Geosci..

[15]  K. Savage,et al.  The effect of As, Co, and Ni impurities on pyrite oxidation kinetics: Batch and flow-through reactor experiments with synthetic pyrite , 2008 .

[16]  M. Murphy,et al.  What Is Machine Learning , 2015 .

[17]  Emmanuel John M. Carranza,et al.  Random forest predictive modeling of mineral prospectivity with small number of prospects and data with missing values in Abra (Philippines) , 2015, Comput. Geosci..

[18]  J. Hellstrom,et al.  Iolite: Freeware for the visualisation and processing of mass spectrometric data , 2011 .

[19]  H. Gäbler,et al.  Development of a Matrix‐Matched Sphalerite Reference Material (MUL‐ZnS‐1) for Calibration of In Situ Trace Element Measurements by Laser Ablation‐Inductively Coupled Plasma‐Mass Spectrometry , 2017 .

[20]  A. Ashworth,et al.  Breast cancer molecular profiling with single sample predictors: a retrospective analysis. , 2010, The Lancet. Oncology.

[21]  P. Atkinson,et al.  Random Forest classification of Mediterranean land cover using multi-seasonal imagery and multi-seasonal texture , 2012 .

[22]  Paul Quigley,et al.  Evaluating pyrite-induced swelling in Dublin mudrocks , 2015 .