Low-resolution Geological Survey of Pakistan (GSP) maps of the area surrounding the region of interest show oolitic limestone in the Samanasuk formation and fossiliferous limestone in the Lockhart and Margalla Hill formations of the Hazara division, Pakistan. Machine-learning algorithms (MLAs) have rarely been applied to multispectral remote sensing data to differentiate limestone formations produced by different depositional environments, such as oolitic and fossiliferous limestone. Unlike previous studies, which mostly report MLA-based lithological classification of rock types with different chemical compositions, this paper investigates the potential of MLAs for mapping subclasses within a single lithology, namely limestone. The selection of appropriate data labels, training algorithms, hyperparameters, and remote sensing data sources was also investigated. First, the oolitic (Samanasuk) and fossiliferous (Lockhart and Margalla) limestone-bearing formations, along with the adjoining Hazara formation, were mapped using random forest (RF), support vector machine (SVM), classification and regression tree (CART), and naïve Bayes (NB) MLAs. The RF algorithm achieved the best accuracy, 83.28%, with a Kappa coefficient of 0.78. To further improve the map of the targeted allochemical limestone formations, annotation labels were generated by fusing maps obtained from principal component analysis (PCA), decorrelation stretching (DS), and X-means clustering applied to ASTER-L1T, Landsat-8, and Sentinel-2 datasets. These labels were used to train and validate the SVM, CART, NB, and RF MLAs on the Google Earth Engine (GEE) platform, producing a binary classification map of limestone occurrences in the Hazara division, Pakistan. The CART classification of Landsat-8 data achieved 99.63% accuracy with a Kappa coefficient of 0.99 and was in good agreement with the field validation.
This binary limestone map was then classified into oolitic (Samanasuk) and fossiliferous (Lockhart and Margalla) formations by all four MLAs; in this case, RF outperformed the other algorithms with an improved accuracy of 96.36%. This improvement can be attributed to better annotation: the resulting binary limestone classification map served as a mask for the subsequent classification of oolitic and fossiliferous limestone in the area.
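The accuracy and Kappa values reported above are both derived from a classifier's confusion matrix. The following is a minimal sketch of that calculation, not the authors' implementation; the function name `accuracy_and_kappa` and the 2x2 matrix are hypothetical and are not data from this study.

```python
def accuracy_and_kappa(confusion):
    """Return (overall accuracy, Cohen's kappa) for a square confusion matrix.

    confusion[i][j] is the number of validation samples whose reference
    class is i and whose predicted class is j.
    """
    n = len(confusion)
    total = sum(sum(row) for row in confusion)

    # Observed agreement: fraction of samples on the diagonal.
    observed = sum(confusion[i][i] for i in range(n)) / total

    # Chance agreement: computed from the row and column marginals.
    row_totals = [sum(row) for row in confusion]
    col_totals = [sum(confusion[i][j] for i in range(n)) for j in range(n)]
    expected = sum(r * c for r, c in zip(row_totals, col_totals)) / total ** 2

    kappa = (observed - expected) / (1 - expected)
    return observed, kappa


# Hypothetical matrix for a binary limestone / non-limestone map
# (illustrative numbers only, not results from the paper).
example = [[45, 5],
           [10, 40]]
acc, kappa = accuracy_and_kappa(example)
print(f"accuracy = {acc:.2f}, kappa = {kappa:.2f}")
```

Kappa discounts the agreement expected by chance, which is why it can be noticeably lower than the overall accuracy when class proportions are unbalanced.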