An Approach for the Classification of Rock Types Using Machine Learning of Core and Log Data

Classifying rocks based on core data is the most common method used by geologists. However, due to factors such as drilling costs, it is impossible to obtain core samples from all wells, which poses challenges for the accurate identification of rocks. In this study, the authors demonstrated the application of an explainable machine-learning workflow using core and log data to identify rock types. The rock type is determined utilizing the flow zone index (FZI) method using core data first, and then based on the collection, collation, and cleaning of well log data, four supervised learning techniques were used to correlate well log data with rock types, and learning and prediction models were constructed. The optimal machine learning algorithm for the classification of rocks is selected based on a 10-fold cross-test and a comparison of AUC (area under curve) values. The accuracy rate of the results indicates that the proposed method can greatly improve the accuracy of the classification of rocks. SHapley Additive exPlanations (SHAP) was used to rank the importance of the various well logs used as input variables for the prediction of rock types and provides both local and global sensitivities, enabling the interpretation of prediction models and solving the “black box” problem with associated machine learning algorithms. The results of this study demonstrated that the proposed method can reliably predict rock types based on well log data and can solve hard problems in geological research. Furthermore, the method can provide consistent well log interpretation arising from the lack of core data while providing a powerful tool for well trajectory optimization. Finally, the system can aid with the selection of intervals to be completed and/or perforated.

[1]  Fengming Jin,et al.  Numerical study of fracture dynamics in different shale fabric facies by integrating machine learning and 3-D lattice method: A case from Cangdong Sag, Bohai Bay basin, China , 2022, Journal of Petroleum Science and Engineering.

[2]  Fariha Sohil,et al.  An introduction to statistical learning with applications in R , 2021, Statistical Theory and Related Fields.

[3]  D. Weitz,et al.  A New Ensemble Machine-Learning Framework for Searching Sweet Spots in Shale Reservoirs , 2020, SPE Journal.

[4]  Matthew J. Cracknell,et al.  Lithological mapping in the Central African Copper Belt using Random Forests and clustering: Strategies for optimised results , 2019, Ore Geology Reviews.

[5]  Márcio P. de Albuquerque,et al.  A deep residual convolutional neural network for automatic lithological facies identification in Brazilian pre-salt oilfield wellbore image logs , 2019, Journal of Petroleum Science and Engineering.

[6]  Y. Nishitsuji,et al.  Elastic impedance based facies classification using support vector machine and deep learning , 2018, Geophysical Prospecting.

[7]  Ping Zhang,et al.  Feature selection considering two types of feature relevancy and feature interdependency , 2018, Expert Syst. Appl..

[8]  Scott Lundberg,et al.  A Unified Approach to Interpreting Model Predictions , 2017, NIPS.

[9]  Brendon Hall,et al.  Facies classification using machine learning , 2016 .

[10]  T. Hastie,et al.  Neural networks and Deep Learning , 2016, Machine Learning Guide for Oil and Gas Using Python.

[11]  川野 秀一 An Introduction to Statistical Learning (with Applications in R), Gareth James,Daniela Witten,Trevor Hastie and Robert Tibshirani著, Springer, 2013年8月, 430pp., 価格 59.99〓, ISBN 978-1-4614-7137-0 , 2014 .

[12]  Trevor Hastie,et al.  An Introduction to Statistical Learning , 2013, Springer Texts in Statistics.

[13]  Max Kuhn,et al.  Building Predictive Models in R Using the caret Package , 2008 .

[14]  Tom Fawcett,et al.  An introduction to ROC analysis , 2006, Pattern Recognit. Lett..

[15]  N. Altman An Introduction to Kernel and Nearest-Neighbor Nonparametric Regression , 1992 .

[16]  E. Pittman Relationship of porosity and permeability to various parameters derived from mercury injection-capillary pressure curves for sandstone , 1992 .

[17]  Josef Schmee,et al.  Outliers in Statistical Data (2nd ed.) , 1986 .

[18]  J. Wyatt,et al.  Study , 2022, Evaluation Methods in Biomedical and Health Informatics.

[19]  Juliet Wu A LITHOLOGY IDENTIFICATION METHOD IN ES4 RESERVOIR OF XIN 176 BLOCK WITH BAYES STEPWISE DISCRIMINANT METHOD , 2012 .

[20]  L. Breiman Random Forests , 2001, Machine Learning.

[21]  G. Ulmishek Petroleum geology and resources of the Amu-Darya basin, Turkmenistan, Uzbekistan, Afghanistan, and Iran , 2004 .

[22]  Yoshua Bengio,et al.  Gradient-based learning applied to document recognition , 1998, Proc. IEEE.

[23]  L. Breiman Arcing the edge , 1997 .

[24]  Djebbar Tiab,et al.  Enhanced Reservoir Description: Using Core and Log Data to Identify Hydraulic (Flow) Units and Predict Permeability in Uncored Intervals/Wells , 1993 .

[25]  Stanley Kolodzie,et al.  Analysis Of Pore Throat Size And Use Of The Waxman-Smits Equation To Determine Ooip In Spindle Field, Colorado , 1980 .