Prediction of Water Saturation from Well Log Data by Machine Learning Algorithms: Boosting and Super Learner

Intelligent predictive methods have the power to reliably estimate water saturation (Sw) compared to conventional experimental methods commonly performed by petrphysicists. However, due to nonlinearity and uncertainty in the data set, the prediction might not be accurate. There exist new machine learning (ML) algorithms such as gradient boosting techniques that have shown significant success in other disciplines yet have not been examined for Sw prediction or other reservoir or rock properties in the petroleum industry. To bridge the literature gap, in this study, for the first time, a total of five ML code programs that belong to the family of Super Learner along with boosting algorithms: XGBoost, LightGBM, CatBoost, AdaBoost, are developed to predict water saturation without relying on the resistivity log data. This is important since conventional methods of water saturation prediction that rely on resistivity log can become problematic in particular formations such as shale or tight carbonates. Thus, to do so, two datasets were constructed by collecting several types of well logs (Gamma, density, neutron, sonic, PEF, and without PEF) to evaluate the robustness and accuracy of the models by comparing the results with laboratory-measured data. It was found that Super Learner and XGBoost produced the highest accurate output (R2: 0.999 and 0.993, respectively), and with considerable distance, Catboost and LightGBM were ranked third and fourth, respectively. Ultimately, both XGBoost and Super Learner produced negligible errors but the latest is considered as the best amongst all.

[1]  Sadegh Baziar,et al.  Prediction of water saturation in a tight gas sandstone reservoir by using four intelligent methods: a comparative study , 2016, Neural Computing and Applications.

[2]  Trevor Hastie,et al.  Multi-class AdaBoost ∗ , 2009 .

[3]  Yoav Freund,et al.  A decision-theoretic generalization of on-line learning and an application to boosting , 1997, EuroCOLT.

[4]  Miquel Sànchez-Marrè,et al.  A survey on pre-processing techniques: Relevant issues in the context of environmental data mining , 2016, AI Commun..

[5]  Meshal A. Al-Amri,et al.  Integrated petrophysical and reservoir characterization workflow to enhance permeability and water saturation prediction , 2015 .

[6]  Syamsiah Mashohor,et al.  Robust committee machine for water saturation prediction , 2013 .

[7]  Hamid Heydari Gholanlo,et al.  Estimation of water saturation by using radial based function artificial neural network in carbonate reservoir: A case study in Sarvak formation , 2016 .

[8]  Saheed Olawale Olayiwola,et al.  A cohesive approach at estimating water saturation in a low-resistivity pay carbonate reservoir and its validation , 2017, Journal of Petroleum Exploration and Production Technology.

[9]  Hossein Memarian,et al.  Estimation of water saturation from petrophysical logs using radial basis function neural network , 2013 .

[10]  M. Sahimi,et al.  Machine learning in geo- and environmental sciences: From small to large scale , 2020, Advances in Water Resources.

[11]  M. Masihi,et al.  Introducing a method for calculating water saturation in a carbonate gas reservoir , 2019, Journal of Natural Gas Science and Engineering.

[12]  G. E. Archie Electrical Resistivity an Aid in Core-Analysis Interpretation , 1947 .

[13]  Lior Rokach,et al.  Decision forest: Twenty years of research , 2016, Inf. Fusion.

[14]  Menghui H. Zhang,et al.  Evaluation of boosted regression trees (BRTs) and two‐step BRT procedures to model and predict blood‐brain barrier passage , 2007 .

[15]  Denis Orlov,et al.  Prediction of Porosity and Permeability Alteration Based on Machine Learning Algorithms , 2019, Transport in Porous Media.

[16]  G. E. Archie The electrical resistivity log as an aid in determining some reservoir characteristics , 1942 .

[17]  Abdulhamit Subasi,et al.  Permeability prediction of petroleum reservoirs using stochastic gradient boosting regression , 2020, Journal of Ambient Intelligence and Humanized Computing.

[18]  G. E. Archie,et al.  Classification of Carbonate Reservoir Rocks and Petrophysical Considerations , 1952 .

[19]  Yoav Freund,et al.  A Short Introduction to Boosting , 1999 .

[20]  Y. P. Kosta,et al.  Comprehensive Evolution and Evaluation of Boosting , 2010 .

[21]  Salim Ahmed,et al.  Log data-driven model and feature ranking for water saturation prediction using machine learning approach , 2020 .

[22]  Martin J. Blunt,et al.  Development of artificial neural network models for predicting water saturation and fluid distribution , 2009 .

[23]  Anna Veronika Dorogush,et al.  CatBoost: unbiased boosting with categorical features , 2017, NeurIPS.

[24]  J. Friedman Greedy function approximation: A gradient boosting machine. , 2001 .

[25]  G. E. Archie,et al.  Introduction to Petrophysics of Reservoir Rocks , 1950 .

[26]  O. González-Recio,et al.  The gradient boosting algorithm and random boosting for genome-assisted evaluation in large data sets. , 2013, Journal of dairy science.

[27]  W. Al-Mudhafar Integrating machine learning and data analytics for geostatistical characterization of clastic reservoirs , 2020 .

[28]  Ali Moradzadeh,et al.  Methods of water saturation estimation: Historical perspective , 2011 .

[29]  Licheng Zhang,et al.  Machine Learning in Rock Facies Classification: An Application of XGBoost , 2017 .

[30]  A. Mollajan Application of local linear neuro-fuzzy model in estimating reservoir water saturation from well logs , 2015, Arabian Journal of Geosciences.

[31]  Wei Shao,et al.  Carbonate Log Interpretation Models Based on Machine Learning Techniques , 2019 .

[32]  H. Fattahi,et al.  Prediction of porosity and water saturation using pre-stack seismic attributes: a comparison of Bayesian inversion and computational intelligence methods , 2016, Computational Geosciences.