Identifying channel sand-body from multiple seismic attributes with an improved random forest algorithm

Abstract Machine learning provides numerous data-driven tools for automatic pattern recognition. Even though various algorithms such as neural networks and support vector machines have been widely applied, it is still necessary to explore new paradigms and algorithms to improve the machine learning assisted seismic interpretation. Random Forest (RF) is a widely used ensemble algorithm, however, only limited studies of random forest in the seismic application were published. In this article, the methodology of random forest is introduced systematically. Meanwhile, to solve the problem of hyper-parameter determination, we propose an improved algorithm named Pruning Random Forest (PRF). To reveal the advantages of PRF in terms of predictive performance, robustness, and feature selection compared with support vector machine, neural network, and decision tree, several well-designed experiments are executed based on the seismic data of the western Bohai Sea of China. The potential and advantages of random forest in the present case are confirmed by various experiments, which substantiates that the proposed pruning random forest algorithm provides a reliable alternative way for further machine learning assisted seismic interpretation.

[1]  J. Harris,et al.  Comparison of the Data-Driven Random Forests Model and a Knowledge-Driven Method for Mineral Prospectivity Mapping: A Case Study for Gold Deposits Around the Huritz Group and Nueltin Suite, Nunavut, Canada , 2016, Natural Resources Research.

[2]  M. Wiesmeier,et al.  Digital mapping of soil organic matter stocks using Random Forest modeling in a semi-arid steppe ecosystem , 2011, Plant and Soil.

[3]  V. Rodriguez-Galiano,et al.  Machine learning predictive models for mineral prospectivity: an evaluation of neural networks, random forest, regression trees and support vector machines , 2015 .

[4]  Pierre Rochette,et al.  The effect of hydrostatic pressure up to 1.61 GPa on the Morin transition of hematite‐bearing rocks: Implications for planetary crustal magnetization , 2015 .

[5]  Matthew J. Cracknell,et al.  Lithologic mapping using Random Forests applied to geophysical and remote-sensing data: A demonstration study from the Eastern Goldfields of Australia , 2018, GEOPHYSICS.

[6]  Luc Van Gool,et al.  Random Forests for Real Time 3D Face Analysis , 2012, International Journal of Computer Vision.

[7]  Haibin Di,et al.  Seismic Multi-attribute Classification for Salt Boundary Detection - A Comparison , 2017 .

[8]  Vera Louise Hauge,et al.  Machine Learning Methods for Sweet Spot Detection: A Case Study , 2017 .

[9]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[10]  Victor F. Rodriguez-Galiano,et al.  Predictive modelling of gold potential with the integration of multisource information based on random forest: a case study on the Rodalquilar area, Southern Spain , 2014, Int. J. Geogr. Inf. Sci..

[11]  Johannes R. Sveinsson,et al.  Random Forests for land cover classification , 2006, Pattern Recognit. Lett..

[12]  Hideitsu Hino,et al.  Geochemical Discrimination and Characteristics of Magmatic Tectonic Settings: A Machine‐Learning‐Based Approach , 2017, 1712.09016.

[13]  Emmanuel John M. Carranza,et al.  Random forest predictive modeling of mineral prospectivity with small number of prospects and data with missing values in Abra (Philippines) , 2015, Comput. Geosci..

[14]  Mary M. Poulton,et al.  Neural networks as an intelligence amplification tool: A review of applications , 2002 .

[15]  Yoshua Bengio,et al.  Random Search for Hyper-Parameter Optimization , 2012, J. Mach. Learn. Res..

[16]  Tin Kam Ho,et al.  The Random Subspace Method for Constructing Decision Forests , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[17]  D. R. Cutler,et al.  Utah State University From the SelectedWorks of , 2017 .

[18]  Vikram Jayaram,et al.  A comparison of classification techniques for seismic facies recognition , 2015 .

[19]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[20]  Amin Gholami,et al.  Estimation of porosity from seismic attributes using a committee model with bat-inspired optimization algorithm , 2017 .

[21]  Tomaso Poggio,et al.  Automated fault detection without seismic processing , 2017 .

[22]  Denes Vigh,et al.  Deep learning prior models from seismic images for full-waveform inversion , 2017 .

[23]  Seyed Amir Naghibi,et al.  A Comparative Assessment Between Three Machine Learning Models and Their Performance Comparison by Bivariate and Multivariate Statistical Methods in Groundwater Potential Mapping , 2015, Water Resources Management.

[24]  J. Ross Quinlan,et al.  Induction of Decision Trees , 1986, Machine Learning.

[25]  Eric C. Grunsky,et al.  Predictive lithological mapping of Canada's North using Random Forest classification applied to geophysical and geochemical data , 2015, Comput. Geosci..

[26]  Peter Tiño,et al.  Managing Diversity in Regression Ensembles , 2005, J. Mach. Learn. Res..

[27]  X. Chen,et al.  Random forests for genomic data analysis. , 2012, Genomics.

[28]  Srikanta Mishra,et al.  Applications of machine learning for facies and fracture prediction using Bayesian Network Theory and Random Forest: Case studies from the Appalachian basin, USA , 2018, Journal of Petroleum Science and Engineering.

[29]  Masoud Nikravesh,et al.  Past, present and future intelligent reservoir characterization trends , 2001 .

[30]  Kurt J. Marfurt,et al.  Seismic Attributes for Prospect Identification and Reservoir Characterization , 2007 .

[31]  Margaret G. Schmidt,et al.  Predictive soil parent material mapping at a regional-scale: a Random Forest approach. , 2014 .

[32]  Kevin P. Dorrington,et al.  Genetic‐algorithm/neural‐network approach to seismic attribute selection for well‐log prediction , 2004 .

[33]  Mohammad Ali Riahi,et al.  Estimation of Reservoir Porosity and Water Saturation Based on Seismic Attributes Using Support Vector Regression Approach , 2014 .

[34]  G. V. Kass An Exploratory Technique for Investigating Large Quantities of Categorical Data , 1980 .

[35]  Timothy R. Carr,et al.  Comparison of supervised and unsupervised approaches for mudstone lithofacies classification: Case studies from the Bakken and Mahantango-Marcellus Shale, USA , 2016 .

[36]  Wen Zhou,et al.  Evaluation of machine learning methods for formation lithology identification: A comparison of tuning processes and model performances , 2018 .

[37]  Michael Sawada,et al.  A comparison of classification algorithms using Landsat-7 and Landsat-8 data for mapping lithology in Canada’s Arctic , 2015 .

[38]  Ning Shi,et al.  Semi-supervised least squares support vector machine algorithm: application to offshore oil reservoir , 2016, Applied Geophysics.

[39]  Warren T. Wood,et al.  A global prediction of seafloor sediment porosity using machine learning , 2015 .

[40]  Jane Labadin,et al.  Applied Soft Computing , 2014 .

[41]  Ursula Iturrarán-Viveros,et al.  Artificial Neural Networks applied to estimate permeability, porosity and intrinsic attenuation using seismic attributes and well-log data , 2014 .

[42]  Timothy R. Carr,et al.  Marcellus Shale Lithofacies Prediction by Multiclass Neural Network Classification in the Appalachian Basin , 2012, Mathematical Geosciences.

[43]  Aurobinda Routray,et al.  A Novel Preprocessing Scheme to Improve the Prediction of Sand Fraction From Seismic Attributes Using Neural Networks , 2015, IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing.

[44]  Sadegh Baziar,et al.  Prediction of water saturation in a tight gas sandstone reservoir by using four intelligent methods: a comparative study , 2016, Neural Computing and Applications.

[45]  Naonori Ueda,et al.  Generalization error of ensemble estimators , 1996, Proceedings of International Conference on Neural Networks (ICNN'96).

[46]  Timothy R. Carr,et al.  Methodology of organic-rich shale lithofacies identification and prediction: A case study from Marcellus Shale in the Appalachian basin , 2012, Comput. Geosci..

[47]  Mohammad Ali Riahi,et al.  Multi attribute transform and neural network in porosity estimation of an offshore oil field — A case study , 2011 .

[48]  Aurobinda Routray,et al.  Quantification of sand fraction from seismic attributes using Neuro-Fuzzy approach , 2014, ArXiv.

[49]  M. B. Widess HOW THIN IS A THIN BED , 1973 .

[50]  Kevin Leyton-Brown,et al.  Auto-WEKA: combined selection and hyperparameter optimization of classification algorithms , 2012, KDD.

[51]  Arthur E. Barnes,et al.  Handbook of Poststack Seismic Attributes , 2016 .

[52]  Masoud Nikravesh,et al.  Soft computing-based computational intelligent for reservoir characterization , 2004, Expert Syst. Appl..

[53]  Vikram Jayaram,et al.  Lithofacies classification in Barnett Shale using proximal support vector machines , 2014 .

[54]  André Carlos Ponce de Leon Ferreira de Carvalho,et al.  Evolutionary tuning of SVM parameter values in multiclass problems , 2008, Neurocomputing.

[55]  H. Pourghasemi,et al.  Application of GIS-based data driven random forest and maximum entropy models for groundwater potential mapping: A case study at Mehran Region, Iran , 2016 .

[56]  H. Elsenbeer,et al.  Soil organic carbon concentrations and stocks on Barro Colorado Island — Digital soil mapping using Random Forests analysis , 2008 .

[57]  Chaofeng Li,et al.  Identifying organic-rich Marcellus Shale lithofacies by support vector machine classifier in the Appalachian basin , 2014, Comput. Geosci..

[58]  Jane Labadin,et al.  Predicting Petroleum Reservoir Properties from Downhole Sensor Data using an Ensemble Model of Neural Networks , 2013, MLSDA '13.

[59]  Matthew J. Cracknell,et al.  Geological mapping using remote sensing data: A comparison of five machine learning algorithms, their response to variations in the spatial distribution of training data and the use of explicit spatial information , 2014, Comput. Geosci..

[60]  Mahesh Pal,et al.  Random forest classifier for remote sensing classification , 2005 .

[61]  Pierre Geurts,et al.  Extremely randomized trees , 2006, Machine Learning.

[62]  B. Huwe,et al.  Uncertainty in the spatial prediction of soil texture: Comparison of regression tree and Random Forest models , 2012 .

[63]  Anil K. Jain,et al.  Bootstrap Techniques for Error Estimation , 1987, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[64]  E. Carranza,et al.  Data-driven predictive mapping of gold prospectivity, Baguio district, Philippines: Application of Random Forests algorithm , 2015 .

[65]  Zhi-Hua Zhou,et al.  Ensemble Methods: Foundations and Algorithms , 2012 .