Machine learning approach for risk-based inspection screening assessment

Abstract Risk-based inspection (RBI) screening assessment identifies equipment that contributes significantly to a system's total risk of failure (RoF), so that the detailed RBI assessment can focus on higher-risk equipment. Because it is qualitative and depends heavily on sound engineering judgment, screening assessment is vulnerable to human bias and error; its output is therefore subject to variability, which threatens the integrity of the assets. This paper addresses these challenges by using a machine learning approach to conduct screening assessment. A case study on a dataset of RBI assessments for oil and gas production and processing units illustrates the development of an intelligent system, based on a machine learning model, for performing RBI screening assessment. The best-performing model achieves an accuracy of 92.33% and a precision of 84.58%. The performance of the intelligent system is compared with that of the conventional assessment to examine the benefits of applying machine learning to RBI screening assessment. The results show that the machine learning approach can potentially improve the quality of conventional RBI screening assessment output by reducing output variability and increasing accuracy and precision.
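
The two metrics reported above, accuracy and precision, can be illustrated with a minimal sketch. It assumes a binary screening outcome with "high" risk as the positive class. The labels, the function names, and the six-item toy sample are hypothetical and are not taken from the paper's dataset or method.

```python
def confusion_counts(y_true, y_pred, positive="high"):
    """Count true/false positives and negatives for a binary screening outcome.

    Assumption: `positive` marks the higher-risk class; the paper does not
    state its label encoding.
    """
    tp = fp = tn = fn = 0
    for truth, pred in zip(y_true, y_pred):
        if pred == positive and truth == positive:
            tp += 1  # correctly flagged as higher risk
        elif pred == positive:
            fp += 1  # flagged as higher risk, but actually lower risk
        elif truth == positive:
            fn += 1  # missed higher-risk item
        else:
            tn += 1  # correctly screened out
    return tp, fp, tn, fn


def accuracy(tp, fp, tn, fn):
    """Share of all items that were classified correctly."""
    return (tp + tn) / (tp + fp + tn + fn)


def precision(tp, fp):
    """Share of items flagged as positive that really are positive."""
    return tp / (tp + fp) if (tp + fp) else 0.0


# Hypothetical toy sample, for illustration only.
y_true = ["high", "high", "low", "low", "low", "high"]
y_pred = ["high", "low", "low", "high", "low", "high"]

tp, fp, tn, fn = confusion_counts(y_true, y_pred)
print(f"accuracy  = {accuracy(tp, fp, tn, fn):.4f}")
print(f"precision = {precision(tp, fp):.4f}")
```

In a screening context, precision indicates how much of the equipment passed on to detailed assessment genuinely warrants it, while accuracy summarizes overall agreement with the reference classification.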
