Applications of machine learning methods for engineering risk assessment – A review

Abstract The purpose of this article is to present a structured review of publications utilizing machine learning methods to aid in engineering risk assessment. A keyword search is performed to retrieve relevant articles from the databases of Scopus and Engineering Village. The search results are filtered according to seven selection criteria. The filtering process resulted in the retrieval of one hundred and twenty-four relevant research articles. Statistics based on different categories from the citation database is presented. By reviewing the articles, additional categories, such as the type of machine learning algorithm used, the type of input source used, the type of industry targeted, the type of implementation, and the intended risk assessment phase are also determined. The findings show that the automotive industry is leading the adoption of machine learning algorithms for risk assessment. Artificial neural networks are the most applied machine learning method to aid in engineering risk assessment. Additional findings from the review process are also presented in this article.

[1]  Shaoquan Ni,et al.  Data mining on Chinese train accidents to derive associated rules , 2017 .

[2]  Jianping Zhang,et al.  Instance–Based Learning for Highway Accident Frequency Prediction , 1997 .

[3]  Liping Fu,et al.  Model-Based Versus Data-Driven Approach for Road Safety Analysis: Do More Data Help? , 2016 .

[4]  S. D. Robinson,et al.  Application of machine learning to mapping primary causal factors in self reported safety narratives , 2015 .

[5]  Moshe Ben-Akiva,et al.  Text analysis in incident duration prediction , 2013 .

[6]  Eugenio Oñate,et al.  An empirical comparison of machine learning techniques for dam behaviour modelling , 2015 .

[7]  Xiaowei Luo,et al.  Transfer learning and deep convolutional neural networks for safety guardrail detection in 2D images , 2018 .

[8]  Young-Jin Kim,et al.  An architecture for emergency event prediction using LSTM recurrent neural networks , 2018, Expert Syst. Appl..

[9]  P. J. García Nieto,et al.  Prediction of work-related accidents according to working conditions using support vector machines , 2011, Appl. Math. Comput..

[10]  Yang Miang Goh,et al.  Factors influencing unsafe behaviors: A supervised learning approach. , 2018, Accident; analysis and prevention.

[11]  Tong Liu,et al.  Research on Healthy Anomaly Detection Model Based on Deep Learning from Multiple Time-Series Physiological Signals , 2016, Sci. Program..

[12]  Rob Alexander,et al.  Supporting systems of systems hazard analysis using multi-agent simulation , 2013 .

[13]  Yan Liu,et al.  Machine learning-based methods for analyzing grade crossing safety , 2017, Cluster Computing.

[14]  Zhou Wei,et al.  Retracted: Cluster Analysis of Risk Factors from Near-Miss and Accident Reports in Tunneling Excavation , 2018, Journal of Construction Engineering and Management.

[15]  Maurizio Bevilacqua,et al.  DATA MINING FOR OCCUPATIONAL INJURY RISK: A CASE STUDY , 2010 .

[16]  George Bearfield,et al.  The case for IT transformation and big data for safety risk management on the GB railways , 2018 .

[17]  Yinhai Wang,et al.  Design and experiment verification of a novel analysis framework for recognition of driver injury patterns: From a multi-class classification perspective. , 2018, Accident; analysis and prevention.

[18]  Abdul Halim Ghazali,et al.  Ensemble machine-learning-based geospatial approach for flood risk assessment using multi-sensor remote-sensing data and GIS , 2017 .

[19]  Theodora Chaspari,et al.  Automated ergonomic risk monitoring using body-mounted sensors and machine learning , 2018, Adv. Eng. Informatics.

[20]  Halima Bahi,et al.  Data mining approach , 2012 .

[21]  Daeil Lee,et al.  Autonomous operation algorithm for safety systems of nuclear power plants by using long-short term memory and function-based hierarchical framework , 2018, Annals of Nuclear Energy.

[22]  Chengcheng Xu,et al.  Association rule analysis of factors contributing to extraordinarily severe traffic crashes in China. , 2018, Journal of safety research.

[23]  N. Gibbs The choice. , 2014, Time.

[24]  So Young Sohn,et al.  Data fusion, ensemble and clustering to improve the classification accuracy for the severity of road traffic accidents in Korea , 2003 .

[25]  Pijush Samui,et al.  The Use of a Relevance Vector Machine in Predicting Liquefaction Potential , 2014 .

[26]  Joaquim Agostinho Barbosa Tinoco,et al.  The Use of Data Mining Techniques in Rockburst Risk Assessment , 2017 .

[27]  Azim Eskandarian,et al.  Unobtrusive drowsiness detection by neural network learning of driver steering , 2001 .

[28]  V. Sugumaran,et al.  Fault diagnosis of automobile hydraulic brake system using statistical features and support vector machines , 2015 .

[29]  Benjamin Schrauwen,et al.  Defect Detection in Reinforced Concrete Using Random Neural Architectures , 2014, Comput. Aided Civ. Infrastructure Eng..

[30]  Jacek M. Zurada,et al.  A Data-Mining Approach to Identification of Risk Factors in Safety Management Systems , 2017, J. Manag. Inf. Syst..

[31]  Krishnendu Chakrabarty,et al.  Machine Learning for Hardware Security: Opportunities and Risks , 2018, Journal of Electronic Testing.

[32]  Javier Taboada,et al.  Explaining and predicting workplace accidents using data-mining techniques , 2011, Reliab. Eng. Syst. Saf..

[33]  Mohamed Abdel-Aty,et al.  Comparative analysis of multiple techniques for developing and transferring safety performance functions. , 2019, Accident; analysis and prevention.

[34]  Geert Wets,et al.  A hybrid system of neural networks and rough sets for road safety performance indicators , 2010, Soft Comput..

[35]  Marco Botta,et al.  Real-Time Detection System of Driver Distraction Using Machine Learning , 2013, IEEE Transactions on Intelligent Transportation Systems.

[36]  Jingjing Pei,et al.  Improving Workplace Hazard Identification Performance Using Data Mining , 2018 .

[37]  Mark R Lehto,et al.  Classifying injury narratives of large administrative databases for surveillance-A practical approach combining machine learning ensembles and human review. , 2017, Accident; analysis and prevention.

[38]  Majid Sarvi,et al.  Simulation of safety: a review of the state of the art in road safety simulation modelling. , 2014, Accident; analysis and prevention.

[39]  Tsung-Chih Wu,et al.  Applying data mining techniques to analyze the causes of major occupational accidents in the petrochemical industry , 2013 .

[40]  Mohamed Abdel-Aty,et al.  Aggregate nonparametric safety analysis of traffic zones. , 2012, Accident; analysis and prevention.

[41]  Michael D. Porter,et al.  How the choice of safety performance function affects the identification of important crash prediction variables. , 2016, Accident; analysis and prevention.

[42]  Qin Zhang,et al.  On intelligent risk analysis and critical decision of underwater robotic vehicle , 2017 .

[43]  Wonil Kim,et al.  Artificial Intelligence Techniques in Game Contents , 2006 .

[44]  Madhar Taamneh,et al.  Data-mining techniques for traffic accident modeling and prediction in the United Arab Emirates , 2017 .

[45]  Marcello Sanguineti,et al.  Supervised and semi-supervised classifiers for the detection of flood-prone areas , 2017, Soft Comput..

[46]  Valeria V. Krzhizhanovskaya,et al.  Machine learning methods for environmental monitoring and flood protection , 2011 .

[47]  Denise Gravitt,et al.  Construction Safety , 2013 .

[48]  Gang Tao,et al.  A traffic accident morphology diagnostic model based on a rough set decision tree , 2016 .

[49]  Ludovic Tanguy,et al.  Natural language processing for aviation safety reports: From classification to interactive analysis , 2016, Comput. Ind..

[50]  V. Sugumaran,et al.  Safety analysis on a vibrating prismatic body: A data-mining approach , 2009, Expert Syst. Appl..

[51]  Lieyun Ding,et al.  Development of web-based system for safety risk early warning in urban metro construction , 2013 .

[52]  Jeremy M. Gernand Evaluating the Effectiveness of Mine Safety Enforcement Actions in Forecasting the Lost-Days Rate at Specific Worksites , 2016 .

[53]  Marvin Rausand,et al.  Risk Assessment: Theory, Methods, and Applications , 2011 .

[54]  Xiugang Li,et al.  Predicting motor vehicle crashes using Support Vector Machine models. , 2008, Accident; analysis and prevention.

[55]  S. Appavu alias Balamurugan,et al.  Prediction of warning level in aircraft accidents using data mining techniques , 2014, The Aeronautical Journal (1968).

[56]  Buyue Qian,et al.  Improving rail network velocity: A machine learning approach to predictive maintenance , 2014 .

[57]  Chuan Ding,et al.  A gradient boosting logit model to investigate driver’s stop-or-run behavior at signalized intersections using high-resolution traffic data , 2016 .

[58]  Afshin Shariat Mohaymany,et al.  Analysis of the traffic injury severity on two-lane, two-way rural roads based on classification tree models , 2011 .

[59]  Iman Janghorban Esfahani,et al.  Localized indoor air quality monitoring for indoor pollutants’ healthy risk assessment using sub-principal component analysis driven model and engineering big data , 2015, Korean Journal of Chemical Engineering.

[60]  Yunsick Sung,et al.  Apriori-based text mining method for the advancement of the transportation management plan in expressway work zones , 2017, The Journal of Supercomputing.

[61]  Xuedong Yan,et al.  In-depth analysis of drivers' merging behavior and rear-end crash risks in work zone merging areas. , 2015, Accident; analysis and prevention.

[62]  Priyanka Alluri,et al.  A random forests approach to prioritize Highway Safety Manual (HSM) variables for data collection , 2016 .

[63]  Dieu Tien Bui,et al.  Hybrid integration of Multilayer Perceptron Neural Networks and machine learning ensembles for landslide susceptibility assessment at Himalayan area (India) using GIS , 2017 .

[64]  Dimitri N. Mavris,et al.  Anomaly Detection in General-Aviation Operations Using Energy Metrics and Flight-Data Records , 2018 .

[65]  Chao Wu,et al.  Methodologies, principles and prospects of applying big data in safety science research , 2018 .

[66]  Cheng Zhou,et al.  Intelligent Approach Based on Random Forest for Safety Risk Prediction of Deep Foundation Pit in Subway Stations , 2019, J. Comput. Civ. Eng..

[67]  Wes McKinney,et al.  Data Structures for Statistical Computing in Python , 2010, SciPy.

[68]  John D. Lee,et al.  Real-Time Detection of Driver Cognitive Distraction Using Support Vector Machines , 2007, IEEE Transactions on Intelligent Transportation Systems.

[69]  Enrico Zio,et al.  Predicting component reliability and level of degradation with complex-valued neural networks , 2014, Reliab. Eng. Syst. Saf..

[70]  Enrico Zio,et al.  A support vector machine integrated system for the classification of operation anomalies in nuclear components and systems , 2007, Reliab. Eng. Syst. Saf..

[71]  Guofa Li,et al.  Decision Tree-Based Maneuver Prediction for Driver Rear-End Risk-Avoidance Behaviors in Cut-In Scenarios , 2017 .

[72]  Rui Zhang,et al.  Real-time transient stability assessment model using extreme learning machine , 2011 .

[73]  Edoardo Patelli,et al.  Learning from major accidents: Graphical representation and analysis of multi-attribute events to enhance risk communication , 2017 .

[74]  Eugenio Oñate,et al.  Early detection of anomalies in dam performance: A methodology based on boosted regression trees , 2017 .

[75]  Matthew R. Hallowell,et al.  Automated content analysis for construction safety: A natural language processing system to extract precursors and outcomes from unstructured injury reports , 2016 .

[76]  Huijun Gao,et al.  Data-Based Techniques Focused on Modern Industry: An Overview , 2015, IEEE Transactions on Industrial Electronics.

[77]  Hao Wu,et al.  An intelligent vision-based approach for helmet identification for work safety , 2018, Comput. Ind..

[78]  Andrea Alfonsi,et al.  Mining data in a dynamic PRA framework , 2018, Progress in Nuclear Energy.

[79]  Matteo Vagnoli,et al.  An ensemble-based change-point detection method for identifying unexpected behaviour of railway tunnel infrastructures , 2018, Tunnelling and Underground Space Technology.

[80]  TanguyLudovic,et al.  Natural language processing for aviation safety reports , 2016 .

[81]  Ahmad Mirabadi,et al.  Application of Association Rules in Iranian Railways (RAI) Accident Data Analysis , 2010 .

[82]  Billur Barshan,et al.  Detecting Falls with Wearable Sensors Using Machine Learning Techniques , 2014, Sensors.

[83]  Konstantina Gkritza,et al.  Time series modeling in traffic safety research. , 2018, Accident; analysis and prevention.

[84]  Matthew R. Hallowell,et al.  Construction Safety Clash Detection: Identifying Safety Incompatibilities among Fundamental Attributes using Data Mining , 2017 .

[85]  Fuqiang Zhou,et al.  Automated visual inspection of target parts for train safety based on deep learning , 2018 .

[86]  Luis A. Curiel-Ramirez,et al.  Towards of a modular framework for semi-autonomous driving assistance systems , 2019 .

[87]  Geoffrey E. Hinton,et al.  Deep Learning , 2015, Nature.

[88]  Ian D. Gates,et al.  A support vector machine algorithm to classify lithofacies and model permeability in heterogeneous reservoirs , 2010 .

[89]  Yong K. Cho,et al.  Data-Driven Monitoring System for Preventing the Collapse of Scaffolding Structures , 2018, Journal of Construction Engineering and Management.

[90]  Hua Hu,et al.  The safety management of urban rail transit based on operation fault log , 2017 .

[91]  Liang Wang,et al.  Machine Vision to Alert Roadside Personnel of Night Traffic Threats , 2018, IEEE Transactions on Intelligent Transportation Systems.

[92]  Bart De Schutter,et al.  A Big Data Analysis Approach for Rail Failure Risk Assessment , 2017, Risk analysis : an official publication of the Society for Risk Analysis.

[93]  Jui-Sheng Chou,et al.  The use of artificial intelligence combiners for modeling steel pitting risk and corrosion rate , 2017, Eng. Appl. Artif. Intell..

[94]  J V Tu,et al.  Advantages and disadvantages of using artificial neural networks versus logistic regression for predicting medical outcomes. , 1996, Journal of clinical epidemiology.

[95]  T.-L. Lee,et al.  Support vector regression methodology for storm surge predictions , 2008 .

[96]  Li-Yen Chang,et al.  Data mining of tree-based models to analyze freeway accident frequency. , 2005, Journal of safety research.

[97]  Zhu-feng Yue,et al.  Support vector machine for structural reliability analysis , 2006 .

[98]  Yuanyuan Jiang,et al.  A Novel Fault Diagnostic Approach for DC-DC Converters Based on CSA-DBN , 2018, IEEE Access.

[99]  Laurent El Ghaoui,et al.  Understanding large text corpora via sparse machine learning , 2013, Stat. Anal. Data Min..

[100]  Peter Jarvis,et al.  Rapid gravity filtration operational performance assessment and diagnosis for preventative maintenance from on-line data. , 2017 .

[101]  Feng Chen,et al.  Analysis of hourly crash likelihood using unbalanced panel data mixed logit model and real-time driving environmental big data. , 2018, Journal of safety research.

[102]  Yihong Gong,et al.  Driving Safety Monitoring Using Semisupervised Learning on Time Series Data , 2010, IEEE Transactions on Intelligent Transportation Systems.

[103]  Digvijay S. Pawar,et al.  Classification of Gaps at Uncontrolled Intersections and Midblock Crossings Using Support Vector Machines , 2015 .

[104]  Wei Wang,et al.  Using support vector machine models for crash injury severity analysis. , 2012, Accident; analysis and prevention.

[105]  Ella M. Atkins,et al.  Road Risk Modeling and Cloud-Aided Safety-Based Route Planning , 2016, IEEE Transactions on Cybernetics.

[106]  Jooyoung Park,et al.  Prediction of Driver’s Intention of Lane Change by Augmenting Sensor Information Using Machine Learning Techniques , 2017, Sensors.

[107]  Drago Špoljarić,et al.  Method for prediction of landslide movements based on random forests , 2017, Landslides.

[108]  Ercan Erdis,et al.  Decision tree analysis of construction fall accidents involving roofers , 2015, Expert Syst. Appl..

[109]  Young Jin Kim,et al.  Data mining on road safety: factor assessment on vehicle accidents using classification models , 2016 .

[110]  Chao Wu,et al.  A new paradigm for accident investigation and analysis in the era of big data , 2018 .

[111]  H. Burton,et al.  A machine learning framework for assessing post-earthquake structural safety , 2018 .

[112]  Donald E. Brown,et al.  Text Mining the Contributors to Rail Accidents , 2016, IEEE Transactions on Intelligent Transportation Systems.

[113]  Calin I. Anghel,et al.  Risk assessment for pipelines with active defects based on artificial intelligence methods , 2009 .

[114]  Mohammad Mehdi Besharati,et al.  A data mining approach to investigate the factors influencing the crash severity of motorcycle pillion passengers. , 2014, Journal of safety research.

[115]  Matthew R. Hallowell,et al.  Application of machine learning to construction injury prediction , 2016 .

[116]  Wan Li,et al.  Railway Traffic Accident Forecast Based on an Optimized Deep Auto-encoder , 2018 .

[117]  Naohito Takasuka,et al.  Road Surface Recognition Using Laser Radar for Automatic Platooning , 2016, IEEE Transactions on Intelligent Transportation Systems.

[118]  Sabrina Jocelyn,et al.  Estimation of probability of harm in safety of machinery using an investigation systemic approach and Logical Analysis of Data , 2018, Safety Science.

[119]  Gozde Bozdagi Akar,et al.  Driver aggressiveness detection via multisensory data fusion , 2016, EURASIP J. Image Video Process..

[120]  Henry Leung,et al.  Data fusion in intelligent transportation systems: Progress and challenges - A survey , 2011, Inf. Fusion.

[121]  Madjid Khalilian,et al.  Derailment accident risk assessment based on ensemble classification method , 2017, Safety Science.

[122]  李洪双,et al.  SUPPORT VECTOR MACHINE FOR STRUCTURAL RELIABILITY ANALYSIS , 2006 .

[123]  Zhe Xiao,et al.  Maritime Traffic Probabilistic Forecasting Based on Vessels’ Waterway Patterns and Motion Behaviors , 2017, IEEE Transactions on Intelligent Transportation Systems.

[124]  Oleg E. Bukharov,et al.  Development of a decision support system based on neural networks and a genetic algorithm , 2015, Expert Syst. Appl..

[125]  A. C. Cilliers,et al.  Continuous machine learning for abnormality identification to aid condition-based maintenance in nuclear power plant , 2018, Annals of Nuclear Energy.

[126]  Weifang Shi,et al.  Application of k-means clustering to environmental risk zoning of the chemical industrial area , 2014, Frontiers of Environmental Science & Engineering.

[127]  Behrouz Minaei-Bidgoli,et al.  A new scoring system for assessing the risk of occupational accidents: A case study using data mining techniques with Iran's Ministry of Labor data , 2014 .

[128]  Mashrur Chowdhury,et al.  Real-Time Highway Traffic Condition Assessment Framework Using Vehicle–Infrastructure Integration (VII) With Artificial Intelligence (AI) , 2009, IEEE Transactions on Intelligent Transportation Systems.

[129]  John Magee,et al.  The Method of Prediction , 2000 .

[130]  Xianmin Wang,et al.  Spatial Forecast of Landslides in Three Gorges Based On Spatial Data Mining , 2009, Sensors.

[131]  Ingemar J. Cox,et al.  Autonomous Robot Vehicles , 1990, Springer New York.

[132]  Hing Kai Chan,et al.  Recent Development in Big Data Analytics for Business Operations and Risk Management , 2017, IEEE Transactions on Cybernetics.

[133]  Thomas G. Dietterich What is machine learning? , 2020, Archives of Disease in Childhood.

[134]  Yassine Ruichek,et al.  A Video-Analysis-Based Railway–Road Safety System for Detecting Hazard Situations at Level Crossings , 2015, IEEE Transactions on Intelligent Transportation Systems.

[135]  Zahid Halim,et al.  Artificial intelligence techniques for driving safety and vehicle crash prediction , 2016, Artificial Intelligence Review.

[136]  Mohamed Abdel-Aty,et al.  A new approach for calibrating safety performance functions. , 2018, Accident; analysis and prevention.

[137]  M. Marjanović,et al.  Landslide susceptibility assessment using SVM machine learning algorithm , 2011 .

[138]  Rob M.P. Goverde,et al.  Recent applications of big data analytics in railway transportation systems: A survey , 2018 .

[139]  Wonjong Rhee,et al.  Application of classification algorithms for analysis of road safety risk factor dependencies. , 2015, Accident; analysis and prevention.

[140]  Maria Riveiro,et al.  A novel analytic framework of real-time multi-vessel collision risk assessment for maritime traffic surveillance , 2017 .

[141]  Zhibin Lin,et al.  Data-driven support vector machine with optimization techniques for structural health monitoring and damage detection , 2017, KSCE Journal of Civil Engineering.