Explaining and predicting workplace accidents using data-mining techniques

Current research into workplace risk is mainly conducted using conventional descriptive statistics, which, however, fail to properly identify cause-effect relationships and are unable to construct models that could predict accidents. The authors of the present study modelled incidents and accidents in two companies in the mining and construction sectors in order to identify the most important causes of accidents and develop predictive models. Data-mining techniques (decision rules, Bayesian networks, support vector machines and classification trees) were used to model accident and incident data compiled from the mining and construction sectors and obtained in interviews conducted soon after an incident/accident occurred. The results were compared with those for a classical statistical techniques (logistic regression), revealing the superiority of decision rules, classification trees and Bayesian networks in predicting and identifying the factors underlying accidents/incidents.

[1]  Stephen J. Wright,et al.  Numerical Optimization , 2018, Fundamental Statistical Inference.

[2]  Chia-Wen Liao,et al.  Data mining for occupational injuries in the Taiwan construction industry , 2008 .

[3]  Nir Friedman,et al.  Bayesian Network Classifiers , 1997, Machine Learning.

[4]  Robert C. Holte,et al.  Very Simple Classification Rules Perform Well on Most Commonly Used Datasets , 1993, Machine Learning.

[5]  R. Suganya,et al.  Data Mining Concepts and Techniques , 2010 .

[6]  Chia-Wen Liao,et al.  Discovery of unapparent association rules based on extracted probability , 2009, Decis. Support Syst..

[7]  Robert Tibshirani,et al.  The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2nd Edition , 2001, Springer Series in Statistics.

[8]  J. Ross Quinlan,et al.  Induction of Decision Trees , 1986, Machine Learning.

[9]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[10]  Gregory F. Cooper,et al.  A Bayesian Method for the Induction of Probabilistic Networks from Data , 1992 .

[11]  V. Karra Analysis of non-fatal and fatal injury rates for mine operator and contractor employees and the influence of work location. , 2005, Journal of safety research.

[12]  José M. Matías,et al.  A machine learning methodology for the analysis of workplace accidents , 2008, Int. J. Comput. Math..

[13]  Javier Taboada,et al.  A Bayesian network analysis of workplace accidents caused by falls from a height , 2009 .

[14]  Leo Breiman,et al.  Classification and Regression Trees , 1984 .

[15]  Aiko M. Hormann,et al.  Programs for Machine Learning. Part I , 1962, Inf. Control..

[16]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques, 3rd Edition , 1999 .

[17]  Alexander J. Smola,et al.  Learning with kernels , 1998 .

[18]  D Komljenovic,et al.  Analysis of fatalities and injuries involving mining equipment. , 2007, Journal of safety research.

[19]  George L. Germain,et al.  Practical loss control leadership , 1996 .

[20]  Ana María Gómez Subcontratación en el sector de la construcción , 2007 .

[21]  Finn V. Jensen,et al.  Bayesian Networks and Decision Graphs , 2001, Statistics for Engineering and Information Science.

[22]  Suphi Ural,et al.  Evaluation of occupational safety and health in surface mines , 2008 .

[23]  Eibe Frank,et al.  Logistic Model Trees , 2003, ECML.

[24]  Ian H. Witten,et al.  Generating Accurate Rule Sets Without Global Optimization , 1998, ICML.

[25]  Eibe Frank,et al.  Logistic Model Trees , 2003, Machine Learning.