DATA MINING FOR OCCUPATIONAL INJURY RISK: A CASE STUDY

The management of occupational injury is of strategic importance in a company from the organizational, engineering and economic point of view. This work is an attempt to apply data mining techniques to data regarding accidents in a medium-sized refinery. Several techniques were adopted in order to identify the important relationships between risk level and immediate/root causes and corrective actions. As a Data Mining technique were tested: Negative Binomial Regression (NBR), Chi-Squared Automatic Interaction Detection (CHAID); Exhaustive CHAID; Classification And Regression Trees (CART); Quick, Unbiased, Efficient Statistical Tree (QUEST), Artificial Neural Network (ANN) and Neuro-Fuzzy Systems (FIS). The comparison carried out in this study shows through a real application the flexibility and advantages of using the neuro-fuzzy network, a typical soft computing tool. Using these innovative techniques to analyse injury data this study aims to: • obtain a classification of input data according to their importance and/or influence on the risk level in injuries; • assess how a variation in one or more pieces of input data can effect occupational injury and subsequently carry out a sensitivity analysis concerning the probability, the consequences and risks of the injurie events; The analyses carried out indicated important relationships between the variables, providing useful decision-making rules which can be followed when adopting measures for improvement.

[1]  Marcello Braglia,et al.  The classification and regression tree approach to pump failure rate analysis , 2003, Reliab. Eng. Syst. Saf..

[2]  Kurt Hornik,et al.  Universal approximation of an unknown mapping and its derivatives using multilayer feedforward networks , 1990, Neural Networks.

[3]  Selwyn Piramuthu,et al.  Financial credit-risk evaluation with neural and neurofuzzy systems , 1999, Eur. J. Oper. Res..

[4]  Norberto Piccinini,et al.  Risk analysis as a basis for safety management system , 2004 .

[5]  Maurizio Faccio,et al.  CLASSIFICATION OF OCCUPATIONAL INJURY CASES USING THE REGRESSION TREE APPROACH , 2006 .

[6]  Tore J. Larsson,et al.  The distribution of occupational injury risks in the State of Victoria , 2002 .

[7]  Allan Pinkus,et al.  Multilayer Feedforward Networks with a Non-Polynomial Activation Function Can Approximate Any Function , 1991, Neural Networks.

[8]  Saifur Rahman,et al.  Fuzzy and neuro-fuzzy computing models for electric load forecasting , 1995 .

[9]  Li-Yen Chang,et al.  Analysis of freeway accident frequencies: Negative binomial regression versus artificial neural network , 2005 .

[10]  Aarnout Brombacher,et al.  Safety lifecycle management. A flowchart presentation of the IEC 61508 overall safety lifecycle model , 1999 .

[11]  Simo Salminen,et al.  Relationships between injuries at work and during leisure time. , 2005, Accident; analysis and prevention.

[12]  P Macaskill,et al.  Global burden of disease and injury due to occupational factors. , 1999, Epidemiology.

[13]  David Biggs,et al.  A method of choosing multiway partitions for classification and decision trees , 1991 .

[14]  M K Lindell,et al.  Occupational safety and health inspection scores predict rates of workers' lost-time injuries. , 1997, Accident; analysis and prevention.

[15]  E. Emmett,et al.  Risk factors for occupational hand injury. , 1986, Journal of occupational medicine. : official publication of the Industrial Medical Association.

[16]  Kurt Hornik,et al.  Approximation capabilities of multilayer feedforward networks , 1991, Neural Networks.

[17]  John P. Cotnam,et al.  Epidemiology of occupational acute traumatic hand injuries: a literature review , 2001 .

[18]  J A Waller,et al.  Treated injuries in northern Vermont. , 1995, Accident; analysis and prevention.

[19]  Li-Yen Chang,et al.  Data mining of tree-based models to analyze freeway accident frequency. , 2005, Journal of safety research.

[20]  Ajith Abraham,et al.  Feature deduction and ensemble design of intrusion detection systems , 2005, Comput. Secur..

[21]  Elyce A Biddle,et al.  Comparison of two fatal occupational injury surveillance systems in the United States. , 2002, Journal of safety research.

[22]  Kui Wang,et al.  A bivariate zero-inflated Poisson regression model to analyze occupational injuries. , 2003, Accident; analysis and prevention.

[23]  S. Béjean,et al.  Prevention of occupational injuries: moral hazard and complex agency relationships , 2004 .

[24]  L Mussone,et al.  An analysis of urban collisions using an artificial intelligence model. , 1999, Accident; analysis and prevention.

[25]  Andy H. Lee,et al.  Modeling zero-inflated count series with application to occupational health , 2004, Comput. Methods Programs Biomed..

[26]  Tian-Shyug Lee,et al.  Mining the customer credit using classification and regression tree and multivariate adaptive regression splines , 2006, Comput. Stat. Data Anal..

[27]  Veljko Milutinovic,et al.  Neural Networks: Concepts, Applications, and Implementations , 1991 .

[28]  Mevlut Ture,et al.  Comparing classification techniques for predicting essential hypertension , 2005, Expert Syst. Appl..

[29]  So Young Sohn,et al.  Data fusion, ensemble and clustering to improve the classification accuracy for the severity of road traffic accidents in Korea , 2003 .

[30]  Santosh Kumar,et al.  Ergonomics, loss management, and occupational injury and illness surveillance. Part 2: injury and illness incident profile. Sample data , 2002 .

[31]  Theodore K. Courtney,et al.  Advancing analytic epidemiologic studies of occupational injuries , 1997 .

[32]  J P Leigh,et al.  Costs of occupational injuries and illnesses in California. , 2001, Preventive medicine.

[33]  Fernando Berzal Galiano,et al.  On the quest for easy-to-understand splitting rules , 2003, Data Knowl. Eng..

[34]  M. Sugeno,et al.  Structure identification of fuzzy model , 1988 .

[35]  Bruno Fabiano,et al.  A study of the relationship between occupational injuries and firm size and type in the Italian industry , 2004 .

[36]  G. V. Kass An Exploratory Technique for Investigating Large Quantities of Categorical Data , 1980 .

[37]  Mohamed Abdel-Aty,et al.  Identifying crash propensity using specific traffic speed conditions. , 2005, Journal of safety research.

[38]  M. Ghodsi,et al.  Occupational injuries in Tehran. , 2005, Injury.

[39]  Oi-ling Siu,et al.  Safety climate and safety performance among construction workers in Hong Kong. The role of psychological strains as mediators. , 2004, Accident; analysis and prevention.

[40]  E. Sorensen,et al.  The Decision Tree Approach to Stock Selection , 2000 .

[41]  George Cybenko,et al.  Approximation by superpositions of a sigmoidal function , 1992, Math. Control. Signals Syst..