Decision support system for safety improvement: An approach using multiple correspondence analysis, t-SNE algorithm and K-means clustering

Abstract An attempt has been made to develop a decision support system (DSS) for safety improvement using a multi-step knowledge discovery process involving multiple correspondence analysis (MCA), t-SNE algorithm and K-means clustering. MCA is used for dimension reduction and perceptual mapping from categorical data. Usually, the first two dimensions are used for perceptual mapping if these two dimensions explain a significant percentage of variance. Otherwise, the traditional method of two dimensional mapping, leads to loss of important categorical information involved with other dimensions. Considering the above, a novel R2-profile approach, as an alternate to inertia based approach, is adopted to obtain the desired number of dimensions to be retained without loss of significant amount of information. t-SNE technique reduces the high dimensional data into two dimensional (2D) map, which provides the associations amongst different categories. K-means clustering grouped the 2D categories in homogenous clusters as per the similarities of the categories. A novel kernel category based chi-square distance method is proposed to identify sub-clusters within a cluster which subsequently provides useful rules for safety improvement. The methodology also provides a logical approach of dimension reduction in a form called ‘funnel diagram’. Finally, the DSS is applied to analysing near miss incidents occurred in electric overhead traveling (EOT) crane operations in a steel plant. Several safety rules are identified and safety interventions are proposed.

[1]  Ful-Chiang Wu,et al.  A comparative study on optimization methods for experiments with ordered categorical data , 2006, Comput. Ind. Eng..

[2]  Athanasios P. Chassiakos,et al.  Development of decision-support system for managing highway safety , 2005 .

[3]  Robert Babuska,et al.  Railway Track Circuit Fault Diagnosis Using Recurrent Neural Networks , 2017, IEEE Transactions on Neural Networks and Learning Systems.

[4]  Greg Nesmith,et al.  Investigating process safety near misses to improve performance , 2013 .

[5]  Huan-Jyh Shyur,et al.  A quantitative model for aviation safety risk assessment , 2008, Comput. Ind. Eng..

[6]  Antonio Fernando Catelli Infantosi,et al.  Multiple correspondence analysis in predictive logistic modelling: Application to a living-donor kidney transplantation data , 2009, Comput. Methods Programs Biomed..

[7]  Norberto Piccinini,et al.  Self-Organizing Map and clustering algorithms for the analysis of occupational accident databases , 2011 .

[8]  Geoffrey E. Hinton,et al.  Visualizing Data using t-SNE , 2008 .

[9]  Michael Greenacre,et al.  Interpreting multiple correspondence analysis , 1991 .

[10]  Mario De Luca,et al.  Road Safety Knowledge-Based Decision Support System , 2011 .

[11]  Jhareswar Maiti,et al.  Mining Safety Rules for Derailments in a Steel Plant Using Correspondence Analysis , 2014 .

[12]  Xiaoduan Sun,et al.  Investigation on the wrong way driving crash patterns using multiple correspondence analysis. , 2018, Accident; analysis and prevention.

[13]  Aviad Shapira,et al.  AHP-based analysis of the risk potential of safety incidents: Case study of cranes in the construction industry , 2017 .

[14]  Wei Wang,et al.  Evaluation of the impacts of traffic states on crash risks on freeways. , 2012, Accident; analysis and prevention.

[15]  Jhareswar Maiti,et al.  Identifying patterns of safety related incidents in a steel plant using association rule mining of incident investigation reports , 2014 .

[16]  Nuno Sousa,et al.  The Use of Multiple Correspondence Analysis to Explore Associations between Categories of Qualitative Variables in Healthy Ageing , 2013, Journal of aging research.

[17]  Odysseas Moschidis,et al.  Quality costing: a survey in Greek supermarkets using multiple correspondence analysis , 2016 .

[18]  Chih-Hsuan Wang,et al.  Combining multiple correspondence analysis with association rule mining to conduct user-driven product design of wearable devices , 2016, Comput. Stand. Interfaces.

[19]  Rein Schandersson,et al.  Knowledge Acquisition, Verification, and Validation in an Expert System for Improved Traffic Safety , 2000 .

[20]  Xiaoduan Sun,et al.  Association knowledge for fatal run-off-road crashes by Multiple Correspondence Analysis , 2016 .

[21]  Aviad Shapira,et al.  Analyzing risk factors in crane-related near-miss and accident reports , 2017 .

[22]  Wenkui Xi,et al.  A feature extraction and visualization method for fault detection of marine diesel engines , 2018 .

[23]  M. Hill Correspondence Analysis: A Neglected Multivariate Method , 1974 .

[24]  Andre Kleyner,et al.  A Bayesian Approach to Determine Test Sample Size Requirements for Reliability Demonstration Retesting after Product Design Change , 2015 .

[25]  Ferani E. Zulvia,et al.  Application of metaheuristic based fuzzy K-modes algorithm to supplier clustering , 2018, Comput. Ind. Eng..

[26]  Li Pheng Khoo,et al.  A fuzzy c-means based hybrid evolutionary approach to the clustering of supply chain , 2013, Comput. Ind. Eng..

[27]  Prianka N. Seneviratne Knowledge-Based System for Managing Intersection Safety , 1990 .

[28]  Tengke Xiong,et al.  DHCC: Divisive hierarchical clustering of categorical data , 2011, Data Mining and Knowledge Discovery.

[29]  Theodore T. Allen,et al.  Timely Decision Analysis Enabled by Efficient Social Media Modeling , 2017, Decis. Anal..

[30]  Kathleen L. McFadden,et al.  Risk models for analyzing pilot-error at US airlines: a comparative safety study , 2003 .

[31]  G. W. Shepherd,et al.  Crane fatalities -- a taxonomic analysis , 2000 .

[32]  Jhareswar Maiti,et al.  Human error identification and risk prioritization in overhead crane operations using HTA, SHERPA and fuzzy VIKOR method , 2015, Expert Syst. Appl..

[33]  Marcelo Embiruçu,et al.  A fuzzy modeling approach to optimize control and decision making in conflict management in air traffic control , 2018, Comput. Ind. Eng..

[34]  Don Jyh-Fu Jeng,et al.  Social influence on the use of Clinical Decision Support Systems: Revisiting the Unified Theory of Acceptance and Use of Technology by the fuzzy DEMATEL technique , 2012, Comput. Ind. Eng..