Clustering-based classification of road traffic accidents using hierarchical clustering and artificial neural networks

Artificial neural networks (ANNs) have been widely used in predicting the severity of road traffic crashes. All available information about previously occurred accidents is typically used for building a single prediction model (i.e., classifier). Too little attention has been paid to the differences between these accidents, leading, in most cases, to build less accurate predictors. Hierarchical clustering is a well-known clustering method that seeks to group data by creating a hierarchy of clusters. Using hierarchical clustering and ANNs, a clustering-based classification approach for predicting the injury severity of road traffic accidents was proposed. About 6000 road accidents occurred over a six-year period from 2008 to 2013 in Abu Dhabi were used throughout this study. In order to reduce the amount of variation in data, hierarchical clustering was applied on the data set to organize it into six different forms, each with different number of clusters (i.e., clusters from 1 to 6). Two ANN models were subsequently built for each cluster of accidents in each generated form. The first model was built and validated using all accidents (training set), whereas only 66% of the accidents were used to build the second model, and the remaining 34% were used to test it (percentage split). Finally, the weighted average accuracy was computed for each type of models in each from of data. The results show that when testing the models using the training set, clustering prior to classification achieves (11%–16%) more accuracy than without using clustering, while the percentage split achieves (2%–5%) more accuracy. The results also suggest that partitioning the accidents into six clusters achieves the best accuracy if both types of models are taken into account.

[1]  Wilfried N. Gansterer,et al.  Classification of Vehicle Collision Patterns in Road Accidents using Data Mining Algorithms , 2016 .

[2]  Ebadi Mohammad,et al.  Evolving Genetic Algorithm, Fuzzy Logic and Kalman Filter for Prediction of Asphaltene Precipitation due to Natural Depletion , 2011 .

[3]  Hussein Dia,et al.  Development and evaluation of neural network freeway incident detection models using field data , 1997 .

[4]  Jan M. Zytkow,et al.  From Contingency Tables to Various Forms of Knowledge in Databases , 1996, Advances in Knowledge Discovery and Data Mining.

[5]  Li-Yen Chang,et al.  Analysis of traffic injury severity: an application of non-parametric classification tree techniques. , 2006, Accident; analysis and prevention.

[6]  F Mannering,et al.  Statistical analysis of accident severity on rural freeways. , 1996, Accident; analysis and prevention.

[7]  Y. Raiwani,et al.  Vehicular Accident Analysis Using Neural Network , 2014 .

[8]  Ramesh Sharda,et al.  Identifying significant predictors of injury severity in traffic accidents using a series of artificial neural networks. , 2006, Accident; analysis and prevention.

[9]  Antonio D’Ambrosio,et al.  Analysis of powered two-wheeler crashes in Italy by classification trees and rules discovery. , 2012, Accident; analysis and prevention.

[10]  P. Prevedouros,et al.  Urban Freeway Crash Analysis , 2007 .

[11]  D. Signorini,et al.  Neural networks , 1995, The Lancet.

[12]  Olutayo V.A,et al.  Traffic Accident Analysis Using Decision Trees and Neural Networks , 2014 .

[13]  Mohamed Abdel-Aty,et al.  Development of Artificial Neural Network Models to Predict Driver Injury Severity in Traffic Accidents at Signalized Intersections , 2001 .

[14]  L Mussone,et al.  An analysis of urban collisions using an artificial intelligence model. , 1999, Accident; analysis and prevention.