Identification of rules induced through decision tree algorithm for detection of traffic accidents with victims: a study case from Brazil

Abstract As traffic demand continues to increase and in the face of limited possibilities to expand the road infrastructure, various attempts are being made to make road traffic safer. In order to decrease the number of traffic accidents, it is required knowledge of its causes, so that actions are taken to avoid them. This is possible through collection and management of the available information on traffic accidents. Thus, policies are needed to identify and manage all available information relating to traffic accidents. This study aims to identify rules induced by Decision Tree algorithms (DT) for detecting traffic accidents with victims in a road stretch from accidents records, as well as probable causes of the occurrence and type of accident. Data are from a road stretch of the Regis Bittencourt (highway BR-116) between km 509 to km 519 in the period 2012–2014, located in Sao Paulo, Brazil. Through the main results obtained, it can be concluded that the CART algorithm of the Decision Tree is a useful tool in identifying potential sites of accidents with victims. In this case, the two most important variables to identify the severity of accidents were the accident type and the accident cause.

[1]  Ahmed E. Radwan,et al.  Modeling traffic accident occurrence and involvement. , 2000, Accident; analysis and prevention.

[2]  Griselda López,et al.  Extracting decision rules from police accident reports through decision trees. , 2013, Accident; analysis and prevention.

[3]  Li-Yen Chang,et al.  Analysis of traffic injury severity: an application of non-parametric classification tree techniques. , 2006, Accident; analysis and prevention.

[4]  Li-Yen Chang,et al.  Data mining of tree-based models to analyze freeway accident frequency. , 2005, Journal of safety research.

[5]  Yu-Chiun Chiou,et al.  Modeling crash frequency and severity using multinomial-generalized Poisson model with error components. , 2013, Accident; analysis and prevention.

[6]  López Griselda,et al.  Using Decision Trees to Extract Decision Rules from Police Reports on Road Accidents , 2012 .

[7]  Leo Breiman,et al.  Classification and Regression Trees , 1984 .

[8]  R. Lewis An Introduction to Classification and Regression Tree (CART) Analysis , 2000 .

[9]  Antonio D’Ambrosio,et al.  Analysis of powered two-wheeler crashes in Italy by classification trees and rules discovery. , 2012, Accident; analysis and prevention.

[10]  Dominique Lord,et al.  The statistical analysis of highway crash-injury severities: a review and assessment of methodological alternatives. , 2011, Accident; analysis and prevention.

[11]  H Lum,et al.  Modeling vehicle accidents and highway geometric design relationships. , 1993, Accident; analysis and prevention.

[12]  Bruce N. Janson,et al.  Diagnostic Methodology for the Detection of Safety Problems at Intersections , 2002 .

[13]  Reza Sigari Tabrizi,et al.  The role of human factor in incidence and severity of road crashes based on the CART and LR regression: a data mining approach , 2011, WCIT.

[14]  Fred L. Mannering,et al.  The statistical analysis of crash-frequency data: A review and assessment of methodological alternatives , 2010 .

[15]  Juan de Oña,et al.  Using Data Mining Techniques to Road Safety Improvement in Spanish Roads , 2014 .

[16]  Mohamed Abdel-Aty,et al.  Market basket analysis of crash data from large jurisdictions and its potential as a decision support tool , 2009 .

[17]  F Mannering,et al.  Effect of roadway geometrics and environmental factors on rural freeway accident frequencies. , 1995, Accident; analysis and prevention.

[18]  Tibebe Beshah,et al.  Mining Road Traffic Accident Data to Improve Safety: Role of Road-Related Factors on Accident Severity in Ethiopia , 2010, AAAI Spring Symposium: Artificial Intelligence for Development.

[19]  Chao Wang,et al.  Predicting accident frequency at their severity levels and its application in site ranking using a two-stage mixed multivariate model. , 2011, Accident; analysis and prevention.

[20]  F L Mannering,et al.  THE RELATIONSHIP BETWEEN HIGHWAY GEOMETRICS, TRAFFIC RELATED ELEMENTS, AND MOTOR VEHICLE ACCIDENTS. FINAL REPORT , 1996 .

[21]  Pradeep Kumar Agarwal,et al.  A Methodology for Ranking Road Safety Hazardous Locations Using Analytical Hierarchy Process , 2013 .

[22]  Andrew P Tarko,et al.  Effective and Fair Identification of Hazardous Locations , 2004 .