Using visual data mining in highway traffic safety analysis and decision making

An ongoing, two-fold challenge involves extracting useful information from the massive amounts of highway crash data and explaining complicated statistical models to inform the public about highway safety. Highway safety is critical to the trucking industry and highway funding policy. One method to analyze complex data is through the application of visual data mining tools. In this paper, we address the following three questions: a) what existing data visualization tools can assist with highway safety theory development and in policy-making?; b) can visual data mining uncover unknown relationships to inform the development of theory or practice? and c) can a data visualization toolkit be developed to assist the stakeholders in understanding the impact of publicpolicy on transportation safety? To address these questions, we developed a visual data mining toolkit that allows for understanding safety datasets and evaluating the effectiveness of safety policies. INTRODUCTION AND LITERATURE REVIEW Transportation accidents levy a significant cost on societies in terms of personal death or injury in addition to the economic costs. Road traffic injuries are the eighth leading cause of death, and the leading cause of death for individuals aged 15-29 (Lozano et al., 2012; World Health Organization, 2008). In 2010, transportation injuries have resulted in 1.24 million fatalities worldwide according to the World Health Organization (WHO), World Health Organization (2013, p. v). In addition to the lost lives, the costs associated with road traffic crashes runs to billions of dollars (Jacobs, Aeron-Thomas, & Astrop, 2000). These numbers are unacceptably high, especially since many of these fatalities can be avoided with evidencedriven road safety interventions. Road safety interventions can be effective in reducing the number of accidents and/or mitigating their effects. The WHO states that “adopting and enforcing legislation relating to important risk factors – speed, drunk–driving, motorcycle helmets, seat-belts and child restraints – has been shown to lead to reductions in road traffic injuries” (World Health Organization, 2013, p. v). These five risk factors are a sample of a larger pool of behavioral factors that lead to accidents. There are increasing regulations worldwide that have been passed to cover these behavioral factors. However, “in many countries these laws are either not comprehensive in scope or lacking altogether. Governments must do more to ensure that their national road safety laws meet best practice, and do more to enforce these laws” (World Health Organization, 2013, p. v) The problem is complex in the U.S., since highway safety policies can be different in neighboring states and the identification of best practice is often unclear (Governors Highway Safety Association, 2013). One approach to identifying best practices is to investigate the causes of vehicle crashes, assess

[1]  Nikiforos Stamatiadis,et al.  Guidelines for Left-Turn Phasing Treatment , 1997 .

[2]  John F. Roddick,et al.  A Survey of Temporal Knowledge Discovery Paradigms and Methods , 2002, IEEE Trans. Knowl. Data Eng..

[3]  Haim Levkowitz,et al.  From Visual Data Exploration to Visual Data Mining: A Survey , 2003, IEEE Trans. Vis. Comput. Graph..

[4]  Douglas M. King,et al.  Assessing the long term benefit of banning the use of hand-held wireless devices while driving , 2012 .

[5]  Ibrahim Yilmaz,et al.  Geographical information systems aided traffic accident analysis system case study: city of Afyonkarahisar. , 2008, Accident; analysis and prevention.

[6]  Ping Yi,et al.  Rule-Based Model for Traffic Accident Visualization and Analysis , 2001 .

[7]  Jian Pei,et al.  Data Mining: Concepts and Techniques, 3rd edition , 2006 .

[8]  Joe Flaherty CLUSTER ANALYSIS OF ARIZONA AUTOMATIC TRAFFIC RECORDER DATA , 1993 .

[9]  R H Laessig,et al.  Involvement of alcohol in fatalities of Wisconsin drivers. , 1970, Public health reports.

[10]  Daniel I. Rees,et al.  Medical Marijuana Laws, Traffic Fatalities, and Alcohol Consumption , 2013, The Journal of Law and Economics.

[11]  Bernadette A. Thomas,et al.  Global and regional mortality from 235 causes of death for 20 age groups in 1990 and 2010: a systematic analysis for the Global Burden of Disease Study 2010 , 2012, The Lancet.

[12]  John L Kent,et al.  Driver safety and motor carrier profitability : identifying and understanding drivers in the fleet , 2011 .

[13]  B. Marx The Visual Display of Quantitative Information , 1985 .

[14]  Law Teik Hua,et al.  TRAFFIC ACCIDENT APPLICATION USING GEOGRAPHIC INFORMATION SYSTEM , 2005 .

[15]  J. V. van Wijk,et al.  Cluster and calendar based visualization of time series data , 1999, Proceedings 1999 IEEE Symposium on Information Visualization (InfoVis'99).

[16]  Aneesh Krishna,et al.  An intelligent system to enhance traffic safety analysis , 2011 .

[17]  F Mannering,et al.  Effect of roadway geometrics and environmental factors on rural freeway accident frequencies. , 1995, Accident; analysis and prevention.

[18]  Hadley Wickham,et al.  Graphical Criticism: Some Historical Notes , 2013 .

[19]  Kelvin K W Yau,et al.  Risk factors affecting the severity of single vehicle traffic accidents in Hong Kong. , 2004, Accident; analysis and prevention.

[20]  Michael H. Böhlen,et al.  Visual Data Mining - Theory, Techniques and Tools for Visual Analytics , 2008, Visual Data Mining.

[21]  Ingrid van Schagen,et al.  Driving speed and the risk of road crashes: a review. , 2006, Accident; analysis and prevention.

[22]  A. Shibata,et al.  Risk factors of fatality in motor vehicle traffic accidents. , 1994, Accident; analysis and prevention.

[23]  Nathan Yau,et al.  Data Points: Visualization That Means Something , 2013 .

[24]  Frank L. Greitzer,et al.  Cognitive Foundations for Visual Analytics , 2011 .

[25]  W. Weijermars,et al.  Analyzing highway flow patterns using cluster analysis , 2005, Proceedings. 2005 IEEE Intelligent Transportation Systems, 2005..

[26]  Breno Sampaio,et al.  Identifying peer states for transportation policy analysis with an application to New York's handheld cell phone ban , 2014 .

[27]  A. S. Al-Ghamdi Using logistic regression to estimate the influence of accident factors on accident severity. , 2002, Accident; analysis and prevention.

[28]  William R. Black,et al.  HIGHWAY ACCIDENTS: A SPATIAL AND TEMPORAL ANALYSIS , 1991 .

[29]  A. Aeron-Thomas,et al.  ESTIMATING GLOBAL ROAD FATALITIES , 2000 .

[30]  Sheldon Howard Jacobson,et al.  Evaluating the impact of legislation prohibiting hand-held cell phone use while driving , 2010 .

[31]  John N. Ivan New Approach for Including Traffic Volumes in Crash Rate Analysis and Forecasting , 2004 .