An explanatory analysis of driver injury severity in rear-end crashes using a decision table/Naïve Bayes (DTNB) hybrid classifier.

Rear-end crashes are a major type of traffic crash in the U.S., and a comprehensive examination of the mechanisms by which they result in injuries and fatalities is a practical necessity. Decision table (DT) and Naïve Bayes (NB) methods have both been used widely, but separately, to solve classification problems in many areas other than traffic safety research. Based on a two-year rear-end crash dataset, this paper applies a decision table/Naïve Bayes (DTNB) hybrid classifier to select the determining attributes and predict driver injury outcomes in rear-end crashes. The test results show that the hybrid classifier performs reasonably well, as indicated by several performance measures, including accuracy, F-measure, the ROC curve, and AUC. Fifteen attributes were found to be significant in predicting driver injury severity, including weather, lighting conditions, road geometry characteristics, and driver behavior information. The extracted decision rules indicate that heavy vehicle involvement, a comfortable traffic environment, inferior lighting conditions, two-lane rural roadways, disabling vehicle damage, and two-vehicle crashes increase the likelihood of drivers sustaining fatal injuries. The research limitations regarding data size, data structure, and result presentation are also summarized. The applied methodology and estimation results provide insights for developing effective countermeasures to reduce rear-end crash injury severity and improve the safety performance of the traffic system.
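The evaluation measures named above (accuracy, F-measure, and AUC) can be illustrated with a minimal Python sketch. This is not the authors' implementation and uses no data from the paper: the labels, scores, 0.5 threshold, and function names are all hypothetical, and the problem is simplified to a binary fatal/non-fatal outcome.

```python
# Illustrative only: toy labels and scores, NOT data from the paper.
# 1 = fatal injury (positive class), 0 = non-fatal (assumed binary simplification).

def accuracy(y_true, y_pred):
    """Fraction of cases whose predicted label matches the true label."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)

def f_measure(y_true, y_pred, positive=1):
    """Harmonic mean of precision and recall for the positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive and p == positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    fn = sum(t == positive and p != positive for t, p in pairs)
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)

def auc(y_true, scores, positive=1):
    """Area under the ROC curve, computed as the probability that a random
    positive case receives a higher score than a random negative case
    (ties count as one half)."""
    pos = [s for t, s in zip(y_true, scores) if t == positive]
    neg = [s for t, s in zip(y_true, scores) if t != positive]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0
               for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

# Hypothetical classifier output: predicted probability of a fatal injury.
y_true = [1, 1, 1, 0, 0, 0, 0, 0]
scores = [0.9, 0.7, 0.4, 0.6, 0.3, 0.2, 0.2, 0.1]
y_pred = [1 if s >= 0.5 else 0 for s in scores]  # assumed 0.5 threshold

acc = accuracy(y_true, y_pred)
f1 = f_measure(y_true, y_pred)
area = auc(y_true, scores)
print(f"accuracy={acc:.3f}  F-measure={f1:.3f}  AUC={area:.3f}")
```

Accuracy and F-measure depend on the chosen threshold, whereas AUC summarizes ranking quality across all thresholds. That difference is one reason to report several measures together when the severity classes are imbalanced.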
