Tree‐Based Logistic Regression Approach for Work Zone Casualty Risk Assessment

This study presents a tree-based logistic regression approach to assessing work zone casualty risk, which is defined as the probability of a vehicle occupant being killed or injured in a work zone crash. First, a decision tree approach is employed to determine the tree structure and interacting factors. Based on the Michigan M-94\I-94\I-94BL\I-94BR highway work zone crash data, an optimal tree comprising four leaf nodes is first determined and the interacting factors are found to be airbag, occupant identity (i.e., driver, passenger), and gender. The data are then split into four groups according to the tree structure. Finally, the logistic regression analysis is separately conducted for each group. The results show that the proposed approach outperforms the pure decision tree model because the former has the capability of examining the marginal effects of risk factors. Compared with the pure logistic regression method, the proposed approach avoids the variable interaction effects so that it significantly improves the prediction accuracy.

[1]  Li-Yen Chang,et al.  Analysis of traffic injury severity: an application of non-parametric classification tree techniques. , 2006, Accident; analysis and prevention.

[2]  Tae-Jun Ha,et al.  DETAILED STUDY OF ACCIDENT EXPERIENCE IN CONSTRUCTION AND MAINTENANCE ZONES , 1995 .

[3]  X. Qu,et al.  A probabilistic quantitative risk assessment model for the long-term work zone crashes. , 2010, Accident; analysis and prevention.

[4]  L Mussone,et al.  An analysis of urban collisions using an artificial intelligence model. , 1999, Accident; analysis and prevention.

[5]  Mohamed Abdel-Aty,et al.  Modeling rear-end collisions including the role of driver's visibility and light truck vehicles using a nested logit structure. , 2004, Accident; analysis and prevention.

[6]  Bryan K Allery,et al.  Exploratory Analysis of Relationship Between Number of Lanes and Safety on Urban Freeways , 2008 .

[7]  Gerald L Ullman,et al.  Analysis of Crashes at Active Night Work Zones in Texas , 2006 .

[8]  Constantin F. Aliferis,et al.  Time and sample efficient discovery of Markov blankets and direct causal relations , 2003, KDD '03.

[9]  Srinivas Reddy Geedipally,et al.  Extension of the Application of Conway‐Maxwell‐Poisson Models: Analyzing Traffic Crash Data Exhibiting Underdispersion , 2010, Risk analysis : an official publication of the Society for Risk Analysis.

[10]  Dario Gregori,et al.  Predicting Severity of Foreign Body Injuries in Children in Upper Airways: An Approach Based on Regression Trees , 2007, Risk analysis : an official publication of the Society for Risk Analysis.

[11]  Lisa Dorn,et al.  Work‐Related Road Safety: An Analysis Based on U.K. Bus Driver Performance , 2008, Risk analysis : an official publication of the Society for Risk Analysis.

[12]  Qiang Meng,et al.  Evaluation of rear-end crash risk at work zone using work zone traffic data. , 2011, Accident; analysis and prevention.

[13]  Xiaojun Gu,et al.  The Roads Ahead: Collision Risks, Trends, and Safety of Drivers , 2009, Risk analysis : an official publication of the Society for Risk Analysis.

[14]  Mohamed Abdel-Aty,et al.  Freeway Work-Zone Crash Analysis and Risk Identification Using Multiple and Conditional Logistic Regression , 2008 .

[15]  Qiang Meng,et al.  Decision Tree–Based Model for Estimation of Work Zone Capacity , 2011 .

[16]  Qiang Meng,et al.  A Genetic algorithm approach to assessing work zone casualty risk , 2011 .

[17]  Constantin F. Aliferis,et al.  Identifying Markov blankets with decision tree induction , 2003, Third IEEE International Conference on Data Mining.

[18]  Chen Fei See Crash analysis of work zone lane closures with left-hand merge and downstream lane shift , 2008 .

[19]  Dipti Srinivasan,et al.  DEVELOPMENT AND ADAPTATION OF CONSTRUCTIVE PROBABILISTIC NEURAL NETWORK IN FREEWAY INCIDENT DETECTION , 2002 .

[20]  G. Guyatt,et al.  The independent contribution of driver, crash, and vehicle characteristics to driver fatalities. , 2002, Accident; analysis and prevention.

[21]  Eleni I. Vlahogianni,et al.  Statistical methods versus neural networks in transportation research: Differences, similarities and some insights , 2011 .