A crash severity analysis at highway-rail grade crossings: The random survival forest method.

This paper proposes a machine learning approach, the random survival forest (RSF) for competing risks, to investigate highway-rail grade crossing (HRGC) crash severity during a 29-year analysis period. The benefits of the RSF approach are that it (1) is a special type of survival analysis able to accommodate the competing nature of multiple-event outcomes to the same event of interest (here the competing multiple events are crash severities), (2) is able to conduct an event-specific selection of risk factors, (3) has the capability to determine long-term cumulative effects of contributors with the cumulative incidence function (CIF), (4) provides high prediction performance, and (5) is effective in high-dimensional settings. The RSF approach is able to consider complexities in HRGC safety analysis, e.g., non-linear relationships between HRGCs crash severities and the contributing factors and heterogeneity in data. Variable importance (VIMP) technique is adopted in this research for selecting the most predictive contributors for each crash-severity level. Moreover, marginal effect analysis results real several HRGC countermeasures' effectiveness. Several insightful findings are discovered. For examples, adding stop signs to HRGCs that already have a combination of gate, standard flashing lights, and audible devices will reduce the likelihood of property damage only (PDO) crashes for up to seven years; but after the seventh year, the crossings are more likely to have PDO crashes. Adding audible devices to crossing with gates and standard flashing lights will reduce crash likelihood, PDO, injury, and fatal crashes by 49 %, 52 %, 46 %, and 50 %, respectively.

[1]  Hemant Ishwaran,et al.  Random survival forests for competing risks. , 2014, Biostatistics.

[2]  Kirolos Haleem,et al.  Contributing factors of crash injury severity at public highway-railroad grade crossings in the U.S. , 2015, Journal of safety research.

[3]  Asad J. Khattak,et al.  What are the differences in driver injury outcomes at highway-rail grade crossings? Untangling the role of pre-crash behaviors. , 2015, Accident; analysis and prevention.

[4]  Liping Fu,et al.  A latent class modeling approach for identifying vehicle driver injury severity factors at highway-railway crossings. , 2012, Accident; analysis and prevention.

[5]  Wenjiang J. Fu,et al.  Estimating misclassification error with small samples via bootstrap cross-validation , 2005, Bioinform..

[6]  Dominique Lord,et al.  The statistical analysis of highway crash-injury severities: a review and assessment of methodological alternatives. , 2011, Accident; analysis and prevention.

[7]  Aemal Khattak,et al.  Motor vehicle drivers' injuries in train-motor vehicle crashes. , 2015, Accident; analysis and prevention.

[8]  E Graf,et al.  Assessment and comparison of prognostic classification schemes for survival data. , 1999, Statistics in medicine.

[9]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[10]  Robert Gray,et al.  A Proportional Hazards Model for the Subdistribution of a Competing Risk , 1999 .

[11]  Yang Feng,et al.  High-dimensional variable selection for Cox's proportional hazards model , 2010, 1002.3315.

[12]  Pan Lu,et al.  Predicting Highway–Rail Grade Crossing Collision Risk by Neural Network Systems , 2019 .

[13]  Denver Tolliver,et al.  A Gradient Boosting Crash Prediction Approach for Highway-Rail Grade Crossing Crash Analysis , 2020, Journal of Advanced Transportation.

[14]  Janice Daniel,et al.  Motor vehicle driver injury severity study under various traffic control at highway-rail grade crossings in the United States. , 2014, Journal of safety research.

[15]  Asad J Khattak,et al.  Gate-violation behavior at highway-rail grade crossings and the consequences: Using geo-Spatial modeling integrated with path analysis. , 2017, Accident; analysis and prevention.

[16]  R. Gray A Class of $K$-Sample Tests for Comparing the Cumulative Incidence of a Competing Risk , 1988 .

[17]  H Putter,et al.  Tutorial in biostatistics: competing risks and multi‐state models , 2007, Statistics in medicine.

[18]  G. Brier VERIFICATION OF FORECASTS EXPRESSED IN TERMS OF PROBABILITY , 1950 .

[19]  Hemant Ishwaran,et al.  Evaluating Random Forests for Survival Analysis using Prediction Error Curves. , 2012, Journal of statistical software.

[20]  Chi-Kang Lee,et al.  Investigation of Key Factors for Accident Severity at Railroad Grade Crossings by Using a Logit Model. , 2010, Safety science.

[21]  Archie Burnham Stop Sign Effectiveness at Railroad Grade Crossings (Abuse without Excuse) , 1995 .

[22]  Harald Binder,et al.  Allowing for mandatory covariates in boosting estimation of sparse high-dimensional survival models , 2008, BMC Bioinformatics.

[23]  Aemal J. Khattak,et al.  Cluster-Based Approach to Analyzing Crash Injury Severity at Highway–Rail Grade Crossings , 2017 .

[24]  Denver Tolliver,et al.  Accident prediction model for public highway-rail grade crossings. , 2016, Accident; analysis and prevention.

[25]  Doohee Nam,et al.  Accident prediction model for railway-highway interfaces. , 2006, Accident; analysis and prevention.

[26]  Denver Tolliver,et al.  Decision Tree Approach to Accident Prediction for Highway–Rail Grade Crossings: Empirical Analysis , 2016 .

[27]  Changxi Ma,et al.  The Impact of Aggressive Driving Behavior on Driver-Injury Severity at Highway-Rail Grade Crossings Accidents , 2018, Journal of Advanced Transportation.

[28]  Pan Lu,et al.  Geometric effect analysis of highway-rail grade crossing safety performance. , 2020, Accident; analysis and prevention.

[29]  Hemant Ishwaran,et al.  Random Survival Forests , 2008, Wiley StatsRef: Statistics Reference Online.

[30]  D. Cox Regression Models and Life-Tables , 1972 .

[31]  Mohamed Abdel-Aty,et al.  Exploring the overall and specific crash severity levels at signalized intersections. , 2005, Accident; analysis and prevention.

[32]  Liping Fu,et al.  Developing Safety Performance Functions for Railway Grade Crossings: A Case Study of Canada , 2015 .

[33]  Janice Daniel,et al.  Driver injury severity related to inclement weather at highway–rail grade crossings in the United States , 2016, Traffic injury prevention.

[34]  F Meeker,et al.  A comparison of driver behavior at railroad grade crossings with two different protection systems. , 1997, Accident; analysis and prevention.

[35]  Wei Fan,et al.  Analysis of Severity of Vehicle Crashes at Highway-Rail Grade Crossings: Multinomial Logit Modeling , 2014 .

[36]  Lalita Thakali,et al.  Benchmarking regions using a heteroskedastic grouped random parameters model with heterogeneity in mean and variance: Applications to grade crossing safety analysis , 2018, Analytic Methods in Accident Research.

[37]  Stephen H Richards,et al.  Evaluation of Effectiveness of Stop-Sign Treatment at Highway–Railroad Grade Crossings , 2009 .

[38]  One-Jang. Jeng Survey of Driver Perceptions of Railroad and Light Rail Warning Devices/Grade Crossings , 2005 .

[39]  Janice Daniel,et al.  Severity of Injuries to Motor Vehicle Drivers at Highway–Rail Grade Crossings in the United States , 2013 .

[40]  A. Raftery,et al.  Strictly Proper Scoring Rules, Prediction, and Estimation , 2007 .

[41]  Christina M. Rudin-Brown,et al.  Driver behaviour at rail level crossings: responses to flashing lights, traffic signals and stop signs in simulated rural driving. , 2011, Applied ergonomics.

[42]  Xuedong Yan,et al.  Using hierarchical tree-based regression model to predict train-vehicle crashes at passive highway-rail grade crossings. , 2010, Accident; analysis and prevention.

[43]  Brenda Lantz,et al.  Commercial truck crash injury severity analysis using gradient boosting data mining model. , 2018, Journal of safety research.

[44]  Udaya B. Kogalur,et al.  High-Dimensional Variable Selection for Survival Data , 2010 .

[45]  Dominique Lord,et al.  Crash data modeling with a generalized estimator. , 2018, Accident; analysis and prevention.

[46]  Liping Fu,et al.  Analyzing injury severity factors at highway railway grade crossing accidents involving vulnerable road users: A comparative study , 2016, Traffic injury prevention.

[47]  A. H. Murphy,et al.  “Good” Probability Assessors , 1968 .

[48]  Nassir Navab,et al.  Survival analysis for high-dimensional, heterogeneous medical data: Exploring feature extraction as an alternative to feature selection , 2016, Artif. Intell. Medicine.

[49]  M. Schumacher,et al.  Consistent Estimation of the Expected Brier Score in General Survival Models with Right‐Censored Event Times , 2006, Biometrical journal. Biometrische Zeitschrift.

[50]  Aemal J. Khattak,et al.  A clustering approach to injury severity in pedestrian-train crashes at highway-rail grade crossings , 2019 .