A Gradient Boosting Crash Prediction Approach for Highway-Rail Grade Crossing Crash Analysis

Highway-rail grade crossing (HRGC) crashes continue to be the major contributors to rail causalities in the United States and have been intensively researched in the past. Data-mining models focus on prediction while dominant general linear models focus on model and data fitness. Decision makers and traffic engineers rely on prediction models to examine at-grade crash frequency and make safety improvement. The gradient boosting (GB) model has gained popularity in many research areas. In this study, to fully understand the model performance on HRGC accident prediction performance, the GB model with functional gradient descent algorithm is selected to analyze crashes at highway-rail grade crossings (HRGCs) and to identify contributor factors. Moreover, contributors’ importance and partial-dependent relations are generated to further understand the relationship of identified contributors and HRGC crash likelihood to concur “black box” issues that most machine learning methods face. Furthermore, to fully demonstrate the model’s prediction performance, a comprehensive model prediction power assessment based on six measurements is conducted, and the prediction performance of the GB model is verified and compared with a decision tree model as a reference due to their popularity and comparable data availability. It is demonstrated that the GB model produces better prediction accuracy and reveals nonlinear relationships among contributors and crash likelihood. In general, HRGC crash likelihood is significantly impacted by several traffic exposure factors: highway traffic volume, railway traffic volume, and train travel speed and others.

[1]  Jerome H Friedman,et al.  Multiple additive regression trees with application in epidemiology , 2003, Statistics in medicine.

[2]  Denver Tolliver,et al.  Accident Prediction Accuracy Assessment for Highway-Rail Grade Crossings Using Random Forest Algorithm Compared with Decision Tree , 2020, Reliab. Eng. Syst. Saf..

[3]  Changxi Ma,et al.  The Impact of Aggressive Driving Behavior on Driver-Injury Severity at Highway-Rail Grade Crossings Accidents , 2018, Journal of Advanced Transportation.

[4]  J. Friedman Greedy function approximation: A gradient boosting machine. , 2001 .

[5]  H Lum,et al.  Modeling vehicle accidents and highway geometric design relationships. , 1993, Accident; analysis and prevention.

[6]  G. De’ath Boosted trees for ecological modeling and prediction. , 2007, Ecology.

[7]  G M McCollister,et al.  A model to predict the probability of highway rail crossing accidents , 2007 .

[8]  Xuesong Wang,et al.  Modeling signalized intersection safety with corridor-level spatial correlations. , 2010, Accident; analysis and prevention.

[9]  Denver Tolliver,et al.  Accident prediction model for public highway-rail grade crossings. , 2016, Accident; analysis and prevention.

[10]  Doohee Nam,et al.  Accident prediction model for railway-highway interfaces. , 2006, Accident; analysis and prevention.

[11]  H. Midi,et al.  Identification of Multiple Outliers in a Generalized Linear Model with Continuous Variables , 2016 .

[12]  Denver Tolliver,et al.  Decision Tree Approach to Accident Prediction for Highway–Rail Grade Crossings: Empirical Analysis , 2016 .

[13]  M J Maher,et al.  A comprehensive methodology for the fitting of predictive accident models. , 1996, Accident; analysis and prevention.

[14]  J. Friedman Stochastic gradient boosting , 2002 .

[15]  Matthew G Karlaftis,et al.  Effects of road geometry and traffic volumes on rural roadway accident rates. , 2002, Accident; analysis and prevention.

[16]  Xuedong Yan,et al.  Analyses of Rear-End Crashes Based on Classification Tree Models , 2006, Traffic injury prevention.

[17]  Federico Fraboni,et al.  Using data mining techniques to predict the severity of bicycle crashes. , 2017, Accident; analysis and prevention.

[18]  Fred L. Mannering,et al.  The statistical analysis of crash-frequency data: A review and assessment of methodological alternatives , 2010 .

[19]  Jodi L Carson,et al.  An alternative accident prediction model for highway-rail interfaces. , 2002, Accident; analysis and prevention.

[20]  Li-Yen Chang,et al.  Analysis of driver injury severity in truck-involved accidents using a non-parametric classification tree model , 2013 .

[21]  Liping Fu,et al.  Risk-Based Model for Identifying Highway-Rail Grade Crossing Blackspots , 2004 .

[22]  Li-Yen Chang,et al.  Data mining of tree-based models to analyze freeway accident frequency. , 2005, Journal of safety research.

[23]  Fred L. Mannering,et al.  The relationship among highway geometrics, traffic-related elements and motor-vehicle accident frequencies , 1998 .

[24]  Fred Mannering,et al.  Impact of roadside features on the frequency and severity of run-off-roadway accidents: an empirical analysis. , 2002, Accident; analysis and prevention.

[25]  Li-Yen Chang,et al.  Analysis of traffic injury severity: an application of non-parametric classification tree techniques. , 2006, Accident; analysis and prevention.

[26]  M G Karlaftis,et al.  Heterogeneity considerations in accident modeling. , 1998, Accident; analysis and prevention.

[27]  H C Chin,et al.  Modeling Accident Occurrence at Signalized Tee Intersections with Special Emphasis on Excess Zeros , 2003, Traffic injury prevention.

[28]  Pan Lu,et al.  Geometric effect analysis of highway-rail grade crossing safety performance. , 2020, Accident; analysis and prevention.

[29]  Xiao Qin,et al.  Variable Selection Issues in Tree-Based Regression Models , 2008 .

[30]  Osama A. Osman,et al.  A Comparative Analysis of Tree-Based Ensemble Methods for Detecting Imminent Lane Change Maneuvers in Connected Vehicle Environments , 2018, Transportation Research Record: Journal of the Transportation Research Board.

[31]  Xuedong Yan,et al.  Using hierarchical tree-based regression model to predict train-vehicle crashes at passive highway-rail grade crossings. , 2010, Accident; analysis and prevention.

[32]  Brenda Lantz,et al.  Commercial truck crash injury severity analysis using gradient boosting data mining model. , 2018, Journal of safety research.