Commercial truck crash injury severity analysis using gradient boosting data mining model.

INTRODUCTION Truck crashes contribute to a large number of injuries and fatalities. This study seeks to identify the contributing factors affecting truck crash severity using 2010 to 2016 North Dakota and Colorado crash data provided by the Federal Motor Carrier Safety Administration. METHOD To fulfill a gap of previous studies, broad considerations of company and driver characteristics, such as company size and driver's license class, along with vehicle types and crash characteristics are researched. Gradient boosting, a data mining technique, is applied to comprehensively analyze the relationship between crash severities and a set of heterogeneous risk factors. RESULTS Twenty five variables were tested and 22 of them are identified as significant variables contributing to injury severities, however, top 11 variables account for more than 80% of injury forecasting. The relative variable importance analysis is conducted and furthermore marginal effects of all contributing factors are also illustrated in this research. Several factors such as trucking company attributes (e.g., company size), safety inspection values, trucking company commerce status (e.g., interstate or intrastate), time of day, driver's age, first harmful events, and registration condition are found to be significantly associated with crash injury severity. Even though most of the identified contributing factors are significant for all four levels of crash severity, their relative importance and marginal effect are all different. CONCLUSIONS For the first time, trucking company and driver characteristics are proved to have significant impact on truck crash injury severity. Some of the results in this study reinforce previous studies' conclusions. PRACTICAL APPLICATIONS Findings in this study can be helpful for transportation agencies to reduce injury severity, and develop efficient strategies to improve safety.

[1]  Trevor Hastie,et al.  The Elements of Statistical Learning , 2001 .

[2]  Xiaoyu Zhu,et al.  A comprehensive analysis of factors influencing the injury severity of large-truck crashes. , 2011, Accident; analysis and prevention.

[3]  Zong Tian,et al.  Hierarchical Bayesian random intercept model-based cross-level interaction decomposition for truck driver injury severity investigations. , 2015, Accident; analysis and prevention.

[4]  Qiang Meng,et al.  Estimation of rear-end vehicle crash frequencies in urban road tunnels. , 2012, Accident; analysis and prevention.

[5]  Dibakar Saha,et al.  Prioritizing Highway Safety Manual's crash prediction variables using boosted regression trees. , 2015, Accident; analysis and prevention.

[6]  Shanshan Zhao,et al.  Weather impacts on single-vehicle truck crash injury severity. , 2016, Journal of safety research.

[7]  Guang Xiang Chen,et al.  Impact of federal compliance reviews of trucking companies in reducing highway truck crashes. , 2008, Accident; analysis and prevention.

[8]  Douglas J Gabauer,et al.  Influence of horizontally curved roadway section characteristics on motorcycle-to-barrier crash frequency. , 2015, Accident; analysis and prevention.

[9]  Nathan Huynh,et al.  Factors influencing injury severity of crashes involving HAZMAT trucks , 2017 .

[10]  Denver Tolliver,et al.  Accident prediction model for public highway-rail grade crossings. , 2016, Accident; analysis and prevention.

[11]  Avinash Unnikrishnan,et al.  Analysis of large truck crash severity using heteroskedastic ordered probit models. , 2011, Accident; analysis and prevention.

[12]  Yue Li,et al.  Determining the effective location of a portable changeable message sign on reducing the risk of truck-related crashes in work zones. , 2015, Accident; analysis and prevention.

[13]  Durga Toshniwal,et al.  A data mining framework to analyze road accident data , 2015, Journal of Big Data.

[14]  Loren Staplin,et al.  Job change rate as a crash predictor for interstate truck drivers. , 2005, Accident; analysis and prevention.

[15]  Eric T. Donnell,et al.  A method to account for and estimate underreporting in crash frequency research. , 2016, Accident; analysis and prevention.

[16]  Kara M. Kockelman,et al.  Freeway Speeds and Speed Variations Preceding Crashes, Within and Across Lanes , 2010 .

[17]  J. Friedman Greedy function approximation: A gradient boosting machine. , 2001 .

[18]  Haizhong Wang,et al.  Heterogeneous impacts of gender-interpreted contributing factors on driver injury severities in single-vehicle rollover crashes. , 2016, Accident; analysis and prevention.

[19]  Wei Zou,et al.  Truck crash severity in New York city: An investigation of the spatial and the time of day effects. , 2017, Accident; analysis and prevention.

[20]  Yu-Chiun Chiou,et al.  Modeling crash frequency and severity using multinomial-generalized Poisson model with error components. , 2013, Accident; analysis and prevention.

[21]  Jasmine Pahukula,et al.  A time of day analysis of crashes involving large trucks in urban areas. , 2015, Accident; analysis and prevention.

[22]  Denver Tolliver,et al.  Decision Tree Approach to Accident Prediction for Highway–Rail Grade Crossings: Empirical Analysis , 2016 .

[23]  Stephen H Richards,et al.  Differences in passenger car and large truck involved crash frequencies at urban signalized intersections: an exploratory analysis. , 2014, Accident; analysis and prevention.

[24]  Yi-Shih Chung,et al.  Factor complexity of crash occurrence: An empirical demonstration using boosted regression trees. , 2013, Accident; analysis and prevention.

[25]  Jerome H Friedman,et al.  Multiple additive regression trees with application in epidemiology , 2003, Statistics in medicine.

[26]  Qiong Wu,et al.  Analysis of driver injury severity in single-vehicle crashes on rural and urban roadways. , 2016, Accident; analysis and prevention.

[27]  Baoshan Huang,et al.  Multivariate random-parameters zero-inflated negative binomial regression model: an application to estimate crash frequencies at intersections. , 2014, Accident; analysis and prevention.

[28]  Mireille Chiron,et al.  Severity factors for truck drivers' injuries. , 2003, American journal of epidemiology.

[29]  G. De’ath Boosted trees for ecological modeling and prediction. , 2007, Ecology.

[30]  Guohui Zhang,et al.  An explanatory analysis of driver injury severity in rear-end crashes using a decision table/Naïve Bayes (DTNB) hybrid classifier. , 2016, Accident; analysis and prevention.

[31]  Denver Tolliver,et al.  Accident Prediction for Highway-Rail Grade Crossings Using Decision Tree Approach: An Empirical Analysis , 2016 .

[32]  E R Braver,et al.  Tractor-trailer crashes in Indiana: a case-control study of the role of truck configuration. , 1997, Accident; analysis and prevention.

[33]  K L Campbell,et al.  Fatal accident involvement rates by driver age for large trucks. , 1988, Accident; analysis and prevention.

[34]  Jon E Anderson,et al.  Obesity is associated with the future risk of heavy truck crashes among newly recruited commercial drivers. , 2012, Accident; analysis and prevention.

[35]  Joel Liesman,et al.  Estimating the relationship between measured wind speed and overturning truck crashes using a binary logit model. , 2007, Accident; analysis and prevention.