A Rare Event Modelling Approach to Assess Injury Severity Risk of Vulnerable Road Users

Vulnerable road users (VRUs) account for a large share of the fatalities and injuries occurring on European Union roads. It is therefore important to address the safety of VRUs, particularly in urban areas, by identifying the factors that affect injury severity so that countermeasures can be developed. This paper aims to identify the risk factors that affect the injury severity of a VRU involved in a motor vehicle crash. For that purpose, it presents a comparative evaluation of two machine learning classifiers (decision tree and logistic regression) combined with three resampling techniques (undersampling, oversampling and synthetic oversampling), comparing results on both imbalanced and balanced datasets. Crash records involving VRUs from three cities in Portugal over six years (2012–2017) were analyzed. The main conclusion that can be drawn from this study is that oversampling techniques improve the ability of the classifiers to identify risk factors. For pedestrians, the analysis revealed that road markings, road conditions and luminosity affect injury severity. For cyclists, age group and temporal variables (month, weekday and time period) proved relevant for predicting injury severity in a crash.
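The three resampling strategies named above can be illustrated with a minimal sketch. It is not the authors' implementation: the toy two-feature data, the function names and the neighbour count `k` are assumptions made for illustration, and the abstract does not say which synthetic oversampling method was used, so a SMOTE-style interpolation stands in for it here. Real crash variables are largely categorical and would need encoding, or a method designed for categorical data, before interpolation makes sense.

```python
import random

def undersample(minority, majority, rng):
    """Randomly drop majority rows until both classes have the same size."""
    return minority, rng.sample(majority, len(minority))

def oversample(minority, majority, rng):
    """Randomly duplicate minority rows (with replacement) up to the majority size."""
    extra = [rng.choice(minority) for _ in range(len(majority) - len(minority))]
    return minority + extra, majority

def synthetic_oversample(minority, majority, rng, k=3):
    """SMOTE-style stand-in: create each new row on the segment between a
    minority row and one of its k nearest minority neighbours."""
    def sq_dist(a, b):
        return sum((ai - bi) ** 2 for ai, bi in zip(a, b))

    synthetic = []
    for _ in range(len(majority) - len(minority)):
        base = rng.choice(minority)
        neighbours = sorted(
            (p for p in minority if p is not base),
            key=lambda p: sq_dist(base, p),
        )[:k]
        other = rng.choice(neighbours)
        t = rng.random()  # position along the segment, in [0, 1)
        synthetic.append(tuple(b + t * (o - b) for b, o in zip(base, other)))
    return minority + synthetic, majority

# Hypothetical imbalanced toy data: 95 "slight" rows versus 5 "severe" rows.
rng = random.Random(0)
majority = [(rng.gauss(0, 1), rng.gauss(0, 1)) for _ in range(95)]
minority = [(rng.gauss(2, 1), rng.gauss(2, 1)) for _ in range(5)]

for name, resample in [
    ("undersampling", undersample),
    ("oversampling", oversample),
    ("synthetic oversampling", synthetic_oversample),
]:
    new_minority, new_majority = resample(minority, majority, rng)
    print(f"{name}: minority={len(new_minority)}, majority={len(new_majority)}")
```

Each strategy returns a balanced pair of classes: undersampling shrinks the majority class to 5 rows, while both oversampling variants grow the minority class to 95 rows. A classifier would then be trained on the resampled training set only and evaluated on untouched held-out data.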
