Injury Severity Prediction From Two-Vehicle Crash Mechanisms With Machine Learning and Ensemble Models

Machine learning algorithms aim to improve the power of predictors over conventional regression models. This study aims to tap the predictive potential of crash mechanism-related variables using ensemble machine learning models. The results demonstrate selected models can predict severity at a high level of accuracy. The stacking model with a linear blender is preferred for the designed ensemble combination. Most bagging, boosting, and stacking algorithms perform well, indicating ensemble models are capable of improving upon individual models.

[1]  Yoav Freund,et al.  A Short Introduction to Boosting , 1999 .

[2]  James M. Keller,et al.  A fuzzy K-nearest neighbor algorithm , 1985, IEEE Transactions on Systems, Man, and Cybernetics.

[3]  Ajith Abraham,et al.  Traffic Accident Analysis Using Machine Learning Paradigms , 2005, Informatica.

[4]  Cha Zhang,et al.  Ensemble Machine Learning: Methods and Applications , 2012 .

[5]  Robert P. W. Duin,et al.  Using two-class classifiers for multiclass classification , 2002, Object recognition supported by user interaction for service robots.

[6]  Atorod Azizinamini,et al.  Improved Support Vector Machine Models for Work Zone Crash Injury Severity Prediction and Analysis , 2019, Transportation Research Record: Journal of the Transportation Research Board.

[7]  Yinhai Wang,et al.  Short-Term Speed Prediction Using Remote Microwave Sensor Data: Machine Learning versus Statistical Model , 2016 .

[8]  Alan E Hubbard,et al.  Dynamic multi-outcome prediction after injury: Applying adaptive machine learning for precision medicine in trauma , 2019, PloS one.

[9]  Bernhard E. Boser,et al.  A training algorithm for optimal margin classifiers , 1992, COLT '92.

[10]  Ziyuan Pu,et al.  Comparing Prediction Performance for Crash Injury Severity Among Various Machine Learning and Statistical Methods , 2018, IEEE Access.

[11]  Haobin Jiang,et al.  A comparative study on machine learning based algorithms for prediction of motorcycle crash severity , 2019, PloS one.

[12]  Stef van Buuren,et al.  MICE: Multivariate Imputation by Chained Equations in R , 2011 .

[13]  M. Kubát An Introduction to Machine Learning , 2017, Springer International Publishing.

[14]  Asad J. Khattak,et al.  Applying the Ordered Probit Model to Injury Severity in Truck-Passenger Car Rear-End Collisions , 1998 .

[15]  David A Noyce,et al.  Injury Severity of Multivehicle Crash in Rainy Weather , 2010 .

[16]  Chandra R. Bhat,et al.  Joint Analysis of Injury Severity of Drivers in Two-Vehicle Crashes Accommodating Seat Belt Use Endogeneity , 2013 .

[17]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[18]  Richard Tay,et al.  Examining driver injury severity in two vehicle crashes - a copula based approach. , 2014, Accident; analysis and prevention.

[19]  Madhar Taamneh,et al.  Severity Prediction of Traffic Accident Using an Artificial Neural Network , 2017 .

[20]  Nicola Torelli,et al.  Training and assessing classification rules with imbalanced data , 2012, Data Mining and Knowledge Discovery.

[21]  J. Friedman Greedy function approximation: A gradient boosting machine. , 2001 .

[22]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[23]  Ramesh Sharda,et al.  Identifying significant predictors of injury severity in traffic accidents using a series of artificial neural networks. , 2006, Accident; analysis and prevention.

[24]  David Logan,et al.  A kinetic energy model of two-vehicle crash injury severity. , 2011, Accident; analysis and prevention.

[25]  Mohamed Abdel-Aty,et al.  Predicting Injury Severity Levels in Traffic Crashes: A Modeling Comparison , 2004 .

[26]  Peter E. Hart,et al.  Nearest neighbor pattern classification , 1967, IEEE Trans. Inf. Theory.

[27]  Juan de Oña,et al.  Analysis of traffic accident severity using Decision Rules via Decision Trees , 2013, Expert Syst. Appl..

[28]  Yoram Singer,et al.  Reducing Multiclass to Binary: A Unifying Approach for Margin Classifiers , 2000, J. Mach. Learn. Res..

[29]  Mohamed Abdel-Aty,et al.  A classification tree based modeling approach for segment related crashes on multilane highways. , 2010, Journal of safety research.

[30]  David H. Wolpert,et al.  Stacked generalization , 1992, Neural Networks.

[31]  D. Kibler,et al.  Instance-based learning algorithms , 2004, Machine Learning.

[32]  Yuanchang Xie,et al.  Predicting motor vehicle collisions using Bayesian neural network models: an empirical analysis. , 2007, Accident; analysis and prevention.

[33]  Mohamed Abdel-Aty,et al.  Utilizing support vector machine in real-time crash risk evaluation. , 2013, Accident; analysis and prevention.

[34]  Fred L. Mannering,et al.  The statistical analysis of crash-frequency data: A review and assessment of methodological alternatives , 2010 .

[35]  Markus König,et al.  Assessment and weighting of meteorological ensemble forecast members based on supervised machine learning with application to runoff simulations and flood warning , 2017, Adv. Eng. Informatics.

[36]  Mohamed Abdel-Aty,et al.  Using conditional inference forests to identify the factors affecting crash severity on arterial corridors. , 2009, Journal of safety research.

[37]  อนิรุธ สืบสิงห์,et al.  Data Mining Practical Machine Learning Tools and Techniques , 2014 .

[38]  Veronica Czitrom,et al.  One-Factor-at-a-Time versus Designed Experiments , 1999 .

[39]  Kirolos Haleem,et al.  Effect of driver's age and side of impact on crash severity along urban freeways: a mixed logit approach. , 2013, Journal of safety research.

[40]  Aixia Guo,et al.  Gene Selection for Cancer Classification using Support Vector Machines , 2014 .

[41]  Xiugang Li,et al.  Predicting motor vehicle crashes using Support Vector Machine models. , 2008, Accident; analysis and prevention.

[42]  Kristofer D. Kusano,et al.  Comparison and Validation of Injury Risk Classifiers for Advanced Automated Crash Notification Systems , 2014, Traffic injury prevention.

[43]  Amirfarrokh Iranitalab,et al.  Comparison of four statistical and machine learning methods for crash severity prediction. , 2017, Accident; analysis and prevention.

[44]  Carol A C Flannagan,et al.  Identification and validation of a logistic regression model for predicting serious injuries associated with motor vehicle crashes. , 2011, Accident; analysis and prevention.

[45]  Wolfgang Utschick,et al.  A statistical learning approach for estimating the reliability of crash severity predictions , 2016, 2016 IEEE 19th International Conference on Intelligent Transportation Systems (ITSC).

[46]  Ali Tavakoli Kashani,et al.  Identifying the Most Important Factors in the At-Fault Probability of Motorcyclists by Data Mining, Based on Classification Tree Models , 2017, International Journal of Civil Engineering.

[47]  Wei Wang,et al.  Using support vector machine models for crash injury severity analysis. , 2012, Accident; analysis and prevention.

[48]  Nitesh V. Chawla,et al.  SMOTE: Synthetic Minority Over-sampling Technique , 2002, J. Artif. Intell. Res..

[49]  Tibebe Beshah,et al.  Mining Road Traffic Accident Data to Improve Safety: Role of Road-Related Factors on Accident Severity in Ethiopia , 2010, AAAI Spring Symposium: Artificial Intelligence for Development.

[50]  Qi Zhao,et al.  Predicting Drug-Induced Liver Injury Using Ensemble Learning Methods and Molecular Fingerprints , 2018, Toxicological sciences : an official journal of the Society of Toxicology.

[51]  Dirk Van den Poel,et al.  Handling class imbalance in customer churn prediction , 2009, Expert Syst. Appl..

[52]  Zong Tian,et al.  Investigating driver injury severity patterns in rollover crashes using support vector machine models. , 2016, Accident; analysis and prevention.

[53]  Marjan Simoncic,et al.  A Bayesian Network Model of Two-Car Accidents , 2004 .

[54]  Mohamed Abdel-Aty,et al.  Analyzing crash injury severity for a mountainous freeway incorporating real-time traffic and weather data , 2014 .

[55]  Dursun Delen,et al.  Investigating injury severity risk factors in automobile crashes with predictive analytics and sensitivity analysis methods , 2017 .

[56]  Jinjun Tang,et al.  Crash injury severity analysis using a two-layer Stacking framework. , 2019, Accident; analysis and prevention.

[57]  Sara Ferreira,et al.  Risk factors affecting injury severity determined by the MAIS score , 2017, Traffic injury prevention.

[58]  R. Schapire The Strength of Weak Learnability , 1990, Machine Learning.

[59]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[60]  David Levinson,et al.  An energy loss-based vehicular injury severity model. , 2020, Accident; analysis and prevention.

[61]  Wei-Yin Loh,et al.  A Comparison of Prediction Accuracy, Complexity, and Training Time of Thirty-Three Old and New Classification Algorithms , 2000, Machine Learning.

[62]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[63]  Thomas G. Dietterich What is machine learning? , 2020, Archives of Disease in Childhood.

[64]  G. W. Davis,et al.  Sensitivity analysis in neural net solutions , 1989, IEEE Trans. Syst. Man Cybern..