Real-time crash prediction on urban expressways: identification of key variables and a hybrid support vector machine model

Traffic safety is crucial to the efficient operation of an expressway system. Crashes are closely related to traffic states, and crash occurrence may be influenced by the interaction of different combinations of traffic states upstream and downstream of the crash location. Using crash data and the corresponding traffic flow detector data collected on expressways in Shanghai, this study proposes a hybrid model that combines a support vector machine (SVM) with a k-means clustering algorithm to predict the likelihood of crashes. A random forest (RF) model is used to select the most important variables for model construction from traffic flow data recorded 5-10 min before each crash. Three models (an SVM without variable selection, an SVM with variable selection, and the hybrid SVM with variable selection) are then evaluated through cross-validation and transferability tests using 577 crashes and 5794 matched non-crash events. The results show that a crash prediction model built on the four most important variables selected by the RF model achieves satisfactory prediction performance. With the clustering algorithm combined with the SVM model, prediction accuracy reaches as high as 78.0%. Moreover, the transferability results for the three models suggest that both variable selection and the clustering algorithm benefit crash prediction.
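The three-stage pipeline described in the abstract (RF variable selection, k-means clustering, SVM classification) can be sketched roughly as follows. This is a minimal illustration using scikit-learn on synthetic data, not the authors' implementation: the abstract does not say how the clustering and the SVM are coupled, so training one SVM per cluster is an assumption, as are the number of clusters, the RBF kernel, the class weighting, the 12 placeholder variables, and the sample sizes.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

rng = np.random.default_rng(0)

# Synthetic stand-in for detector data (assumption): 12 hypothetical traffic
# variables, with roughly 1 crash per 10 non-crash events.
n_crash, n_noncrash, n_vars = 100, 1000, 12
X = rng.normal(size=(n_crash + n_noncrash, n_vars))
y = np.r_[np.ones(n_crash, dtype=int), np.zeros(n_noncrash, dtype=int)]
X[y == 1, :4] += 1.0  # make the first four variables informative

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, stratify=y, random_state=0
)

# Stage 1: rank variables by RF importance and keep the top four.
rf = RandomForestClassifier(n_estimators=200, random_state=0)
rf.fit(X_train, y_train)
top_vars = np.argsort(rf.feature_importances_)[::-1][:4]

# Standardize the selected variables (k-means and SVM are scale sensitive).
scaler = StandardScaler().fit(X_train[:, top_vars])
Z_train = scaler.transform(X_train[:, top_vars])
Z_test = scaler.transform(X_test[:, top_vars])

# Stage 2: group observations into traffic-state clusters (k=2 is assumed).
kmeans = KMeans(n_clusters=2, n_init=10, random_state=0).fit(Z_train)
train_cluster = kmeans.labels_
test_cluster = kmeans.predict(Z_test)

# Stage 3: fit one SVM per cluster (an assumed way of coupling the models).
models = {}
for c in range(kmeans.n_clusters):
    mask = train_cluster == c
    if len(np.unique(y_train[mask])) < 2:
        # Only one class in this cluster: fall back to a constant prediction.
        models[c] = int(y_train[mask][0])
    else:
        svm = SVC(kernel="rbf", class_weight="balanced")
        models[c] = svm.fit(Z_train[mask], y_train[mask])

# Route each test observation to the model of its nearest cluster.
y_pred = np.zeros_like(y_test)
for c, model in models.items():
    mask = test_cluster == c
    if not mask.any():
        continue
    y_pred[mask] = model if isinstance(model, int) else model.predict(Z_test[mask])

accuracy = float((y_pred == y_test).mean())
print("selected variable indices:", sorted(top_vars.tolist()))
print("hold-out accuracy:", round(accuracy, 3))
```

The accuracy printed here reflects only the synthetic data and says nothing about the 78.0% reported in the study. With real detector data, the hold-out split would be replaced by the cross-validation and transferability tests the abstract mentions.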
