Real-time crash prediction in an urban expressway using disaggregated data

Abstract We develop accident prediction models for a stretch of the urban expressway Autopista Central in Santiago, Chile, using disaggregate data captured by free-flow toll gates with Automatic Vehicle Identification (AVI) which, besides their low failure rate, have the advantage of providing disaggregated data per type of vehicle. The process includes a random forest procedure to identify the strongest precursors of accidents, and the calibration/estimation of two classification models, namely, Support Vector Machine and Logistic regression. We find that, for this stretch of the highway, vehicle composition does not play a first-order role. Our best model accurately predicts 67.89% of the accidents with a low false positive rate of 20.94%. These results are among the best in the literature even though, and as opposed to previous efforts, (i) we do not use only one partition of the data set for calibration and validation but conduct 300 repetitions of randomly selected partitions; (ii) our models are validated on the original unbalanced data set (where accidents are quite rare events), rather than on artificially balanced data.

[1]  Xuesong Wang,et al.  Utilizing Microscopic Traffic and Weather Data to Analyze Real-Time Crash Patterns in the Context of Active Traffic Management , 2014, IEEE Transactions on Intelligent Transportation Systems.

[2]  Qi Shi,et al.  Multi-level Bayesian safety analysis with unprocessed Automatic Vehicle Identification data for an urban expressway. , 2016, Accident; analysis and prevention.

[3]  Adel W. Sadek,et al.  A Novel Variable Selection Method based on Frequent Pattern Tree for Real-time Traffic Accident Risk Prediction , 2015, ArXiv.

[4]  T. Golob,et al.  A Method for Relating Type of Crash to Traffic Flow Characteristics on Urban Freeways , 2002 .

[5]  George Yannis,et al.  A review of the effect of traffic and weather characteristics on road safety. , 2014, Accident; analysis and prevention.

[6]  Moinul Hossain,et al.  A Bayesian network based framework for real-time crash prediction on the basic freeway segments of urban expressways. , 2012, Accident; analysis and prevention.

[7]  Wei Wang,et al.  A Genetic Programming Model for Real-Time Crash Prediction on Freeways , 2013, IEEE Transactions on Intelligent Transportation Systems.

[8]  Tarek Sayed,et al.  Traffic accident modeling: some statistical issues , 2006 .

[9]  Mohamed Abdel-Aty,et al.  Predicting Freeway Crashes from Loop Detector Data by Matched Case-Control Logistic Regression , 2004 .

[10]  Mohamed Abdel-Aty,et al.  Utilizing support vector machine in real-time crash risk evaluation. , 2013, Accident; analysis and prevention.

[11]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[12]  Stephen Kwek,et al.  Applying Support Vector Machines to Imbalanced Datasets , 2004, ECML.

[13]  Jian Sun,et al.  A dynamic Bayesian network model for real-time crash prediction using traffic speed conditions data , 2015 .

[14]  Mohamed Abdel-Aty,et al.  Assessment of freeway traffic parameters leading to lane-change related collisions. , 2006, Accident; analysis and prevention.

[15]  Nitesh V. Chawla,et al.  SMOTE: Synthetic Minority Over-sampling Technique , 2002, J. Artif. Intell. Res..

[16]  Mohamed M. Ahmed,et al.  The Viability of Using Automatic Vehicle Identification Data for Real-Time Crash Prediction , 2012, IEEE Transactions on Intelligent Transportation Systems.

[17]  Christos Koukouvinos,et al.  Support Vector Machines Classification on Class Imbalanced Data: A Case Study with Real Medical Data , 2021 .

[18]  Mohamed Abdel-Aty,et al.  Real-time prediction of visibility related crashes , 2012 .

[19]  Juan de Dios Ortúzar,et al.  Stated preference in the valuation of interurban road safety. , 2003, Accident; analysis and prevention.

[20]  Bart Baesens,et al.  Comprehensible Credit Scoring Models Using Rule Extraction from Support Vector Machines , 2007, Eur. J. Oper. Res..

[21]  Wei-Yin Loh,et al.  Classification and regression trees , 2011, WIREs Data Mining Knowl. Discov..

[22]  Qi Shi,et al.  Big Data applications in real-time traffic operation and safety monitoring and improvement on urban expressways , 2015 .

[23]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[24]  George Yannis,et al.  Predicting road accidents: a rare-events modeling approach , 2016 .

[25]  Ho-Chan Kwak,et al.  Predicting crash risk and identifying crash precursors on Korean expressways using loop detector data. , 2016, Accident; analysis and prevention.