Accident Risk Prediction based on Heterogeneous Sparse Data: New Dataset and Insights

Reducing traffic accidents is an important public safety challenge, therefore, accident analysis and prediction has been a topic of much research over the past few decades. Using small-scale datasets with limited coverage, being dependent on extensive set of data, and being not applicable for real-time purposes are the important shortcomings of the existing studies. To address these challenges, we propose a new solution for real-time traffic accident prediction using easy-to-obtain, but sparse data. Our solution relies on a deep-neural-network model (which we have named DAP, for Deep Accident Prediction); which utilizes a variety of data attributes such as traffic events, weather data, points-of-interest, and time. DAP incorporates multiple components including a recurrent (for time-sensitive data), a fully connected (for time-insensitive data), and a trainable embedding component (to capture spatial heterogeneity). To fill the data gap, we have - through a comprehensive process of data collection, integration, and augmentation - created a large-scale publicly available database of accident information named US-Accidents. By employing the US-Accidents dataset and through an extensive set of experiments across several large cities, we have evaluated our proposal against several baselines. Our analysis and results show significant improvements to predict rare accident events. Further, we have shown the impact of traffic information, time, and points-of-interest data for real-time accident prediction.

[1]  Tianbao Yang,et al.  Predicting Traffic Accidents Through Heterogeneous Urban Data : A Case Study , 2017 .

[2]  Rajiv Ramnath,et al.  Characterizing Driving Context from Driver Behavior , 2017, SIGSPATIAL/GIS.

[3]  D. Eisenberg The mixed effects of precipitation on traffic crashes. , 2004, Accident; analysis and prevention.

[4]  Florence March,et al.  2016 , 2016, Affair of the Heart.

[5]  Sergey Ioffe,et al.  Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.

[6]  Jürgen Schmidhuber,et al.  Long Short-Term Memory , 1997, Neural Computation.

[7]  Juan de Oña,et al.  Analysis of traffic accident severity using Decision Rules via Decision Trees , 2013, Expert Syst. Appl..

[8]  Xun Zhou,et al.  Precipitation Effects on Motor Vehicle Crashes Vary by Space, Time, and Environmental Conditions , 2016 .

[9]  Li-Yen Chang,et al.  Analysis of freeway accident frequencies: Negative binomial regression versus artificial neural network , 2005 .

[10]  Xuan Song,et al.  Learning Deep Representation from Big and Heterogeneous Data for Traffic Accident Inference , 2016, AAAI.

[11]  Uchendu O. Onwurah,et al.  Road traffic accidents prediction modelling: An analysis of Anambra State, Nigeria. , 2018, Accident; analysis and prevention.

[12]  Jinzhi Lei,et al.  A Deep Learning Approach to the Citywide Traffic Accident Risk Prediction , 2017, 2018 21st International Conference on Intelligent Transportation Systems (ITSC).

[13]  D. Jaroszweski,et al.  The influence of rainfall on road accidents in urban areas: A weather radar approach , 2014 .

[14]  Maurizio Guida,et al.  A crash-prediction model for multilane roads. , 2007, Accident; analysis and prevention.

[15]  Jeffrey Pennington,et al.  GloVe: Global Vectors for Word Representation , 2014, EMNLP.

[16]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[17]  A. Azzouz 2011 , 2020, City.

[18]  Tianbao Yang,et al.  Hetero-ConvLSTM: A Deep Learning Approach to Traffic Accident Prediction on Heterogeneous Spatio-Temporal Data , 2018, KDD.

[19]  Lu Wenqi,et al.  A model of traffic accident prediction based on convolutional neural network , 2017, 2017 2nd IEEE International Conference on Intelligent Transportation Engineering (ICITE).

[20]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[21]  Shun'ichi Kaneko,et al.  Combining Satellite Imagery and Open Data to Map Road Safety , 2017, AAAI.

[22]  Srinivasan Parthasarathy,et al.  Short and Long-term Pattern Discovery Over Large-Scale Geo-Spatiotemporal Data , 2019, KDD.

[23]  Martín Abadi,et al.  TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems , 2016, ArXiv.

[24]  Adel W. Sadek,et al.  A Novel Variable Selection Method based on Frequent Pattern Tree for Real-time Traffic Accident Risk Prediction , 2015, ArXiv.

[25]  Durga Toshniwal,et al.  A data mining framework to analyze road accident data , 2015, Journal of Big Data.

[26]  Li-Yen Chang,et al.  Data mining of tree-based models to analyze freeway accident frequency. , 2005, Journal of safety research.

[27]  Cheng Wang,et al.  SDCAE: Stack Denoising Convolutional Autoencoder Model for Accident Risk Prediction Via Traffic Big Data , 2018, 2018 Sixth International Conference on Advanced Cloud and Big Data (CBD).

[28]  Chandra R. Bhat,et al.  Unobserved heterogeneity and the statistical analysis of highway accident data , 2016 .

[29]  Athanasios Theofilatos,et al.  Incorporating real-time traffic and weather data to explore road accident likelihood and severity in urban arterials. , 2017, Journal of safety research.