An Adaptive Clustering Approach for Accident Prediction

Traffic accident prediction is a crucial task in the mobility domain. State-of-the-art accident prediction approaches are based on static and uniform grid-based geospatial aggregations, limiting their capability for fine-grained predictions. This property becomes particularly problematic in more complex regions such as city centers. In such regions, a grid cell can contain subregions with different properties; furthermore, an actual accident-prone region can be split across grid cells arbitrarily. This paper proposes Adaptive Clustering Accident Prediction (ACAP) - a novel accident prediction method based on a grid growing algorithm. ACAP applies adaptive clustering to the observed geospatial accident distribution and performs embeddings of temporal, accident-related, and regional features to increase prediction accuracy. We demonstrate the effectiveness of the proposed ACAP method using open real-world accident datasets from three cities in Germany. We demonstrate that ACAP improves the accident prediction performance for complex regions by 2–3 percent points in F1-score by adapting the geospatial aggregation to the distribution of the underlying spatio-temporal events. Our grid growing approach outperforms the clustering-based baselines by four percent points in terms of F1-score on average.

[1]  Tarek Sayed,et al.  Accident prediction models with random corridor parameters. , 2009, Accident; analysis and prevention.

[2]  Srinivasan Parthasarathy,et al.  Accident Risk Prediction based on Heterogeneous Sparse Data: New Dataset and Insights , 2019, SIGSPATIAL/GIS.

[3]  Maurizio Guida,et al.  A crash-prediction model for multilane roads. , 2007, Accident; analysis and prevention.

[4]  Hans-Peter Kriegel,et al.  A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise , 1996, KDD.

[5]  Esa Alhoniemi,et al.  Clustering of the self-organizing map , 2000, IEEE Trans. Neural Networks Learn. Syst..

[6]  Pasi Fränti,et al.  A grid-growing clustering algorithm for geo-spatial data , 2015, Pattern Recognit. Lett..

[7]  Erkki Oja,et al.  Engineering applications of the self-organizing map , 1996, Proc. IEEE.

[8]  Tianbao Yang,et al.  Predicting Traffic Accidents Through Heterogeneous Urban Data : A Case Study , 2017 .

[9]  Xuan Song,et al.  Learning Deep Representation from Big and Heterogeneous Data for Traffic Accident Inference , 2016, AAAI.

[10]  Martin Hess,et al.  Investigating Roundabout Properties and Bicycle Accident Occurrence at Swiss Roundabouts: A Logistic Regression Approach , 2019, ISPRS Int. J. Geo Inf..

[11]  Tianbao Yang,et al.  Hetero-ConvLSTM: A Deep Learning Approach to Traffic Accident Prediction on Heterogeneous Spatio-Temporal Data , 2018, KDD.

[12]  Jinzhi Lei,et al.  A Deep Learning Approach to the Citywide Traffic Accident Risk Prediction , 2017, 2018 21st International Conference on Intelligent Transportation Systems (ITSC).

[13]  P. Rousseeuw Silhouettes: a graphical aid to the interpretation and validation of cluster analysis , 1987 .

[14]  Lu Wenqi,et al.  A model of traffic accident prediction based on convolutional neural network , 2017, 2017 2nd IEEE International Conference on Intelligent Transportation Engineering (ICITE).

[15]  I. S. Sitanggang,et al.  Determination of Optimal Epsilon (Eps) Value on DBSCAN Algorithm to Clustering Data on Peatland Hotspots in Sumatra , 2016 .

[16]  Li-Yen Chang,et al.  Analysis of freeway accident frequencies: Negative binomial regression versus artificial neural network , 2005 .

[17]  Sergey Ioffe,et al.  Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.

[18]  David West,et al.  A comparison of SOM neural network and hierarchical clustering methods , 1996 .

[19]  J. MacQueen Some methods for classification and analysis of multivariate observations , 1967 .