BTCI: A new framework for identifying congestion cascades using bus trajectory data

The knowledge of traffic health status is essential to the general public and urban traffic management. To identify congestion cascades, an important phenomenon of traffic health, we propose a Bus Trajectory based Congestion Identification (BTCI) framework that explores the anomalous traffic health status and structure properties of congestion cascades using bus trajectory data. BTCI consists of two main steps, congested segment extraction and congestion cascades identification. The former constructs path speed models from historical vehicle transitions and design a non-parametric Kernel Density Estimation (KDE) function to derive a measure of congestion score. The latter aggregates congested segments (i.e., those with high congestion scores) into traffic congestion cascades by unifying both attribute coherence and spatio-temporal closeness of congested segments within a cascade. Extensive evaluations on 11.8 million bus trajectory data show that (1) BTCI can effectively identify congestion cascades, (2) the proposed congestion score is effective in extracting congested segments, (3) the proposed unified approach significantly outperforms alternative approaches in terms of extended precision, and (4) the identified congestion cascades are realistic, matching well with the traffic news and highly correlated with vehicle speed bands.

[1]  Xing Xie,et al.  Discovering spatio-temporal causal interactions in traffic data streams , 2011, KDD.

[2]  Daniel A. Keim,et al.  An Efficient Approach to Clustering in Large Multimedia Databases with Noise , 1998, KDD.

[3]  Hong Cheng,et al.  GBAGC: A General Bayesian Framework for Attributed Graph Clustering , 2014, TKDD.

[4]  Zhoujun Li,et al.  Citywide traffic congestion estimation with social media , 2015, SIGSPATIAL/GIS.

[5]  Zhen Qian,et al.  Road Traffic Congestion Monitoring in Social Media with Hinge-Loss Markov Random Fields , 2014, 2014 IEEE International Conference on Data Mining.

[6]  W. Pattara-atikom,et al.  Estimating Road Traffic Congestion using Vehicle Velocity , 2006, 2006 6th International Conference on ITS Telecommunications.

[7]  Eleonora D'Andrea,et al.  Real-Time Detection of Traffic From Twitter Stream Analysis , 2015, IEEE Transactions on Intelligent Transportation Systems.

[8]  Chetan Gupta,et al.  Forecasting Spatiotemporal Impact of Traffic Incidents on Road Networks , 2013, 2013 IEEE 13th International Conference on Data Mining.

[9]  Sanjay Chawla,et al.  Inferring the Root Cause in Road Traffic Anomalies , 2012, 2012 IEEE 12th International Conference on Data Mining.

[10]  Chang-Tien Lu,et al.  A search and summary application for traffic events detection based on Twitter data , 2014, SIGSPATIAL/GIS.

[11]  Gaetano Valenti,et al.  Traffic Estimation And Prediction Based On Real Time Floating Car Data , 2008, 2008 11th International IEEE Conference on Intelligent Transportation Systems.

[12]  Freddy Lécué,et al.  Westland row why so slow?: fusing social media and linked data sources for understanding real-time traffic conditions , 2013, IUI '13.

[13]  Hong Cheng,et al.  Graph Clustering Based on Structural/Attribute Similarities , 2009, Proc. VLDB Endow..

[14]  Ling Liu,et al.  Social influence based clustering of heterogeneous information networks , 2013, KDD.

[15]  F. Porikli,et al.  Traffic congestion estimation using HMM models without vehicle tracking , 2004, IEEE Intelligent Vehicles Symposium, 2004.

[16]  Siyuan Liu,et al.  Detecting Crowdedness Spot in City Transportation , 2013, IEEE Transactions on Vehicular Technology.

[17]  R. Horowitz,et al.  Traffic density estimation with the cell transmission model , 2003, Proceedings of the 2003 American Control Conference, 2003..

[18]  Cyrus Shahabi,et al.  Crowd sensing of traffic anomalies based on human mobility and social media , 2013, SIGSPATIAL/GIS.

[19]  Yu Zheng,et al.  Detecting collective anomalies from multiple spatio-temporal datasets across different domains , 2015, SIGSPATIAL/GIS.

[20]  Sergei Vassilvitskii,et al.  k-means++: the advantages of careful seeding , 2007, SODA '07.

[21]  E. Parzen On Estimation of a Probability Density Function and Mode , 1962 .

[22]  Charu C. Aggarwal,et al.  Relation Strength-Aware Clustering of Heterogeneous Information Networks with Incomplete Attributes , 2012, Proc. VLDB Endow..