Detecting Crowdedness Spot in City Transportation

Crowdedness spot is a crowded area with an abnormal number of objects. Detecting the crowdedness spots of moving vehicles in an urban area is essential to many applications. An intuitive method is to cluster the objects in areas to get the density information. Unfortunately, the data capturing vehicle mobility possesses some new features, such as highly mobile environments, supremely limited size of sample objects, and nonuniform biased samples, and all these features have raised new challenges that make traditional density-based clustering algorithms fail to retrieve the real clustering property of objects, making the results less meaningful. In this paper, we propose a novel nondensity-based approach called mobility-based clustering. The key idea is that sample objects are employed as “sensors” to perceive the vehicle crowdedness in nearby areas using their instant mobility rather than the “object representatives.” As such, the mobility of samples is naturally incorporated. Several key factors beyond the vehicle crowdedness have been identified, and techniques to compensate these effects are accordingly proposed. Furthermore, taking the detected crowdedness spots as a label of the taxi, we can identify one particular taxi to be a crowdedness taxi that crosses a number of different crowdedness spots. We evaluate the performance of our methods and baseline approaches based on real traffic situations (to retrieve the real traffic crowdedness) and real-life data sets. Finally, the interesting findings are provided for further discussions.

[1]  Jae-Gil Lee,et al.  Traffic Density-Based Discovery of Hot Routes in Road Networks , 2007, SSTD.

[2]  Benjamin Coifman Identifying the Onset of Congestion Rapidly with Existing Traffic Detectors , 1999 .

[3]  Ashutosh Sabharwal,et al.  On Achieving Local View Capacity Via Maximal Independent Graph Scheduling , 2011, IEEE Transactions on Information Theory.

[4]  Philip S. Yu,et al.  A Framework for Clustering Uncertain Data Streams , 2008, 2008 IEEE 24th International Conference on Data Engineering.

[5]  Lionel M. Ni,et al.  SEER: Metropolitan-Scale Traffic Perception Based on Lossy Sensory Data , 2009, IEEE INFOCOM 2009.

[6]  David Evans,et al.  Using Real-Time Road Traffic Data to Evaluate Congestion , 2011, Dependable and Historic Computing.

[7]  Tao Li,et al.  HIREL: An Incremental Clustering Algorithm for Relational Datasets , 2008, 2008 Eighth IEEE International Conference on Data Mining.

[8]  David Wai-Lok Cheung,et al.  Clustering Uncertain Data Using Voronoi Diagrams , 2008, 2008 Eighth IEEE International Conference on Data Mining.

[9]  Xing Xie,et al.  Urban computing with taxicabs , 2011, UbiComp '11.

[10]  Bernhard E. Boser,et al.  A training algorithm for optimal margin classifiers , 1992, COLT '92.

[11]  Jiawei Han,et al.  CLARANS: A Method for Clustering Objects for Spatial Data Mining , 2002, IEEE Trans. Knowl. Data Eng..

[12]  Mingyan Liu,et al.  Surface street traffic estimation , 2007, MobiSys '07.

[13]  Jian Pei,et al.  Query answering techniques on uncertain and probabilistic data: tutorial summary , 2008, SIGMOD Conference.

[14]  Guangzhong Sun,et al.  Driving with knowledge from the physical world , 2011, KDD.

[15]  Daqing Zhang,et al.  Urban Traffic Modelling and Prediction Using Large Scale Taxi GPS Traces , 2012, Pervasive.

[16]  Siyuan Liu,et al.  Towards mobility-based clustering , 2010, KDD.

[17]  Hans-Peter Kriegel,et al.  Statistical Density Prediction in Traffic Networks , 2008, SDM.

[18]  Philip S. Yu,et al.  Evolutionary Clustering by Hierarchical Dirichlet Process with Hidden Markov State , 2008, 2008 Eighth IEEE International Conference on Data Mining.

[19]  Fabrizio Granelli,et al.  Intelligent extended floating car data collection , 2009, Expert Syst. Appl..

[20]  Mehryar Mohri,et al.  L2 Regularization for Learning Kernels , 2009, UAI.

[21]  Deepayan Chakrabarti,et al.  Evolutionary clustering , 2006, KDD '06.

[22]  Jaakko Hollmén,et al.  Spatio-temporal Road Condition Forecasting with Markov Chains and Artificial Neural Networks , 2008, HAIS.

[23]  Hans-Peter Kriegel,et al.  Density-Based Clustering in Spatial Databases: The Algorithm GDBSCAN and Its Applications , 1998, Data Mining and Knowledge Discovery.

[24]  Tian Zhang,et al.  BIRCH: an efficient data clustering method for very large databases , 1996, SIGMOD '96.

[25]  Jeffrey Xu Yu,et al.  Sliding-window top-k queries on uncertain streams , 2008, Proc. VLDB Endow..

[26]  Sergio Greco,et al.  A Hierarchical Algorithm for Clustering Uncertain Data via an Information-Theoretic Approach , 2008, 2008 Eighth IEEE International Conference on Data Mining.

[27]  Yifan Li,et al.  Clustering moving objects , 2004, KDD.

[28]  Padhraic Smyth,et al.  Trajectory clustering with mixtures of regression models , 1999, KDD '99.

[29]  Samir Khuller,et al.  Achieving anonymity via clustering , 2006, PODS '06.

[30]  Jeffrey Xu Yu,et al.  Sliding-window top-k queries on uncertain streams , 2008, The VLDB Journal.

[31]  Li Tu,et al.  Density-based clustering for real-time stream data , 2007, KDD '07.

[32]  Cyrus Shahabi,et al.  Robust Time-Referenced Segmentation of Moving Object Trajectories , 2008, 2008 Eighth IEEE International Conference on Data Mining.

[33]  Xing Xie,et al.  Discovering spatio-temporal causal interactions in traffic data streams , 2011, KDD.