Outlier Trajectory Detection: A Trajectory Analytics Based Approach

Trajectories collected from GPS-enabled devices offer a rich opportunity to mine hidden knowledge about urban mobility, traffic dynamics, and human behavior. In this paper, we analyze historical trajectory data to discover outlier trajectories of taxis. An outlier trajectory is one that differs grossly from the others, meaning that few or no trajectories in the dataset follow a similar route. To identify outlier trajectories, we first present a prefix-tree-based algorithm called PTS, which traverses the search space on the fly to count the trajectories that follow similar routes. We then propose two trajectory-clustering-based approaches, PBOTD and DBOTD, which cluster trajectories and extract representative routes in different ways. Outlier detection is carried out directly on the representatives, and its accuracy is guaranteed by proven error bounds. An evaluation of the proposed methods on a real dataset of taxi trajectories verifies the high efficiency and accuracy of the DBOTD algorithm.
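
To make the counting idea concrete, the following is a minimal sketch and not the PTS algorithm itself. It assumes each trajectory has already been map-matched to a sequence of road-segment IDs, and it simplifies "following a similar route" to "following exactly the same segment sequence". The names `TrieNode`, `build_trie`, `support`, `find_outliers`, and the `min_support` threshold are hypothetical and chosen only for illustration.

```python
class TrieNode:
    """One node of a prefix tree over road-segment IDs (illustrative only)."""
    __slots__ = ("children", "ends")

    def __init__(self):
        self.children = {}  # segment ID -> child TrieNode
        self.ends = 0       # number of routes that end exactly at this node


def build_trie(routes):
    """Insert every route (a sequence of segment IDs) into a prefix tree."""
    root = TrieNode()
    for route in routes:
        node = root
        for seg in route:
            node = node.children.setdefault(seg, TrieNode())
        node.ends += 1
    return root


def support(root, route):
    """Return how many inserted routes are identical to `route`."""
    node = root
    for seg in route:
        node = node.children.get(seg)
        if node is None:
            return 0
    return node.ends


def find_outliers(routes, min_support):
    """Return indices of routes followed by fewer than `min_support` trajectories.

    Assumption: exact-match support stands in for the similarity-based
    count described in the abstract.
    """
    root = build_trie(routes)
    return [i for i, r in enumerate(routes) if support(root, r) < min_support]


if __name__ == "__main__":
    # Three taxis take the same route; one takes a detour via segment "d".
    routes = [["a", "b", "c"], ["a", "b", "c"], ["a", "b", "c"], ["a", "d", "c"]]
    print(find_outliers(routes, min_support=2))
```

A practical detector would replace the exact-match test with a tolerance-based similarity measure between routes, which is the part this sketch deliberately leaves out.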
