Feature Grouping-Based Outlier Detection Upon Streaming Trajectories

Outlier detection acts as one of the most important analysis tasks for trajectory stream. In stream scenarios, such properties as unlimitedness, time-varying evolutionary, sparsity, and skewness distribution of trajectories pose new challenges to outlier detection technique. Trajectory outlier detection techniques mainly focus on finding trajectory that is dissimilar to the majority of the others, which is based on the hypothesis that they are probably generated by a different mechanism. Most distance-based methods tend to utilize a function (e.g., weighted linear sum) to measure the similarity of two arbitrary objects provided that representative features have been extracted in advance. However, this kind of method is not tailored to identify the outlier which is close to its neighbors according to some features, but behaves significantly different from its neighbors in terms of the other features. To address this issue, we propose a feature grouping-based mechanism that divides all the features into two groups, where the first group (Similarity Feature) is used to find close neighbors and the second group (Difference Feature) is used to find outliers within the similar neighborhood. According to the feature differences among local adjacent objects in one or more time intervals, we present two outlier definitions, including local anomaly trajectory fragment (TF-outlier) and evolutionary anomaly moving object (MO-outlier ). We devise a basic solution and then an optimized algorithm to detect both types of outliers. Experimental results show that our proposal is both effective and efficient to detect outliers upon trajectory data streams.

[1]  Vaibhav Muddebihalkar,et al.  Searching Trajectories by Regions of Interest , 2018 .

[2]  Divesh Srivastava,et al.  Forward Decay: A Practical Time Decay Model for Streaming Systems , 2009, 2009 IEEE 25th International Conference on Data Engineering.

[3]  Raymond T. Ng,et al.  Distance-based outliers: algorithms and applications , 2000, The VLDB Journal.

[4]  Nicholas Jing Yuan,et al.  On discovery of gathering patterns from trajectories , 2013, 2013 IEEE 29th International Conference on Data Engineering (ICDE).

[5]  Sangkyum Kim,et al.  ROAM: Rule- and Motif-Based Anomaly Detection in Massive Moving Object Data Sets , 2007, SDM.

[6]  Philip S. Yu,et al.  Global distance-based segmentation of trajectories , 2006, KDD '06.

[7]  Po-Ruey Lei,et al.  A framework for anomaly detection in maritime trajectory behavior , 2015, Knowledge and Information Systems.

[8]  Jae-Gil Lee,et al.  Trajectory Outlier Detection: A Partition-and-Detect Framework , 2008, 2008 IEEE 24th International Conference on Data Engineering.

[9]  Hans-Peter Kriegel,et al.  LOF: identifying density-based local outliers , 2000, SIGMOD '00.

[10]  Aoying Zhou,et al.  TSCluWin: Trajectory Stream Clustering over Sliding Window , 2016, DASFAA.

[11]  Jae-Gil Lee,et al.  Temporal Outlier Detection in Vehicle Traffic Data , 2009, 2009 IEEE 25th International Conference on Data Engineering.

[12]  Tao Wang,et al.  HyMU: A Hybrid Map Updating Framework , 2017, DASFAA.

[13]  Hui Xiong,et al.  A Taxi Driving Fraud Detection System , 2011, 2011 IEEE 11th International Conference on Data Mining.

[14]  Yin Wang,et al.  Mining large-scale gps streams for connectivity refinement of road maps , 2013, SIGSPATIAL/GIS.

[15]  Sridhar Ramaswamy,et al.  Efficient algorithms for mining outliers from large data sets , 2000, SIGMOD '00.

[16]  Douglas M. Hawkins Identification of Outliers , 1980, Monographs on Applied Probability and Statistics.

[17]  Lei Cao,et al.  Detecting moving object outliers in massive-scale trajectory streams , 2014, KDD.

[18]  Cyrus Shahabi,et al.  Crowd sensing of traffic anomalies based on human mobility and social media , 2013, SIGSPATIAL/GIS.

[19]  Yang Gao,et al.  TRASMIL: A local anomaly detection framework based on trajectory segmentation and multi-instance learning , 2013, Comput. Vis. Image Underst..

[20]  Nicholas Jing Yuan,et al.  Online Discovery of Gathering Patterns over Trajectories , 2014, IEEE Transactions on Knowledge and Data Engineering.

[21]  Hui Xiong,et al.  Top-Eye: top-k evolving trajectory outlier detection , 2010, CIKM.

[22]  Xiaoyi Duan,et al.  Real-Time Personalized Taxi-Sharing , 2016, DASFAA.

[23]  Lin Sun,et al.  Real-Time Detection of Anomalous Taxi Trajectories from GPS Traces , 2011, MobiQuitous.

[24]  Sanjay Chawla,et al.  Inferring the Root Cause in Road Traffic Anomalies , 2012, 2012 IEEE 12th International Conference on Data Mining.

[25]  Nikos Pelekis,et al.  Segmentation and Sampling of Moving Object Trajectories Based on Representativeness , 2012, IEEE Transactions on Knowledge and Data Engineering.

[26]  Raymond T. Ng,et al.  Finding Intensional Knowledge of Distance-Based Outliers , 1999, VLDB.

[27]  Jae-Gil Lee,et al.  Trajectory clustering: a partition-and-group framework , 2007, SIGMOD '07.

[28]  Dawei Liu,et al.  Efficient anomaly monitoring over moving object trajectory streams , 2009, KDD.

[29]  Aoying Zhou,et al.  Popular Route Planning with Travel Cost Estimation , 2016, DASFAA.

[30]  Zhi-Hua Zhou,et al.  iBAT: detecting anomalous taxi trajectories from GPS traces , 2011, UbiComp '11.

[31]  Ramayya Krishnan,et al.  Fraud Detection From Taxis' Driving Behaviors , 2014, IEEE Transactions on Vehicular Technology.

[32]  Lin Sun,et al.  iBOAT: Isolation-Based Online Anomalous Trajectory Detection , 2013, IEEE Transactions on Intelligent Transportation Systems.

[33]  Weiwei Sun,et al.  GLUE: a Parameter-Tuning-Free Map Updating System , 2015, CIKM.

[34]  Raymond T. Ng,et al.  Algorithms for Mining Distance-Based Outliers in Large Datasets , 1998, VLDB.

[35]  Panos Kalnis,et al.  User oriented trajectory search for trip recommendation , 2012, EDBT '12.

[36]  Xing Xie,et al.  Discovering spatio-temporal causal interactions in traffic data streams , 2011, KDD.

[37]  Nicholas Jing Yuan,et al.  Towards efficient search for activity trajectories , 2013, 2013 IEEE 29th International Conference on Data Engineering (ICDE).

[38]  Panos Kalnis,et al.  Personalized trajectory matching in spatial networks , 2014, The VLDB Journal.

[39]  Vic Barnett,et al.  Outliers in Statistical Data , 1980 .

[40]  Nikos Pelekis,et al.  Clustering Trajectories of Moving Objects in an Uncertain World , 2009, 2009 Ninth IEEE International Conference on Data Mining.