Detecting Urban Anomalies Using Multiple Spatio-Temporal Data Sources

Urban anomalies, such as abnormal movements of crowds and accidents, may result in loss of life or property if not handled properly. It would be of great value for governments if anomalies can be automatically alerted in their early stage. However, detecting anomalies in urban area has two main challenges. First, the criteria to determine an anomaly on different occasions (e.g. rainy days vs. sunny days, or holidays vs. workdays) and in different places (e.g. tourist attractions vs. office areas) are distinctly different, as these occasions and places have their own definitions on normal patterns. Second, urban anomalies often exhibit complex forms (e.g. road closure may cause decrease in taxi flow and increase in bike flow). We need an algorithm that not only models the anomaly degree of individual data source but also the combination of changes in multiple data sources. In this paper, we propose a two-step method to tackle those challenges. In the first step, we use a similarity-based algorithm to estimate an anomaly score for each individual data source in each region and time slot based on the values of historically similar regions. Those scores are fed into the second step, where we propose an algorithm based on one-class Support Vector Machine to capture rare patterns occurred in multiple data sources, nearby regions or time slots, and give a final, integrated anomaly score for each region. Evaluations based on both synthetic and real world datasets show the advantages of our method beyond baseline techniques such as distance-based, probability-based methods.

[1]  Mingyan Teng,et al.  Anomaly detection on time series , 2010, 2010 IEEE International Conference on Progress in Informatics and Computing.

[2]  Zhi-Hua Zhou,et al.  iBAT: detecting anomalous taxi trajectories from GPS traces , 2011, UbiComp '11.

[3]  Yoshihide Sekimoto,et al.  Anomalous event detection on large-scale GPS data from mobile phones using hidden markov model and cloud platform , 2013, UbiComp.

[4]  Jae-Gil Lee,et al.  Temporal Outlier Detection in Vehicle Traffic Data , 2009, 2009 IEEE 25th International Conference on Data Engineering.

[5]  Sayan Ranu,et al.  MANTRA: A Scalable Approach to Mining Temporally Anomalous Sub-trajectories , 2016, KDD.

[6]  Tieniu Tan,et al.  Similarity based vehicle trajectory clustering and anomaly detection , 2005, IEEE International Conference on Image Processing 2005.

[7]  Su Yang,et al.  Anomaly Detection on Collective Moving Patterns: Manifold Learning Based Analysis of Traffic Streams , 2011, 2011 IEEE Third Int'l Conference on Privacy, Security, Risk and Trust and 2011 IEEE Third Int'l Conference on Social Computing.

[8]  William Chad Young,et al.  Detecting and classifying anomalous behavior in spatiotemporal network data ∗ , 2014 .

[9]  Nicholas Jing Yuan,et al.  Segmentation of Urban Areas Using Road Networks , 2012 .

[10]  Sanjay Chawla,et al.  Inferring the Root Cause in Road Traffic Anomalies , 2012, 2012 IEEE 12th International Conference on Data Mining.

[11]  Georg Langs,et al.  Unsupervised Anomaly Detection with Generative Adversarial Networks to Guide Marker Discovery , 2017, IPMI.

[12]  Yu Zheng,et al.  Detecting collective anomalies from multiple spatio-temporal datasets across different domains , 2015, SIGSPATIAL/GIS.

[13]  Bernhard Schölkopf,et al.  Support Vector Method for Novelty Detection , 1999, NIPS.

[14]  Cláudio T. Silva,et al.  Using Topological Analysis to Support Event-Guided Exploration in Urban Data , 2014, IEEE Transactions on Visualization and Computer Graphics.

[15]  Barnabás Póczos,et al.  Hierarchical Probabilistic Models for Group Anomaly Detection , 2011, AISTATS.

[16]  G. Smaragdakis,et al.  Spatio-Temporal Network Anomaly Detection by Assessing Deviations of Empirical Measures , 2009, IEEE/ACM Transactions on Networking.

[17]  Sanjay Chawla,et al.  On Mining Anomalous Patterns in Road Traffic Streams , 2011, ADMA.

[18]  Qiang Yang,et al.  Spatio-temporal event detection using dynamic conditional random fields , 2009, IJCAI 2009.

[19]  Arindam Banerjee,et al.  Semi-Markov Switching Vector Autoregressive Model-Based Anomaly Detection in Aviation Systems , 2016, J. Aerosp. Inf. Syst..

[20]  Jae-Gil Lee,et al.  Trajectory Outlier Detection: A Partition-and-Detect Framework , 2008, 2008 IEEE 24th International Conference on Data Engineering.

[21]  Daniel A. Keim,et al.  Visual Analysis of Time‐Series Similarities for Anomaly Detection in Sensor Networks , 2014, Comput. Graph. Forum.

[22]  Charu C. Aggarwal,et al.  Outlier Detection for Temporal Data: A Survey , 2014, IEEE Transactions on Knowledge and Data Engineering.

[23]  Alfred O. Hero,et al.  Multicriteria Similarity-Based Anomaly Detection Using Pareto Depth Analysis , 2016, IEEE Transactions on Neural Networks and Learning Systems.

[24]  K. Zou,et al.  Correlation and simple linear regression. , 2003, Radiology.

[25]  A. Shevyrnogov,et al.  Spatial and temporal anomalies of sea surface temperature in global scale (by space-based data)☆ , 2004 .

[26]  Yoshua Bengio,et al.  Generative Adversarial Nets , 2014, NIPS.

[27]  Barnabás Póczos,et al.  Group Anomaly Detection using Flexible Genre Models , 2011, NIPS.

[28]  VARUN CHANDOLA,et al.  Anomaly detection: A survey , 2009, CSUR.

[29]  Su Yang,et al.  Anomaly Detection on Collective Moving Patterns: A Hidden Markov Model Based Solution , 2011, 2011 International Conference on Internet of Things and 4th International Conference on Cyber, Physical and Social Computing.

[30]  Man Lung Yiu,et al.  Discovering Longest-lasting Correlation in Sequence Databases , 2013, Proc. VLDB Endow..

[31]  Katrien van Driessen,et al.  A Fast Algorithm for the Minimum Covariance Determinant Estimator , 1999, Technometrics.

[32]  Xing Xie,et al.  Discovering spatio-temporal causal interactions in traffic data streams , 2011, KDD.

[33]  Oliver Obst,et al.  Spatiotemporal Anomaly Detection in Gas Monitoring Sensor Networks , 2008, EWSN.