Pattern and Anomaly Detection in Urban Temporal Networks

Broad spectrum of urban activities including mobility can be modeled as temporal networks evolving over time. Abrupt changes in urban dynamics caused by events such as disruption of civic operations, mass crowd gatherings, holidays and natural disasters are potentially reflected in these temporal mobility networks. Identification and early detecting of such abnormal developments is of critical importance for transportation planning and security. Anomaly detection from high dimensional network data is a challenging task as edge level measurements often have low values and high variance resulting in high noise-to-signal ratio. In this study, we propose a generic three-phase pipeline approach to tackle curse of dimensionality and noisiness of the original data. Our pipeline consists of i) initial network aggregation leveraging community detection ii) unsupervised dimensionality reduction iii) clustering of the resulting representations for outlier detection. We perform extensive experiments to evaluate the proposed approach on mobility data collected from two major cities, New York City and Taipei. Our results empirically prove that proposed methodology outperforms traditional approaches for anomaly detection. We further argue that the proposed anomaly detection framework is potentially generalizable to various other types of temporal networks e.g. social interactions, information propagation and epidemic spread.

[1]  Paolo Santi,et al.  Supporting Information for Quantifying the Benefits of Vehicle Pooling with Shareability Networks Data Set and Pre-processing , 2022 .

[2]  Christophe Diot,et al.  Diagnosing network-wide traffic anomalies , 2004, SIGCOMM.

[3]  Lingjing Wang,et al.  Structure of 311 service requests as a signature of urban location , 2016, PloS one.

[4]  S. Strogatz,et al.  Redrawing the Map of Great Britain from a Network of Human Interactions , 2010, PloS one.

[5]  Jaideep Srivastava,et al.  Event detection from time series data , 1999, KDD '99.

[6]  Daniel B. Neill,et al.  Non-parametric scan statistics for event detection and forecasting in heterogeneous social media graphs , 2014, KDD.

[7]  Fabrizio Angiulli,et al.  Anomaly Detection in Networks with Temporal Information , 2016, DS.

[8]  R. Bellman Dynamic programming. , 1957, Science.

[9]  Geoffrey E. Hinton,et al.  Reducing the Dimensionality of Data with Neural Networks , 2006, Science.

[10]  J. MacQueen Some methods for classification and analysis of multivariate observations , 1967 .

[11]  Gene H. Golub,et al.  Singular value decomposition and least squares solutions , 1970, Milestones in Matrix Computation.

[12]  Carlo Ratti,et al.  The impact of social segregation on human mobility in developing and industrialized regions , 2014, EPJ Data Science.

[13]  Carlo Ratti,et al.  A General Optimization Technique for High Quality Community Detection in Complex Networks , 2013, Physical review. E, Statistical, nonlinear, and soft matter physics.

[14]  Douglas A. Reynolds Gaussian Mixture Models , 2009, Encyclopedia of Biometrics.

[15]  Jeremy Kepner,et al.  A scalable signal processing architecture for massive graph analysis , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[16]  Stanislav Sobolevsky,et al.  Digital Urban Sensing: A Multi-layered Approach , 2018, ArXiv.

[17]  Yizhou Sun,et al.  Integrating community matching and outlier detection for mining evolutionary community outliers , 2012, KDD.

[18]  Hans-Peter Kriegel,et al.  A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise , 1996, KDD.

[19]  Rikard Laxhammar,et al.  Anomaly detection for sea surveillance , 2008, 2008 11th International Conference on Information Fusion.

[20]  Michael Gertz,et al.  EvenTweet: Online Localized Event Detection from Twitter , 2013, Proc. VLDB Endow..

[21]  Sharon Weinberger,et al.  Spies to use Twitter as crystal ball , 2011, Nature.

[22]  Carlo Ratti,et al.  Urban magnetism through the lens of geo-tagged photography , 2015, EPJ Data Science.

[23]  Cordelia Schmid,et al.  High-dimensional data clustering , 2006, Comput. Stat. Data Anal..

[24]  Sean R Eddy,et al.  What is dynamic programming? , 2004, Nature Biotechnology.

[25]  Carlo Ratti,et al.  Exploring Universal Patterns in Human Home-Work Commuting from Mobile Phone Data , 2013, PloS one.

[26]  John Bohannon In battle. Tweeting the London riots. , 2012, Science.

[27]  Arthur Zimek,et al.  Outlier Detection , 2018, Encyclopedia of Database Systems.

[28]  Josep Blat,et al.  An Analysis of Visitors' Behavior in the Louvre Museum: A Study Using Bluetooth Data , 2014, ArXiv.

[29]  Zbigniew Smoreda,et al.  Delineating Geographical Regions with Networks of Human Interactions in an Extensive Set of Countries , 2013, PloS one.

[30]  Hans-Peter Kriegel,et al.  Clustering high-dimensional data: A survey on subspace clustering, pattern-based clustering, and correlation clustering , 2009, TKDD.

[31]  Carlo Ratti,et al.  Global multi-layer network of human mobility , 2016, Int. J. Geogr. Inf. Sci..

[32]  Santo Fortunato,et al.  Community detection in graphs , 2009, ArXiv.

[33]  H. Abdi,et al.  Principal component analysis , 2010 .

[34]  Victoria J. Hodge,et al.  A Survey of Outlier Detection Methodologies , 2004, Artificial Intelligence Review.

[35]  Gaël Varoquaux,et al.  Scikit-learn: Machine Learning in Python , 2011, J. Mach. Learn. Res..

[36]  Zhengding Lu,et al.  Community mining on dynamic weighted directed graphs , 2009, CIKM-CNIKM.

[37]  Carlo Ratti,et al.  Geo-located Twitter as proxy for global mobility patterns , 2013, Cartography and geographic information science.

[38]  Zbigniew Smoreda,et al.  Identifying and modeling the structural discontinuities of human interactions , 2015, Scientific Reports.

[39]  Carlo Ratti,et al.  Cities through the Prism of People’s Spending Behavior , 2015, PloS one.

[40]  Carlo Ratti,et al.  Socioeconomic characterization of regions through the lens of individual financial transactions , 2017, PloS one.

[41]  I. Jolliffe Principal Component Analysis , 2005 .