Network-constrained bivariate clustering method for detecting urban black holes and volcanoes

ABSTRACT Urban black holes and volcanoes are typical traffic anomalies that are useful for optimizing urban planning and maintaining public safety. It is still challenging to detect arbitrarily shaped urban black holes and volcanoes considering the network constraints with less prior knowledge. This study models urban black holes and volcanoes as bivariate spatial clusters and develops a network-constrained bivariate clustering method for detecting statistically significant urban black holes and volcanoes with irregular shapes. First, an edge-expansion strategy is proposed to construct the network-constrained neighborhoods without the time-consuming calculation of the network distance between each pair of objects. Then, a network-constrained spatial scan statistic is constructed to detect urban black holes and volcanoes, and a multidirectional optimization method is developed to identify arbitrarily shaped urban black holes and volcanoes. Finally, the statistical significance of multiscale urban black holes and volcanoes is evaluated using Monte Carlo simulation. The proposed method is compared with three state-of-the-art methods using both simulated data and Beijing taxicab spatial trajectory data. The comparison shows that the proposed method can detect urban black holes and volcanoes more accurately and completely and is useful for detecting spatiotemporal variations of traffic anomalies.

[1]  Ting Li,et al.  ESCIP: An Expansion-Based Spatial Clustering Method for Inhomogeneous Point Processes , 2019, Annals of the American Association of Geographers.

[2]  M. Kulldorff,et al.  A Space–Time Permutation Scan Statistic for Disease Outbreak Detection , 2005, PLoS medicine.

[3]  F. Benjamin Zhan,et al.  A Comparison of Three Exploratory Methods for Cluster Detection in Spatial Point Patterns , 2010 .

[4]  Atsuyuki Okabe,et al.  SANET: A Toolbox for Spatial Analysis on a Network , 2006 .

[5]  Lei Zou,et al.  Detecting urban black holes based on human mobility data , 2015, SIGSPATIAL/GIS.

[6]  Peng Gao,et al.  Discovering Spatial Patterns in Origin‐Destination Mobility Data , 2012, Trans. GIS.

[7]  J. Hancock,et al.  On the use of Ripley's K-function and its derivatives to analyze domain size. , 2009, Biophysical journal.

[8]  T. Tango,et al.  International Journal of Health Geographics a Flexibly Shaped Spatial Scan Statistic for Detecting Clusters , 2005 .

[9]  Chenghu Zhou,et al.  Density-based clustering for data containing two types of points , 2015, Int. J. Geogr. Inf. Sci..

[10]  Y. Benjamini,et al.  THE CONTROL OF THE FALSE DISCOVERY RATE IN MULTIPLE TESTING UNDER DEPENDENCY , 2001 .

[11]  J. Thill,et al.  Local Indicators of Network-Constrained Clusters in Spatial Patterns Represented by a Link Attribute , 2010 .

[12]  Gemma C. Garriga,et al.  Permutation Tests for Studying Classifier Performance , 2009, 2009 Ninth IEEE International Conference on Data Mining.

[13]  Lei Shi,et al.  Anomalous Window Discovery for Linear Intersecting Paths , 2011, IEEE Transactions on Knowledge and Data Engineering.

[14]  Hui Xiong,et al.  Mining blackhole and volcano patterns in directed graphs: a general approach , 2012, Data Mining and Knowledge Discovery.

[15]  B. Singer,et al.  Controlling the False Discovery Rate: A New Application to Account for Multiple and Dependent Tests in Local Statistics of Spatial Association , 2006 .

[16]  D. Stoyan,et al.  Statistical Analysis and Modelling of Spatial Point Patterns , 2008 .

[17]  Narushige Shiode,et al.  Network-based space-time search-window technique for hotspot detection of street-level crime incidents , 2013, Int. J. Geogr. Inf. Sci..

[18]  M. Kulldorff,et al.  Spatial Scan Statistics Adjusted for Multiple Clusters , 2010 .

[19]  Hui Xiong,et al.  Detecting Blackhole and Volcano Patterns in Directed Networks , 2010, 2010 IEEE International Conference on Data Mining.

[20]  Jean-Claude Thill,et al.  Comparison of planar and network K-functions in traffic accident analysis , 2004 .

[21]  Daqing Zhang,et al.  From taxi GPS traces to social and community dynamics , 2013, ACM Comput. Surv..

[22]  David W. S. Wong The Modifiable Areal Unit Problem (MAUP) , 2004 .

[23]  Xingjian Liu,et al.  Featured Graphic. How Mixed is Beijing, China? A Visual Exploration of Mixed Land Use , 2013 .

[24]  Jirí Síma,et al.  On the NP-Completeness of Some Graph Cluster Measures , 2005, SOFSEM.

[25]  M Kulldorff,et al.  Spatial disease clusters: detection and inference. , 1995, Statistics in medicine.

[26]  Chaogui Kang,et al.  Social Sensing: A New Approach to Understanding Our Socioeconomic Environments , 2015 .

[27]  Yu Zheng,et al.  Trajectory Data Mining , 2015, ACM Trans. Intell. Syst. Technol..

[28]  Chenghu Zhou,et al.  ACOMCD: A multiple cluster detection algorithm based on the spatial scan statistic and ant colony optimization , 2012, Comput. Stat. Data Anal..

[29]  Shashi Shekhar,et al.  Transdisciplinary Foundations of Geospatial Data Science , 2017, ISPRS Int. J. Geo Inf..

[30]  M. Charlton,et al.  An Assessment of the Effectiveness of Multiple Hypothesis Testing for Geographical Anomaly Detection , 2011 .

[31]  Shino Shiode,et al.  Street‐level Spatial Scan Statistic and STAC for Analysing Street Crime Concentrations , 2011, Trans. GIS.

[32]  Licia Capra,et al.  Urban Computing: Concepts, Methodologies, and Applications , 2014, TIST.

[33]  Sang-Il Lee,et al.  Correlation and Spatial Autocorrelation , 2017, Encyclopedia of GIS.

[34]  Min Deng,et al.  Modeling the effect of scale on clustering of spatial points , 2015, Comput. Environ. Urban Syst..

[35]  Di Zhu,et al.  Street as a big geo-data assembly and analysis unit in urban studies: A case study using Beijing taxi data , 2017 .

[36]  Kim Dovey,et al.  What is functional mix? An assemblage approach , 2017 .

[37]  Edsger W. Dijkstra,et al.  A note on two problems in connexion with graphs , 1959, Numerische Mathematik.

[38]  M. Kulldorff,et al.  An elliptic spatial scan statistic , 2006, Statistics in medicine.

[39]  David Bernstein,et al.  Some map matching algorithms for personal navigation assistants , 2000 .

[40]  M. Kulldorff A spatial scan statistic , 1997 .

[41]  Alan T. Murray,et al.  Spatial Clustering Overview and Comparison: Accuracy, Sensitivity, and Computational Expense , 2014 .