Mining Network Hotspots with Holes: A Summary of Results

Given a spatial network and a collection of activities (e.g. crime locations), the problem of Mining Network Hotspots with Holes (MNHH) finds network hotspots with doughnut shaped spatial footprint, where the concentration of activities is unusually high (e.g. statistically significant). MNHH is important for societal applications such as criminology, where it may focus the efforts of officials to identify a crime source. MNHH is challenging because of the large number of candidates and the high computational cost of statistical significance test. Previous work focused either on geometry based hotspots (e.g. circular, ring-shaped) on Euclidean space or connected subgraphs (e.g. shortest path), limiting the ability to detect statistically significant hotspots with holes on a spatial network. This paper proposes a novel Network Hotspot with Hole Generator (NHHG) algorithm to detect network hotspots with holes. The proposed algorithm features refinements that improve the performance of a naive approach. Case studies on real crime datasets confirm the superiority of NHHG over previous approaches. Experimental results on real data show that the proposed approach yields substantial computational savings without reducing result quality.

[1]  Lei Shi,et al.  Anomalous Window Discovery for Linear Intersecting Paths , 2011, IEEE Transactions on Knowledge and Data Engineering.

[2]  Hans-Peter Kriegel,et al.  A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise , 1996, KDD.

[3]  C. Brunsdon,et al.  University of Huddersfield Repository How places influence crime: The impact of surrounding areas on neighbourhood burglary rates in a British City , 2021 .

[4]  Shashi Shekhar,et al.  Geographically Robust Hotspot Detection: A Summary of Results , 2015, 2015 IEEE International Conference on Data Mining Workshop (ICDMW).

[5]  David J. C. MacKay,et al.  Information Theory, Inference, and Learning Algorithms , 2004, IEEE Transactions on Information Theory.

[6]  J. Mennis,et al.  Does vegetation encourage or suppress urban crime? Evidence from Philadelphia, PA , 2012 .

[7]  Alan T. Murray,et al.  Spatial Clustering Overview and Comparison: Accuracy, Sensitivity, and Computational Expense , 2014 .

[8]  Shashi Shekhar,et al.  Significant Route Discovery: A Summary of Results , 2014, GIScience.

[9]  Shashi Shekhar,et al.  Ring-Shaped Hotspot Detection: A Summary of Results , 2014, 2014 IEEE International Conference on Data Mining.

[10]  Atsuyuki Okabe,et al.  Spatial Analysis Along Networks: Statistical and Computational Methods , 2012 .

[11]  M. Kulldorff,et al.  An elliptic spatial scan statistic , 2006, Statistics in medicine.

[12]  Andrew W. Moore,et al.  A Fast Multi-Resolution Method for Detection of Significant Spatial Disease Clusters , 2003, NIPS.

[13]  Trisalyn A. Nelson,et al.  Predictive crime mapping , 2015 .

[14]  Diansheng Guo,et al.  Local entropy map: a nonparametric approach to detecting spatially varying multivariate relationships , 2010, Int. J. Geogr. Inf. Sci..

[15]  Narushige Shiode,et al.  Network-based space-time search-window technique for hotspot detection of street-level crime incidents , 2013, Int. J. Geogr. Inf. Sci..

[16]  Marcelo Azevedo Costa,et al.  Constrained spanning tree algorithms for irregularly-shaped spatial clustering , 2012, Comput. Stat. Data Anal..

[17]  Jane Law,et al.  Bayesian Spatio-Temporal Modeling for Analysing Local Patterns of Crime Over Time at the Small-Area Level , 2014 .

[18]  J. A. Hartigan,et al.  A k-means clustering algorithm , 1979 .

[19]  Shashi Shekhar,et al.  Elliptical Hotspot Detection: A Summary of Results , 2015, BigSpatial@SIGSPATIAL.

[20]  Atsuyuki Okabe,et al.  The SANET Toolbox: New Methods for Network Spatial Analysis , 2006, Trans. GIS.

[21]  N. Levine Crime Mapping and the CrimeStat Program , 2006 .

[22]  M. Kulldorff A spatial scan statistic , 1997 .