Spatial co-location pattern mining of facility points-of-interest improved by network neighborhood and distance decay effects

ABSTRACT The aim of mining spatial co-location patterns is to find the corresponding subsets of spatial features that have strong spatial correlation in the real world. This is an important technology for the extraction and comprehension of implicit knowledge in large spatial databases. However, existing methods of co-location mining consider events as taking place in a homogeneous and isotropic context in Euclidean space, whereas the physical movement in an urban space is usually constrained by a road network. Furthermore, previous works do not take the ‘distance decay effect’ of spatial interactions into account, which may reduce the effectiveness of the result. Here we propose an improved spatial co-location pattern mining method, including the network-constrained neighborhood and addition of a distance-decay function, to find the spatial dependence between network phenomena (e.g. urban facilities). The underlying idea is to utilize a model function in the interest measure calculation to weight the contribution of a co-location to the overall interest measure instance inversely proportional to the separation distance. Our approach was evaluated through extensive experiments using facility points-of-interest data sets. The results show that the network-constrained approach is a more effective method than the traditional one in network-structured space. The proposed approach can also be applied to other human activities (e.g. traffic accidents) constrained by a street network.

[1]  Atsuyuki Okabe,et al.  The K-Function Method on a Network and Its Computational Implementation , 2010 .

[2]  Yan Huang,et al.  Discovering Spatial Co-location Patterns: A Summary of Results , 2001, SSTD.

[3]  Shino Shiode,et al.  Analysis of a Distribution of Point Events Using the Network-Based Quadrat Method , 2008 .

[4]  Jiawei Han,et al.  Discovery of Spatial Association Rules in Geographic Information Databases , 1995, SSD.

[5]  W. Tobler A Computer Movie Simulating Urban Growth in the Detroit Region , 1970 .

[6]  Yue-Hong Chou,et al.  Exploring spatial analysis in geographic information systems , 1997 .

[7]  Jiaogen Zhou,et al.  KNFCOM-T: a k-nearest features-based co-location pattern mining algorithm for large spatial data sets by using T-trees , 2008, Int. J. Bus. Intell. Data Min..

[8]  Maribel Yasmina Santos,et al.  Geo-spatial data mining in the analysis of a demographic database , 2005, Soft Comput..

[9]  Tinghua Ai,et al.  The analysis and delimitation of Central Business District using network kernel density estimation , 2015 .

[10]  Robert Bembenik,et al.  FARICS: a method of mining spatial association rules and collocations using clustering and Delaunay diagrams , 2009, Journal of Intelligent Information Systems.

[11]  Shawn W. Laffan,et al.  The effect of neighbourhood definitions on spatio-temporal models of disease outbreaks: Separation distance versus range overlap. , 2011, Preventive veterinary medicine.

[12]  Noel A Cressie,et al.  Statistics for Spatial Data. , 1992 .

[13]  Nazha Selmaoui-Folcher,et al.  Domain-driven co-location mining , 2014, GeoInformatica.

[14]  Shino Shiode,et al.  Street‐level Spatial Scan Statistic and STAC for Analysing Street Crime Concentrations , 2011, Trans. GIS.

[15]  Hui Xiong,et al.  Discovering colocation patterns from spatial data sets: a general approach , 2004, IEEE Transactions on Knowledge and Data Engineering.

[16]  Shashi Shekhar,et al.  A partial join approach for mining co-location patterns , 2004, GIS '04.

[17]  Jing Tian,et al.  Mining Co-location Patterns Between Network Spatial Phenomena , 2015 .

[18]  Edsger W. Dijkstra,et al.  A note on two problems in connexion with graphs , 1959, Numerische Mathematik.

[19]  Raúl Sierra-Alcocer,et al.  Exploratory analysis of the interrelations between co-located boolean spatial features using network graphs , 2012, Int. J. Geogr. Inf. Sci..

[20]  Jiawei Han,et al.  Geographic Data Mining and Knowledge Discovery , 2001 .

[21]  Atsuyuki Okabe,et al.  A kernel density estimation method for networks, its computational method and a GIS‐based tool , 2009, Int. J. Geogr. Inf. Sci..

[22]  Atsuyuki Okabe,et al.  Spatial Analysis Along Networks: Statistical and Computational Methods , 2012 .

[23]  Rakesh Agarwal,et al.  Fast Algorithms for Mining Association Rules , 1994, VLDB 1994.

[24]  Shashi Shekhar,et al.  A Joinless Approach for Mining Spatial Colocation Patterns , 2006, IEEE Transactions on Knowledge and Data Engineering.

[25]  Robert Haining,et al.  Global and local indicators of spatial association between points and polygons: A study of land use change , 2013, Int. J. Appl. Earth Obs. Geoinformation.

[26]  Jun Wei Liu,et al.  Mining Association Rules in Spatio‐Temporal Data: An Analysis of Urban Socioeconomic and Land Cover Change , 2005, Trans. GIS.

[27]  Tinghua Ai,et al.  Generation of constrained network Voronoi diagram using linear tessellation and expansion method , 2015, Comput. Environ. Urban Syst..

[28]  Jordi Jofre-Monseny,et al.  The mechanisms of agglomeration: Evidence from the effect of inter-industry relations on the location of new firms , 2011 .

[29]  XiongHui,et al.  Mining Co-Location Patterns with Rare Events from Spatial Data Sets , 2006 .

[30]  Barry J. Kronenfeld,et al.  The Colocation Quotient: A New Measure of Spatial Association Between Categorical Subsets of Points. 协同区位商:点集分类子集间空间关联性的新度量标准 , 2011 .

[31]  V. Latora,et al.  Street Centrality and Densities of Retail and Services in Bologna, Italy , 2009 .

[32]  Hui Xiong,et al.  Mining Co-Location Patterns with Rare Events from Spatial Data Sets , 2006, GeoInformatica.