DASSCAN: A Density and Adjacency Expansion-Based Spatial Structural Community Detection Algorithm for Networks

Existing spatial community detection algorithms are usually modularity based. Motivated by different applications, these algorithms build appropriate spatial null models to describe spatial effects on the connection of nodes. Then, by choosing certain modularity maximizing strategies, they try to find interesting community structures hidden behind the null models. In this paper, a novel structural similarity-based spatial network community is defined, which is based on the shared neighbors of nodes. In addition, there are two other special node roles defined: the spatial hub and outlier. Then, a density and adjacency expansion-based spatial structural community detection algorithm for networks (DASSCAN) is proposed for mining these communities, hubs and outliers. DASSCAN uses structural similarity to measure the relationship between nodes, and then, structurally similar and spatially adjacent nodes are merged into communities using a density-based clustering method and spatial adjacency expansion strategy. Comparative experiments on two kinds of Chinese train line networks clarified the accuracy and efficiency of DASSCAN in finding the spatial structural communities, spatial hubs and outliers. The communities found can be used to uncover more interesting spatial structural patterns, and the hubs and outliers are more accurate and have more valuable meanings.

[1]  Hans-Peter Kriegel,et al.  A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise , 1996, KDD.

[2]  M E J Newman,et al.  Fast algorithm for detecting community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[3]  Diansheng Guo,et al.  Flow Mapping and Multivariate Visualization of Large Spatial Interaction Data , 2009, IEEE Transactions on Visualization and Computer Graphics.

[4]  R. Guimerà,et al.  The worldwide air transportation network: Anomalous centrality, community structure, and cities' global roles , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[5]  Benjamin H. Good,et al.  Performance of modularity maximization in practical contexts. , 2009, Physical review. E, Statistical, nonlinear, and soft matter physics.

[6]  Nikolaos Geroliminis,et al.  On the spatial partitioning of urban transportation networks , 2012 .

[7]  Jean-Loup Guillaume,et al.  Fast unfolding of communities in large networks , 2008, 0803.0476.

[8]  Yiannis Kompatsiaris,et al.  Community detection in Social Media , 2012, Data Mining and Knowledge Discovery.

[9]  Renaud Lambiotte,et al.  Uncovering space-independent communities in spatial networks , 2010, Proceedings of the National Academy of Sciences.

[10]  Paulo Shakarian,et al.  Mining for Spatially-Near Communities in Geo-Located Social Networks , 2013, AAAI Fall Symposia.

[11]  Yu Chen,et al.  Finding community structure in spatially constrained complex networks , 2015, Int. J. Geogr. Inf. Sci..

[12]  Tsuyoshi Murata,et al.  Extending modularity by incorporating distance functions in the null model , 2012, ArXiv.

[13]  Paulo Shakarian,et al.  Mining for geographically disperse communities in social networks by leveraging distance modularity , 2013, KDD.

[14]  Yong Gao,et al.  Uncovering Patterns of Inter-Urban Trip and Spatial Interaction from Social Media Check-In Data , 2013, PloS one.

[15]  P. Borgnat,et al.  Enhancing Space-Aware Community Detection Using Degree Constrained Spatial Null Model , 2017 .

[16]  Xiaowei Xu,et al.  SCAN: a structural clustering algorithm for networks , 2007, KDD '07.

[17]  K. Seaton,et al.  Stations, trains and small-world networks , 2003, cond-mat/0311254.

[18]  Carlo Ratti,et al.  A General Optimization Technique for High Quality Community Detection in Complex Networks , 2013, Physical review. E, Statistical, nonlinear, and soft matter physics.

[19]  Diansheng Guo,et al.  Regionalization with dynamically constrained agglomerative clustering and partitioning (REDCAP) , 2008, Int. J. Geogr. Inf. Sci..

[20]  Gerardo Chowell,et al.  Null Models for Community Detection in Spatially-Embedded, Temporal Networks , 2014, bioRxiv.

[21]  Santo Fortunato,et al.  Community detection in graphs , 2009, ArXiv.

[22]  Steve H. L. Liang,et al.  Geo-located community detection in Twitter with enhanced fast-greedy optimization of modularity: the case study of typhoon Haiyan , 2015, Int. J. Geogr. Inf. Sci..

[23]  Manfred M. Fischer,et al.  The Community Structure of Research and Development Cooperation in Europe: Evidence from a Social Network Perspective , 2011 .

[24]  Marc Barthelemy,et al.  Spatial Correlations in Attribute Communities , 2011, PloS one.

[25]  M E J Newman,et al.  Finding and evaluating community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[26]  Li Gong,et al.  Revealing travel patterns and city structure with taxi trip data , 2016 .

[27]  Hans-Peter Kriegel,et al.  DBSCAN Revisited, Revisited , 2017, ACM Trans. Database Syst..

[28]  John Riedl,et al.  Item-based collaborative filtering recommendation algorithms , 2001, WWW '01.

[29]  P. Tan,et al.  Node roles and community structure in networks , 2007, WebKDD/SNA-KDD '07.

[30]  Hai Le Vu,et al.  Spatial Partitioning of Large Urban Road Networks , 2014, EDBT.

[31]  Hai Le Vu,et al.  Partitioning road networks using density peak graphs: Efficiency vs. accuracy , 2017, Inf. Syst..

[32]  Hans-Peter Kriegel,et al.  Density-Based Clustering in Spatial Databases: The Algorithm GDBSCAN and Its Applications , 1998, Data Mining and Knowledge Discovery.

[33]  Song Gao,et al.  Discovering Spatial Interaction Communities from Mobile Phone Data , 2013 .

[34]  Emanuele Strano,et al.  The Structure of Spatial Networks and Communities in Bicycle Sharing Systems , 2013, PloS one.