Spatial Correlations in Attribute Communities

Community detection is an important tool for exploring and classifying the properties of large complex networks and should be of great help for spatial networks. Indeed, in addition to their location, nodes in spatial networks can have attributes such as the language for individuals, or any other socio-economical feature that we would like to identify in communities. We discuss in this paper a crucial aspect which was not considered in previous studies which is the possible existence of correlations between space and attributes. Introducing a simple toy model in which both space and node attributes are considered, we discuss the effect of space-attribute correlations on the results of various community detection methods proposed for spatial networks in this paper and in previous studies. When space is irrelevant, our model is equivalent to the stochastic block model which has been shown to display a detectability-non detectability transition. In the regime where space dominates the link formation process, most methods can fail to recover the communities, an effect which is particularly marked when space-attributes correlations are strong. In this latter case, community detection methods which remove the spatial component of the network can miss a large part of the community structure and can lead to incorrect results.

[1]  William M. Rand,et al.  Objective Criteria for the Evaluation of Clustering Methods , 1971 .

[2]  Alain Guénoche,et al.  Comparison of Distance Indices Between Partitions , 2006, Data Science and Classification.

[3]  Michael T. Gastner,et al.  The complex network of global cargo ship movements , 2010, Journal of The Royal Society Interface.

[4]  Alessandro Chessa,et al.  Commuter networks and community detection: A method for planning sub regional areas , 2011, ArXiv.

[5]  Cristopher Moore,et al.  Phase transition in the detection of modules in sparse networks , 2011, Physical review letters.

[6]  N. F. Stewart,et al.  The Gravity Model in Transportation Analysis - Theory and Extensions , 1990 .

[7]  Steve Gregory,et al.  Ordered community structure in networks , 2011, 1104.0923.

[8]  Marc Barthelemy,et al.  Spatial Networks , 2010, Encyclopedia of Social Network Analysis and Mining.

[9]  P. Ronhovde,et al.  Phase transitions in random Potts systems and the community detection problem: spin-glass type and dynamic perspectives , 2010, 1008.2699.

[10]  Fabian J. Theis,et al.  Modularity maximization and tree clustering: Novel ways to determine effective geographic borders , 2011, ArXiv.

[11]  Andrea Lancichinetti,et al.  Community detection algorithms: a comparative analysis: invited presentation, extended abstract , 2009, VALUETOOLS.

[12]  R. Guimerà,et al.  Modularity from fluctuations in random graphs and complex networks. , 2004, Physical review. E, Statistical, nonlinear, and soft matter physics.

[13]  Zohar Nussinov,et al.  A Replica Inference Approach to Unsupervised Multi-Scale Image Segmentation , 2011, Physical review. E, Statistical, nonlinear, and soft matter physics.

[14]  Mason A. Porter,et al.  Communities in Networks , 2009, ArXiv.

[15]  Santo Fortunato,et al.  Community detection in graphs , 2009, ArXiv.

[16]  S. Fortunato,et al.  Resolution limit in community detection , 2006, Proceedings of the National Academy of Sciences.

[17]  Zbigniew Smoreda,et al.  Interplay between Telecommunications and Face-to-Face Interactions: A Study Using Mobile Phone Data , 2011, PloS one.

[18]  Renaud Lambiotte,et al.  Uncovering space-independent communities in spatial networks , 2010, Proceedings of the National Academy of Sciences.

[19]  M E J Newman,et al.  Finding and evaluating community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[20]  Ricardo J. G. B. Campello,et al.  A fuzzy extension of the Rand index and other related indexes for clustering and classification assessment , 2007, Pattern Recognit. Lett..

[21]  Jean-Loup Guillaume,et al.  Fast unfolding of communities in large networks , 2008, 0803.0476.

[22]  Leon Danon,et al.  Comparing community structure identification , 2005, cond-mat/0505245.

[23]  R. Guimerà,et al.  The worldwide air transportation network: Anomalous centrality, community structure, and cities' global roles , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[24]  Bill Mitchell,et al.  Networks and geography: Modelling community network structures as the outcome of both spatial and network processes , 2012, Soc. Networks.

[25]  Alessandro Vespignani,et al.  The effects of spatial constraints on the evolution of weighted complex networks , 2005, physics/0504029.

[26]  Michalis Vazirgiannis,et al.  c ○ 2001 Kluwer Academic Publishers. Manufactured in The Netherlands. On Clustering Validation Techniques , 2022 .

[27]  M. Newman,et al.  Robustness of community structure in networks. , 2007, Physical review. E, Statistical, nonlinear, and soft matter physics.

[28]  Vito Latora,et al.  Complex Networks: New Trends for the Analysis of Brain Connectivity , 2010, Int. J. Bifurc. Chaos.

[29]  Anil K. Jain,et al.  Algorithms for Clustering Data , 1988 .