Finding communities in networks in the strong and almost-strong sense.

Finding communities, or clusters or modules, in networks can be done by optimizing an objective function defined globally and/or by specifying conditions which must be satisfied by all communities. Radicchi et al. [Proc. Natl. Acad. Sci. USA 101, 2658 (2004)] define a susbset of vertices of a network to be a community in the strong sense if each vertex of that subset has a larger inner degree than its outer degree. A partition in the strong sense has only strong communities. In this paper we first define an enumerative algorithm to list all partitions in the strong sense of a network of moderate size. The results of this algorithm are given for the Zachary karate club data set, which is solved by hand, as well as for several well-known real-world problems of the literature. Moreover, this algorithm is slightly modified in order to apply it to larger networks, keeping only partitions with the largest number of communities. It is shown that some of the partitions obtained are informative, although they often have only a few communities, while they fail to give any information in other cases having only one community. It appears that degree 2 vertices play a big role in forcing large inhomogeneous communities. Therefore, a weakening of the strong condition is proposed and explored: we define a partition in the almost-strong sense by substituting a nonstrict inequality to a strict one in the definition of strong community for all vertices of degree 2. Results, for the same set of problems as before, then give partitions with a larger number of communities and are more informative.

[1]  F. Luccio,et al.  On the Decomposition of Networks in Minimally Interconnected Subnetworks , 1969 .

[2]  Claude Berge,et al.  Graphs and Hypergraphs , 2021, Clustering.

[3]  W. Zachary,et al.  An Information Flow Model for Conflict and Fission in Small Groups , 1977, Journal of Anthropological Research.

[4]  John Scott What is social network analysis , 2010 .

[5]  Laurence A. Wolsey,et al.  Integer and Combinatorial Optimization , 1988, Wiley interscience series in discrete mathematics and optimization.

[6]  Laurence A. Wolsey,et al.  Integer and Combinatorial Optimization , 1988 .

[7]  Andrew B. Kahng,et al.  New spectral methods for ratio cut partitioning and clustering , 1991, IEEE Trans. Comput. Aided Des. Integr. Circuits Syst..

[8]  Donald E. Knuth,et al.  The Stanford GraphBase - a platform for combinatorial computing , 1993 .

[9]  Stanley Wasserman,et al.  Social Network Analysis: Methods and Applications , 1994 .

[10]  Martine D. F. Schlag,et al.  Spectral K-way ratio-cut partitioning and clustering , 1994, IEEE Trans. Comput. Aided Des. Integr. Circuits Syst..

[11]  Judd Harrison Michael,et al.  Labor dispute reconciliation in a forest products manufacturing facility , 1997 .

[12]  Jitendra Malik,et al.  Normalized Cuts and Image Segmentation , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[13]  M E J Newman,et al.  Community structure in social and biological networks , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[14]  Jonathan L. Gross,et al.  Handbook of graph theory , 2007, Discrete mathematics and its applications.

[15]  D. Lusseau,et al.  The bottlenose dolphin community of Doubtful Sound features a large proportion of long-lasting associations , 2003, Behavioral Ecology and Sociobiology.

[16]  Khurram Hassan-Shafique,et al.  Partitioning A Graph In Alliances And Its Application To Data Clustering , 2004 .

[17]  William I. Gasarch Review of "Handbook of Graph Theory edited by Gross and Yellen." CRC, 2004. , 2004, SIGA.

[18]  Claudio Castellano,et al.  Defining and identifying communities in networks. , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[19]  Peter Koch,et al.  Forest Products Journal 23 , 2004 .

[20]  Martin Rosvall,et al.  An information-theoretic framework for resolving community structure in complex networks , 2007, Proceedings of the National Academy of Sciences.

[21]  Luonan Chen,et al.  Quantitative function for community detection. , 2008 .

[22]  Peng Zhang,et al.  Comparative definition of community and corresponding identifying algorithm. , 2008, Physical review. E, Statistical, nonlinear, and soft matter physics.

[23]  Xiang-Sun Zhang,et al.  Modularity optimization in community detection of complex networks , 2009 .

[24]  C O Dorso,et al.  Alternative approach to community detection in networks. , 2009, Physical review. E, Statistical, nonlinear, and soft matter physics.

[25]  A. D. Medus,et al.  Community Detection in Networks , 2010, Int. J. Bifurc. Chaos.

[26]  Pierre Hansen,et al.  Edge ratio and community structure in networks. , 2009, Physical review. E, Statistical, nonlinear, and soft matter physics.

[27]  Mark Newman,et al.  Networks: An Introduction , 2010 .

[28]  P. Hansen,et al.  Column generation algorithms for exact modularity maximization in networks. , 2010, Physical review. E, Statistical, nonlinear, and soft matter physics.

[29]  Tom A. B. Snijders,et al.  Social Network Analysis , 2011, International Encyclopedia of Statistical Science.

[30]  Mark E. J. Newman,et al.  An efficient and principled method for detecting communities in networks , 2011, Physical review. E, Statistical, nonlinear, and soft matter physics.

[31]  V A Traag,et al.  Narrow scope for resolution-limit-free community detection. , 2011, Physical review. E, Statistical, nonlinear, and soft matter physics.

[32]  Marianna Bolla,et al.  Penalized versions of the Newman-Girvan modularity and their relation to normalized cuts and k-means clustering. , 2011, Physical review. E, Statistical, nonlinear, and soft matter physics.