Adding cohesion constraints to models for modularity maximization in networks

Finding communities in complex networks is a topic of much current research and has applications in many domains. On the one hand, criteria for doing so have been proposed, the most studied of which is modularity. On the other hand, properties to be satisfied by each community have been suggested. It has recently been observed that one of the best known such properties, i.e. the weak condition, proposed by Radicchi et al. (2004, Proc. Natl. Acad. Sci. USA, 101, 2658) was not satisfied by one or more communities in a partition which maximizes (approximately) some of the best known criteria. It was therefore proposed by Wang et al. (2009, Lect. Notes in Oper. Res., 11, 142) to merge both approaches by maximizing a criterion subject to the weak condition. We consider five community-defining conditions, which we call cohesion conditions (strong, semi-strong, almost-strong, weak and extra-weak conditions). We add cohesion conditions, one at a time, as constraints to a modularity maximization problem, thus obtaining new mathematical optimization models, which we solve exactly. We study the impact of cohesion conditions on modularity maximization. Strong, semi-strong and almost-strong cohesion conditions appear to be generally too restrictive and the extra-weak condition too lax. The weak condition is verified by some but not all modularity maximizing partitions of the considered real-world networks. Imposition of this condition on those partitions for which some communities do not verify it reduces modularity moderately but sometimes changes the optimal number of communities and their composition. We also show, on a known example, that the strong, semi-strong and almost-strong conditions allow us to overcome the resolution limit of modularity. The behaviour of modularity maximization subject to cohesion constraints appears to be coherent with the detectability of the modular structure of the considered networks.

[1]  S. Fortunato,et al.  Resolution limit in community detection , 2006, Proceedings of the National Academy of Sciences.

[2]  Luonan Chen,et al.  Quantitative function for community detection. , 2008 .

[3]  Roger Guimerà,et al.  Extracting the hierarchical organization of complex systems , 2007, Proceedings of the National Academy of Sciences.

[4]  Donald E. Knuth,et al.  The Stanford GraphBase - a platform for combinatorial computing , 1993 .

[5]  Lazaros G. Papageorgiou,et al.  Finding community structures in complex networks using mixed integer optimisation , 2007 .

[6]  Filippo Radicchi,et al.  A paradox in community detection , 2013, ArXiv.

[7]  D. Lusseau,et al.  The bottlenose dolphin community of Doubtful Sound features a large proportion of long-lasting associations , 2003, Behavioral Ecology and Sociobiology.

[8]  Judd Harrison Michael,et al.  Modeling the communication network in a sawmill , 1997 .

[9]  R. Guimerà,et al.  Functional cartography of complex metabolic networks , 2005, Nature.

[10]  Alberto Costa,et al.  Some remarks on modularity density , 2014, 1409.4063.

[11]  Pierre Hansen,et al.  Loops and multiple edges in modularity maximization of networks. , 2009, Physical review. E, Statistical, nonlinear, and soft matter physics.

[12]  Pierre Hansen,et al.  Reformulation of a model for hierarchical divisive graph modularity maximization , 2012, Annals of Operations Research.

[13]  A. Medus,et al.  Detection of community structures in networks via global optimization , 2005 .

[14]  Ulrik Brandes,et al.  On Modularity Clustering , 2008, IEEE Transactions on Knowledge and Data Engineering.

[15]  Elchanan Mossel,et al.  Spectral redemption in clustering sparse networks , 2013, Proceedings of the National Academy of Sciences.

[16]  Andreas Noack,et al.  Multi-level Algorithms for Modularity Clustering , 2008, SEA.

[17]  Frank Plastria,et al.  Formulating logical implications in combinatorial optimisation , 2002, Eur. J. Oper. Res..

[18]  Mark Newman,et al.  Networks: An Introduction , 2010 .

[19]  J. Reichardt,et al.  Statistical mechanics of community detection. , 2006, Physical review. E, Statistical, nonlinear, and soft matter physics.

[20]  Cristopher Moore,et al.  Asymptotic analysis of the stochastic block model for modular networks and its algorithmic applications , 2011, Physical review. E, Statistical, nonlinear, and soft matter physics.

[21]  S. Snyder,et al.  Proceedings of the National Academy of Sciences , 1999 .

[22]  Benjamin H. Good,et al.  Performance of modularity maximization in practical contexts. , 2009, Physical review. E, Statistical, nonlinear, and soft matter physics.

[23]  A. Arenas,et al.  Community detection in complex networks using extremal optimization. , 2005, Physical review. E, Statistical, nonlinear, and soft matter physics.

[24]  M E J Newman,et al.  Community structure in social and biological networks , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[25]  Xiang-Sun Zhang,et al.  A Constrained Optimization Method for Community Detection , 2009 .

[26]  Yoshiko Wakabayashi,et al.  A cutting plane algorithm for a clustering problem , 1989, Math. Program..

[27]  W. Zachary,et al.  An Information Flow Model for Conflict and Fission in Small Groups , 1977, Journal of Anthropological Research.

[28]  Pierre Hansen,et al.  Edge ratio and community structure in networks. , 2009, Physical review. E, Statistical, nonlinear, and soft matter physics.

[29]  P. Hansen,et al.  Column generation algorithms for exact modularity maximization in networks. , 2010, Physical review. E, Statistical, nonlinear, and soft matter physics.

[30]  Judd Harrison Michael,et al.  Labor dispute reconciliation in a forest products manufacturing facility , 1997 .

[31]  Xiang-Sun Zhang,et al.  Modularity optimization in community detection of complex networks , 2009 .

[32]  Alberto Costa,et al.  MILP formulations for the modularity density maximization problem , 2015, Eur. J. Oper. Res..

[33]  P. Hansen,et al.  Finding communities in networks in the strong and almost-strong sense. , 2011, Physical review. E, Statistical, nonlinear, and soft matter physics.

[34]  David Kempe,et al.  Modularity-maximizing graph communities via mathematical programming , 2007, 0710.2533.

[35]  Santo Fortunato,et al.  Community detection in graphs , 2009, ArXiv.

[36]  Jean-Loup Guillaume,et al.  Fast unfolding of communities in large networks , 2008, 0803.0476.

[37]  Peng Zhang,et al.  Comparative definition of community and corresponding identifying algorithm. , 2008, Physical review. E, Statistical, nonlinear, and soft matter physics.

[38]  Xiang-Sun Zhang,et al.  A combinatorial model and algorithm for globally searching community structure in complex networks , 2010, Journal of Combinatorial Optimization.

[39]  Everett M. Rogers,et al.  Communication Networks: Toward a New Paradigm for Research , 1980 .

[40]  Cristopher Moore,et al.  Phase transition in the detection of modules in sparse networks , 2011, Physical review letters.

[41]  P. Hansen,et al.  Locally optimal heuristic for modularity maximization of networks. , 2011, Physical review. E, Statistical, nonlinear, and soft matter physics.

[42]  Alex Arenas,et al.  Analysis of the structure of complex networks at different resolution levels , 2007, physics/0703218.

[43]  M E J Newman,et al.  Modularity and community structure in networks. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[44]  Sophia Tsoka,et al.  Robustness of the p53 network and biological hackers , 2005, FEBS letters.

[45]  Raj Rao Nadakuditi,et al.  Graph spectra and the detectability of community structure in networks , 2012, Physical review letters.

[46]  Amedeo Caflisch,et al.  Efficient modularity optimization by multistep greedy algorithm and vertex mover refinement. , 2007, Physical review. E, Statistical, nonlinear, and soft matter physics.

[47]  Filippo Radicchi,et al.  Detectability of communities in heterogeneous networks. , 2013, Physical review. E, Statistical, nonlinear, and soft matter physics.

[48]  Claudio Castellano,et al.  Defining and identifying communities in networks. , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[49]  Peter Koch,et al.  Forest Products Journal 23 , 2004 .

[50]  J. Doye,et al.  Identifying communities within energy landscapes. , 2004, Physical review. E, Statistical, nonlinear, and soft matter physics.

[51]  Lars Kai Hansen,et al.  Deterministic modularity optimization , 2007 .