Community Detection in Bipartite Networks: Algorithms and Case studies

There is increasing motivation to study bipartite complex networks as a separate category and, in particular, to investigate their community structure. We outline recent work in the area and focus on two high-performing algorithms for unipartite networks, the modularity-based Louvain and the flow-based Infomap. We survey modifications of modularity-based algorithms to adapt them to the bipartite case. As Infomap cannot be applied to bipartite networks for theoretical reasons, our solution is to work with the primary projected network. We apply both algorithms to four projected networks of increasing size and complexity. Our results support the conclusion that the clusters found by Infomap are meaningful and better represent ground truth in the bipartite network than those found by Louvain.

[1]  Burleigh B. Gardner,et al.  Deep South: A Social Anthropological Study of Caste and Class , 1942 .

[2]  D. Huffman A Method for the Construction of Minimum-Redundancy Codes , 1952 .

[3]  R. Carter 11 – IT and society , 1991 .

[4]  Bruno R. Preiss,et al.  Data Structures and Algorithms with Object-Oriented Design Patterns in Java , 1999 .

[5]  M. Newman,et al.  The structure of scientific collaboration networks. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[6]  M E J Newman,et al.  Community structure in social and biological networks , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[7]  Isabelle Guyon,et al.  A Stability Based Method for Discovering Structure in Clustered Data , 2001, Pacific Symposium on Biocomputing.

[8]  Adilson E Motter,et al.  Heterogeneity in oscillator networks: are smaller worlds easier to synchronize? , 2003, Physical review letters.

[9]  L. Freeman Finding Social Groups: A Meta-Analysis of the Southern Women Data , 2003 .

[10]  Mark E. J. Newman,et al.  The Structure and Function of Complex Networks , 2003, SIAM Rev..

[11]  M. Newman,et al.  Finding community structure in very large networks. , 2004, Physical review. E, Statistical, nonlinear, and soft matter physics.

[12]  R. Guimerà,et al.  Modularity from fluctuations in random graphs and complex networks. , 2004, Physical review. E, Statistical, nonlinear, and soft matter physics.

[13]  Jean-Loup Guillaume,et al.  Bipartite structure of all complex networks , 2004, Inf. Process. Lett..

[14]  Claudio Castellano,et al.  Defining and identifying communities in networks. , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[15]  M E J Newman,et al.  Finding and evaluating community structure in networks. , 2003, Physical review. E, Statistical, nonlinear, and soft matter physics.

[16]  Mark A. Pitt,et al.  Advances in Minimum Description Length: Theory and Applications , 2005 .

[17]  T. Vicsek,et al.  Uncovering the overlapping community structure of complex networks in nature and society , 2005, Nature.

[18]  Leon Danon,et al.  Comparing community structure identification , 2005, cond-mat/0505245.

[19]  M. Newman,et al.  Finding community structure in networks using the eigenvectors of matrices. , 2006, Physical review. E, Statistical, nonlinear, and soft matter physics.

[20]  Matthieu Latapy,et al.  Computing Communities in Large Networks Using Random Walks , 2004, J. Graph Algorithms Appl..

[21]  M. Barber Modularity and community detection in bipartite networks. , 2007, Physical review. E, Statistical, nonlinear, and soft matter physics.

[22]  S. Fortunato,et al.  Resolution limit in community detection , 2006, Proceedings of the National Academy of Sciences.

[23]  Réka Albert,et al.  Near linear time algorithm to detect community structures in large-scale networks. , 2007, Physical review. E, Statistical, nonlinear, and soft matter physics.

[24]  Roger Guimerà,et al.  Module identification in bipartite and directed networks. , 2007, Physical review. E, Statistical, nonlinear, and soft matter physics.

[25]  Jean-Loup Guillaume,et al.  Fast unfolding of communities in large networks , 2008, 0803.0476.

[26]  F. Radicchi,et al.  Benchmark graphs for testing community detection algorithms. , 2008, Physical review. E, Statistical, nonlinear, and soft matter physics.

[27]  Peng Zhang,et al.  Comparative definition of community and corresponding identifying algorithm. , 2008, Physical review. E, Statistical, nonlinear, and soft matter physics.

[28]  Martin Rosvall,et al.  Maps of random walks on complex networks reveal community structure , 2007, Proceedings of the National Academy of Sciences.

[29]  M. Emiliani,et al.  Terrorism in Indonesia: Noordin's Networks , 2009 .

[30]  Linyuan Lü,et al.  Predicting missing links via local information , 2009, 0901.0553.

[31]  M. Barber,et al.  Detecting network communities by propagating labels under constraints. , 2009, Physical review. E, Statistical, nonlinear, and soft matter physics.

[32]  Santo Fortunato,et al.  Community detection in graphs , 2009, ArXiv.

[33]  Andrea Lancichinetti,et al.  Community detection algorithms: a comparative analysis: invited presentation, extended abstract , 2009, VALUETOOLS.

[34]  R. Lambiotte,et al.  Line graphs, link partitions, and overlapping communities. , 2009, Physical review. E, Statistical, nonlinear, and soft matter physics.

[35]  Rik Sarkar,et al.  Community Detection , 2014, Encyclopedia of Machine Learning and Data Mining.

[36]  Tsuyoshi Murata,et al.  An Efficient Algorithm for Optimizing Bipartite Modularity in Bipartite Networks , 2010, J. Adv. Comput. Intell. Intell. Informatics.

[37]  Sune Lehmann,et al.  Link communities reveal multiscale complexity in networks , 2009, Nature.

[38]  Hocine Cherifi,et al.  On Accuracy of Community Structure Discovery Algorithms , 2011, ArXiv.

[39]  Niloy Ganguly,et al.  Understanding how both the partitions of a bipartite network affect its one-mode projection , 2011, 1105.3902.

[40]  Nancy Roberts,et al.  Strategies for Combating Dark Networks , 2011, J. Soc. Struct..

[41]  Sean F. Everton Disrupting Dark Networks , 2012 .

[42]  V. Climenhaga Markov chains and mixing times , 2013 .

[43]  Tiago P Peixoto,et al.  Parsimonious module inference in large networks. , 2012, Physical review letters.

[44]  Kathy J. Horadam,et al.  Analysis of two crime-related networks derived from bipartite social networks , 2014, 2014 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2014).

[45]  Kathy J. Horadam,et al.  Community Detection in Bipartite Networks Using Random Walks , 2014, CompleNet.

[46]  Murray Aitkin,et al.  Statistical modelling of the group structure of social networks , 2014, Soc. Networks.

[47]  Andrea Lancichinetti,et al.  Erratum: Community detection algorithms: A comparative analysis [Phys. Rev. E 80, 056117 (2009)] , 2014 .

[48]  Michel Crampes,et al.  A Unified Community Detection, Visualization and Analysis Method , 2013, Adv. Complex Syst..

[49]  Murray Aitkin,et al.  Statistical modelling of a terrorist network , 2017 .