Maximal Labelled-Clique and Click-Biclique Problems for Networked Community Detection

Discovering "closely related entities" in any network, a.k.a. communities, is a key goal for various network analytic applications. In particular, cliques and bicliques are two community structures that have influenced several tools and techniques in Big Data and social networking. A clique is a complete subgraph of an undirected graph and similarly, a biclique is a complete bipartite subgraph. A maximal clique (or biclique) is a clique that is not subset of any other clique (or biclique). Algorithms to list all maximal cliques in general graphs or bicliques in bipartite graphs have been previously studied. In this paper, we enhance these solutions explaining how a novel structure, that we call clique-biclique can be used to unravel richer communities with respect to different problems in a wide variety of networks. We then give two algorithms for efficiently listing maximal labelled-cliques and maximal clique-bicliques. The first algorithm is an extension of the maximal clique enumeration and the second cleverly combines enumeration of maximal cliques and maximal bicliques. We conduct an experimental analysis over different synthetic and real datasets to evaluate performance of the algorithms, and we found that even richer communities compared to cliques and bicliques can be efficiently found in most networks of interest.

[1]  Qiang Wang,et al.  Topic oriented community detection through social objects and link analysis in social networks , 2012, Knowl. Based Syst..

[2]  Akira Tanaka,et al.  The worst-case time complexity for generating all maximal cliques and computational experiments , 2006, Theor. Comput. Sci..

[3]  Yun Zhang,et al.  On finding bicliques in bipartite graphs: a novel algorithm and its application to the integration of diverse biological data types , 2013, BMC Bioinformatics.

[4]  Lior Rokach,et al.  Intruder or Welcome Friend: Inferring Group Membership in Online Social Networks , 2013, SBP.

[5]  Wen-Kwang Tsao,et al.  Mining Web Browsing Log by Using Relaxed Biclique Enumeration Algorithm in MapReduce , 2012, 2012 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology.

[6]  R. Carter 11 – IT and society , 1991 .

[7]  Shi-Hua Zhang,et al.  Identification of functional modules in a PPI network by clique percolation clustering , 2006, Comput. Biol. Chem..

[8]  Theodoros Lappas,et al.  Profit-maximizing cluster hires , 2014, KDD.

[9]  Paolo Dell'Olmo,et al.  A mathematical programming approach for the maximum labeled clique problem , 2014 .

[10]  S. Lehmann,et al.  Biclique communities. , 2007, Physical review. E, Statistical, nonlinear, and soft matter physics.

[11]  T. Vicsek,et al.  Uncovering the overlapping community structure of complex networks in nature and society , 2005, Nature.

[12]  C. Bron,et al.  Algorithm 457: finding all cliques of an undirected graph , 1973 .