Balanced Clique Computation in Signed Networks: Concepts and Algorithms

Clique is one of the most fundamental models for cohesive subgraph mining in network analysis. Existing clique model mainly focuses on unsigned networks. However, in real world, many applications are modeled as signed networks with positive and negative edges. As the signed networks hold their own properties different from the unsigned networks, the existing clique model is inapplicable for the signed networks. Motivated by this, we propose the balanced clique model that considers the most fundamental and dominant theory, structural balance theory, for signed networks. Following the balanced clique model, we study the maximal balanced clique enumeration problem (MBCE) which computes all the maximal balanced cliques in a given signed network. Moreover, in some applications, users prefer a unique and representative balanced clique with maximum size rather than all balanced cliques. Thus, we also study the maximum balanced clique search problem (MBCS) which computes the balanced clique with maximum size. We show that MBCE problem and MBCS problem are both NP-Hard. For the MBCE problem, a straightforward solution is to treat the signed network as two unsigned networks and leverage the off-the-shelf techniques for unsigned networks. However, such a solution is inefficient for large signed networks. To address this problem, in this paper, we first propose a new maximal balanced clique enumeration algorithm by exploiting the unique properties of signed networks. Based on the new proposed algorithm, we devise two optimization strategies to further improve the efficiency of the enumeration. For the MBCS problem, we first propose a baseline solution. To overcome the huge search space problem of the baseline solution, we propose a new search framework based on search space partition. To further improve the efficiency of the new framework, we propose multiple optimization strategies regarding to redundant search branches and invalid candidates. We conduct extensive experiments on large real datasets. The experimental results demonstrate the efficiency, effectiveness and scalability of our proposed algorithms for MBCE problem and MBCS problem.

[1]  Minghao Yin,et al.  NuQClq: An Effective Local Search Algorithm for Maximum Quasi-Clique Problem , 2021, AAAI.

[2]  Zhengping Qian,et al.  Maximum biclique search at billion scale , 2020, Proc. VLDB Endow..

[3]  Tansel Dökeroglu,et al.  A novel parallel local search algorithm for the maximum vertex weight clique problem in large graphs , 2020, Soft Comput..

[4]  Lijun Chang,et al.  Efficient maximum clique computation and enumeration over large sparse graphs , 2020, The VLDB Journal.

[5]  Lu Qin,et al.  Efficient shortest path index maintenance on dynamic road networks with theoretical guarantees , 2020, Proc. VLDB Endow..

[6]  Ganesh Ramakrishnan,et al.  Cross-Lingual Training for Automatic Question Generation , 2019, ACL.

[7]  Xuemin Lin,et al.  Efficient (α, β)-core Computation: an Index-based Approach , 2019, WWW.

[8]  Xuemin Lin,et al.  Towards Efficient k-TriPeak Decomposition on Large Graphs , 2019, DASFAA.

[9]  Jeffrey Xu Yu,et al.  I/O Efficient Core Graph Decomposition: Application to Degeneracy Ordering , 2019, IEEE Transactions on Knowledge and Data Engineering.

[10]  Lijun Chang,et al.  Distributed computing connected components with linear communication cost , 2018, Distributed and Parallel Databases.

[11]  Sameep Mehta,et al.  Leveraging semantic resources in diversified query expansion , 2018, World Wide Web.

[12]  Fan Zhang,et al.  Towards Efficient Path Skyline Computation in Bicriteria Networks , 2018, DASFAA.

[13]  Fan Zhang,et al.  External Topological Sorting in Large Graphs , 2018, DASFAA.

[14]  Lijun Chang,et al.  Index-Based Densest Clique Percolation Community Search in Networks , 2018, IEEE Transactions on Knowledge and Data Engineering.

[15]  Jeffrey Xu Yu,et al.  Efficient Signed Clique Search in Signed Networks , 2018, 2018 IEEE 34th International Conference on Data Engineering (ICDE).

[16]  Ahmed Hadj Kacem,et al.  Algorithms for Finding Maximal and Maximum Cliques: A Survey , 2017, ISDA.

[17]  Lijun Chang,et al.  Effective and Efficient Dynamic Graph Coloring , 2017, Proc. VLDB Endow..

[18]  Charu C. Aggarwal,et al.  Signed Network Modeling Based on Structural Balance Theory , 2017, CIKM.

[19]  Jeffrey Xu Yu,et al.  Finding the maximum clique in massive graphs , 2017, Proc. VLDB Endow..

[20]  Ümit V. Çatalyürek,et al.  Nucleus Decompositions for Identifying Hierarchy of Dense Subgraphs , 2017, ACM Trans. Web.

[21]  Christos Faloutsos,et al.  Edge Weight Prediction in Weighted Signed Networks , 2016, 2016 IEEE 16th International Conference on Data Mining (ICDM).

[22]  J. Pei,et al.  Finding Gangs in War from Signed Networks , 2016, KDD.

[23]  Lijun Chang,et al.  I/O efficient ECC graph decomposition via graph reduction , 2016, The VLDB Journal.

[24]  Xiao-Fei Zhang,et al.  Detecting Protein Complexes from Signed Protein-Protein Interaction Networks , 2015, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[25]  Daniel Dajun Zeng,et al.  Social balance in signed networks , 2014, Information Systems Frontiers.

[26]  Lijun Chang,et al.  Diversified top-k clique search , 2015, The VLDB Journal.

[27]  Ümit V. Çatalyürek,et al.  Finding the Hierarchy of Dense Subgraphs using Nucleus Decompositions , 2014, WWW.

[28]  Jeffrey Xu Yu,et al.  Querying k-truss community in large and dynamic graphs , 2014, SIGMOD Conference.

[29]  E. Chesler,et al.  On finding bicliques in bipartite graphs: a novel algorithm and its application to the integration of diverse biological data types , 2014, BMC Bioinformatics.

[30]  Stephen S. Yau,et al.  Detecting k-Balanced Trusted Cliques in Signed Social Networks , 2014, IEEE Internet Computing.

[31]  Sergiy Butenko,et al.  On clique relaxation models in network analysis , 2013, Eur. J. Oper. Res..

[32]  Enhong Chen,et al.  Towards Annotating Media Contents through Social Diffusion Analysis , 2012, 2012 IEEE 12th International Conference on Data Mining.

[33]  James Cheng,et al.  Fast algorithms for maximal clique enumeration with limited memory , 2012, KDD.

[34]  Weifa Liang,et al.  Finding maximal k-edge-connected subgraphs from a large graph , 2012, EDBT '12.

[35]  David Eppstein,et al.  Listing All Maximal Cliques in Large Sparse Real-World Graphs , 2011, JEAL.

[36]  Robert D. Kleinberg,et al.  Continuous-time model of structural balance , 2010, Proceedings of the National Academy of Sciences.

[37]  David Eppstein,et al.  Listing All Maximal Cliques in Sparse Graphs in Near-optimal Time , 2010, Exact Complexity of NP-hard Problems.

[38]  Jure Leskovec,et al.  Signed networks in social media , 2010, CHI.

[39]  Steven H Strogatz,et al.  Energy landscape of social balance. , 2009, Physical review letters.

[40]  Christian Bauckhage,et al.  The slashdot zoo: mining a social network with negative edges , 2009, WWW.

[41]  Nagiza F. Samatova,et al.  A scalable, parallel algorithm for maximal clique enumeration , 2009, J. Parallel Distributed Comput..

[42]  Peter Abell,et al.  Structural Balance: A Dynamic Perspective , 2009 .

[43]  Yun Chi,et al.  Identifying opinion leaders in the blogosphere , 2007, CIKM '07.

[44]  Akira Tanaka,et al.  The worst-case time complexity for generating all maximal cliques and computational experiments , 2006, Theor. Comput. Sci..

[45]  George A. Miller,et al.  WordNet: A Lexical Database for English , 1995, HLT.

[46]  R. Axelrod,et al.  A Landscape Theory of Aggregation , 1993, British Journal of Political Science.

[47]  Michael E. Saks,et al.  An on-line graph coloring algorithm with sublinear performance ratio , 1989, Discret. Math..

[48]  Stephen B. Seidman,et al.  Network structure and minimum degree , 1983 .

[49]  E. A. Akkoyunlu,et al.  The Enumeration of Maximal Cliques of Large Graphs , 1973, SIAM J. Comput..

[50]  F. Harary,et al.  STRUCTURAL BALANCE: A GENERALIZATION OF HEIDER'S THEORY1 , 1977 .

[51]  Yue Wang,et al.  Finding large diverse communities on networks , 2020, Proc. VLDB Endow..

[52]  Jun Wu,et al.  An Efficient Local Search for the Maximum Edge Weighted Clique Problem , 2018, IEEE Access.

[53]  Chris Arney,et al.  Networks, Crowds, and Markets: Reasoning about a Highly Connected World (Easley, D. and Kleinberg, J.; 2010) [Book Review] , 2013, IEEE Technology and Society Magazine.

[54]  Elena Baranov,et al.  IMPORTANCE OF STRATEGIC ALLIANCES IN COMPANY’S ACTIVITY , 2013 .

[55]  Coenraad Bron,et al.  Finding All Cliques of an Undirected Graph (Algorithm 457) , 1973, Commun. ACM.

[56]  F. Harary On the notion of balance of a signed graph. , 1953 .

[57]  F. Heider Attitudes and cognitive organization. , 1946, The Journal of psychology.