Towards Efficient k-TriPeak Decomposition on Large Graphs

Analyzing the structure of real-world networks has attracted much attention over years and cohesive subgraph models are commonly used to characterize the structure of a network. Recently, a model named k-Peak is proposed to address the issue failing to detect sparser regions if the network contains distinct regions of different densities in the cohesive subgraph models. However, k-Peak only considers the edge connection (i.e., degree) in the network and the loose structure restricts the effectiveness of the k-Peak. On the other hand, triangles are fundamental building blocks of a network and are widely used in the literature. Motivated by this, in this paper, we propose the k-TriPeak model based on the triangles and study the problem of k-TriPeak decomposition that computes the k-TriPeak for all possible k values to understand the structure of a network. Through investigating the drawbacks of the baseline algorithm following the idea of k-Peak decomposition, we devise a new efficient algorithm to perform the k-TriPeak decomposition. Our new algorithm adopts a top-down decomposition paradigm and integrates two novel upper bounds with which large unnecessary computation can be pruned. We conduct extensive experiments on several large real-world datasets and the experimental results demonstrate the efficiency and effectiveness of our proposed algorithm.

[1]  Matthieu Latapy,et al.  Main-memory triangle computations for very large (sparse (power-law)) graphs , 2008, Theor. Comput. Sci..

[2]  Srinivasan Parthasarathy,et al.  Extracting Analyzing and Visualizing Triangle K-Core Motifs within Networks , 2012, 2012 IEEE 28th International Conference on Data Engineering.

[3]  Sucheta Soundarajan,et al.  The k-peak Decomposition: Mapping the Global Structure of Graphs , 2017, WWW.

[4]  Lijun Chang,et al.  Index-Based Densest Clique Percolation Community Search in Networks , 2018, IEEE Transactions on Knowledge and Data Engineering.

[5]  Lijun Chang,et al.  Diversified top-k clique search , 2015, The VLDB Journal.

[6]  Jia Wang,et al.  Truss Decomposition in Massive Networks , 2012, Proc. VLDB Endow..

[7]  Jeffrey Xu Yu,et al.  Querying k-truss community in large and dynamic graphs , 2014, SIGMOD Conference.

[8]  R. Luce,et al.  A method of matrix analysis of group structure , 1949, Psychometrika.

[9]  Stephen B. Seidman,et al.  A graph‐theoretic generalization of the clique concept* , 1978 .

[10]  Lijun Chang,et al.  I/O Efficient ECC Graph Decomposition via Graph Reduction , 2016, Proc. VLDB Endow..

[11]  Lijun Chang,et al.  Effective and Efficient Dynamic Graph Coloring , 2017, Proc. VLDB Endow..

[12]  Stanley Wasserman,et al.  Social Network Analysis: Methods and Applications , 1994 .

[13]  Duncan J. Watts,et al.  Collective dynamics of ‘small-world’ networks , 1998, Nature.

[14]  Anthony K. H. Tung,et al.  On Triangulation-based Dense Neighborhood Graphs Discovery , 2010, Proc. VLDB Endow..

[15]  Lei Chen,et al.  Efficient cohesive subgraphs detection in parallel , 2014, SIGMOD Conference.

[16]  Fan Zhang,et al.  Discovering Strong Communities with User Engagement and Tie Strength , 2018, DASFAA.

[17]  S H Strogatz,et al.  Random graph models of social networks , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[18]  Alessandro Vespignani,et al.  K-core decomposition of Internet graphs: hierarchies, self-similarity and measurement biases , 2005, Networks Heterog. Media.

[19]  Lijun Chang,et al.  Distributed computing connected components with linear communication cost , 2018, Distributed and Parallel Databases.

[20]  Jure Leskovec,et al.  Local Higher-Order Graph Clustering , 2017, KDD.

[21]  R. J. Mokken,et al.  Cliques, clubs and clans , 1979 .

[22]  Anthony K. H. Tung,et al.  Large Scale Cohesive Subgraphs Discovery for Social Network Visual Analysis , 2012, Proc. VLDB Endow..

[23]  C. Bron,et al.  Algorithm 457: finding all cliques of an undirected graph , 1973 .

[24]  Vladimir Batagelj,et al.  An O(m) Algorithm for Cores Decomposition of Networks , 2003, ArXiv.

[25]  Jure Leskovec,et al.  Higher-order organization of complex networks , 2016, Science.