Batch Processing for Truss Maintenance in Large Dynamic Graphs

This article studies the batch processing of truss maintenance in large graphs. Trussness is a widely used index in graph analytics for cohesive subgraph mining. It is defined on edges to reflect the closeness of vertices connected by the edges. The trussness maintenance problem, i.e., updating trussness after edge insertions/deletions and avoiding recomputation, was proposed by Cohen (2008) with the assumption that real graphs are continuously evolving in nature and their dynamicity usually only affects few edges. Different from the existing work that mainly focuses on simple single edge/vertex insertion/deletion, we propose batch processing algorithms for truss maintenance with multiedge insertions/deletions. By presenting an edge structure called triangle disjoint set, whose insertion/deletion can make each edge change its trussness by at most 1, we tackle the difficulty in quantifying the trussness changes of the edges in batch processing. More specifically, we propose two indices, namely sustain support and pivotal support, to help measure whether the edges have the potential to change their trussness, such that the search range of the potential edges is greatly reduced. The extensive experiments on real-world graphs illustrate that compared with the single-edge processing approach, our batch processing algorithms can significantly improve the processing efficiency, and the improvement becomes more obvious when more edges are inserted/deleted.

[1]  Christos Faloutsos,et al.  Graph evolution: Densification and shrinking diameters , 2006, TKDD.

[2]  Jiguo Yu,et al.  Fast skyline community search in multi-valued networks , 2020, Big Data Min. Anal..

[3]  Aric Hagberg,et al.  Exploring Network Structure, Dynamics, and Function using NetworkX , 2008, Proceedings of the Python in Science Conference.

[4]  Duncan J. Watts,et al.  Collective dynamics of ‘small-world’ networks , 1998, Nature.

[5]  Laks V. S. Lakshmanan,et al.  Approximate Closest Community Search in Networks , 2015, Proc. VLDB Endow..

[6]  Albert,et al.  Emergence of scaling in random networks , 1999, Science.

[7]  Peixiang Zhao,et al.  Truss-based Community Search: a Truss-equivalence Based Indexing Approach , 2017, Proc. VLDB Endow..

[8]  R. J. Mokken,et al.  Cliques, clubs and clans , 1979 .

[9]  Aristides Gionis,et al.  The community-search problem and how to plan a successful cocktail party , 2010, KDD.

[10]  Ali Pinar,et al.  Fast Hierarchy Construction for Dense Subgraphs , 2016, Proc. VLDB Endow..

[11]  Srinivasan Parthasarathy,et al.  Extracting Analyzing and Visualizing Triangle K-Core Motifs within Networks , 2012, 2012 IEEE 28th International Conference on Data Engineering.

[12]  Jiguo Yu,et al.  Triangle edge deletion on planar glasses-free RGB-digraphs , 2019, Theor. Comput. Sci..

[13]  Humayun Kabir,et al.  Shared-Memory Graph Truss Decomposition , 2017, 2017 IEEE 24th International Conference on High Performance Computing (HiPC).

[14]  Animesh Mukherjee,et al.  On the Formation of Circles in Co-authorship Networks , 2015, KDD.

[15]  Jure Leskovec,et al.  Defining and evaluating network communities based on ground-truth , 2012, Knowledge and Information Systems.

[16]  Stephen B. Seidman,et al.  A graph‐theoretic generalization of the clique concept* , 1978 .

[17]  George Karypis,et al.  Streaming and Batch Algorithms for Truss Decomposition , 2019, 2019 First International Conference on Graph Computing (GC).

[18]  Philip S. Yu,et al.  Enterprise Employee Training via Project Team Formation , 2017, WSDM.

[19]  Lars Backstrom,et al.  Structural diversity in social contagion , 2012, Proceedings of the National Academy of Sciences.

[20]  Yue Wang,et al.  Accelerating Truss Decomposition on Heterogeneous Processors , 2020, Proc. VLDB Endow..

[21]  Jiguo Yu,et al.  Distributed Core Decomposition in Probabilistic Graphs , 2019, CSoNet.

[22]  Srikanta Tirthapura,et al.  Incremental maintenance of maximal cliques in a dynamic graph , 2016, VLDB J..

[23]  Chaokun Wang,et al.  Forbidden Nodes Aware Community Search , 2019, AAAI.

[24]  Philip S. Yu,et al.  Edge2vec: Edge-based Social Network Embedding , 2020, ACM Trans. Knowl. Discov. Data.

[25]  Jiguo Yu,et al.  Faster Parallel Core Maintenance Algorithms in Dynamic Graphs , 2020, IEEE Transactions on Parallel and Distributed Systems.

[26]  Jia Wang,et al.  Truss Decomposition in Massive Networks , 2012, Proc. VLDB Endow..

[27]  Jure Leskovec,et al.  Community Structure in Large Networks: Natural Cluster Sizes and the Absence of Large Well-Defined Clusters , 2008, Internet Math..

[28]  Jonathan Cohen,et al.  Graph Twiddling in a MapReduce World , 2009, Computing in Science & Engineering.

[29]  Tamara G. Kolda,et al.  A Scalable Generative Graph Model with Community Structure , 2013, SIAM J. Sci. Comput..

[30]  Na Wang,et al.  Parallel Algorithm for Core Maintenance in Dynamic Graphs , 2016, 2017 IEEE 37th International Conference on Distributed Computing Systems (ICDCS).

[31]  Lei Chen,et al.  Efficient cohesive subgraphs detection in parallel , 2014, SIGMOD Conference.

[32]  Humayun Kabir,et al.  Parallel k-truss decomposition on multicore systems , 2017, 2017 IEEE High Performance Extreme Computing Conference (HPEC).

[33]  Jure Leskovec,et al.  Friendship and mobility: user movement in location-based social networks , 2011, KDD.

[34]  Jeffrey Xu Yu,et al.  Unboundedness and Efficiency of Truss Maintenance in Evolving Graphs , 2019, SIGMOD Conference.

[35]  Xin Huang,et al.  Fast Algorithm for K-Truss Discovery on Public-Private Graphs , 2019, IJCAI.

[36]  Jeffrey Xu Yu,et al.  Querying k-truss community in large and dynamic graphs , 2014, SIGMOD Conference.

[37]  Dongjing Miao,et al.  Maximum reachability preserved graph cut , 2020, Theor. Comput. Sci..

[38]  Beom Jun Kim,et al.  Growing scale-free networks with tunable clustering. , 2001, Physical review. E, Statistical, nonlinear, and soft matter physics.

[39]  Xiang Zhang,et al.  Second-order random walk-based proximity measures in graph analysis: formulations and algorithms , 2018, The VLDB Journal.

[40]  Stephen B. Seidman,et al.  Network structure and minimum degree , 1983 .

[41]  Ali Pinar,et al.  Local Algorithms for Hierarchical Dense Subgraph Discovery , 2017, Proc. VLDB Endow..

[42]  Ming-Syan Chen,et al.  Distributed algorithms for k-truss decomposition , 2014, 2014 IEEE International Conference on Big Data (Big Data).

[43]  Sandra Sudarsky,et al.  Massive Quasi-Clique Detection , 2002, LATIN.

[44]  Hai Jin,et al.  Core Maintenance in Dynamic Graphs: A Parallel Approach Based on Matching , 2017, IEEE Transactions on Parallel and Distributed Systems.

[45]  Charu C. Aggarwal,et al.  A Survey of Algorithms for Dense Subgraph Discovery , 2010, Managing and Mining Graph Data.

[46]  James Cheng,et al.  Triangle listing in massive networks and its applications , 2011, KDD.