Local Motif Clustering on Time-Evolving Graphs

Graph motifs are subgraph patterns that occur in complex networks, which are of key importance for gaining deep insights into the structure and functionality of the graph. Motif clustering aims at finding clusters consisting of dense motif patterns. It is commonly used in various application domains, ranging from social networks to collaboration networks, from market-basket analysis to neuroscience applications. More recently, local clustering techniques have been proposed for motif-aware clustering, which focuses on a small neighborhood of the input seed node instead of the entire graph. However, most of these techniques are designed for static graphs and may render sub-optimal results when applied to large time-evolving graphs. To bridge this gap, in this paper, we propose a novel framework, Local Motif Clustering on Time-Evolving Graphs (L-MEGA), which provides the evolution pattern of the local motif cluster in an effective and efficient way. The core of L-MEGA is approximately tracking the temporal evolution of the local motif cluster via novel techniques such as edge filtering, motif push operation, and incremental sweep cut. Furthermore, we theoretically analyze the efficiency and effectiveness of these techniques on time-evolving graphs. Finally, we evaluate the L-MEGA framework via extensive experiments on both synthetic and real-world temporal networks.

[1]  Liyi Wen,et al.  ON THE LIMITING PROBABILITY DISTRIBUTION OF A TRANSITION PROBABILITY TENSOR , 2011 .

[2]  Jure Leskovec,et al.  Motifs in Temporal Networks , 2016, WSDM.

[3]  Rajeev Motwani,et al.  The PageRank Citation Ranking : Bringing Order to the Web , 1999, WWW 1999.

[4]  Hongyang Zhang,et al.  Approximate Personalized PageRank on Dynamic Graphs , 2016, KDD.

[5]  Tao Wu,et al.  General Tensor Spectral Co-clustering for Higher-Order Data , 2016, NIPS.

[6]  Jirí Síma,et al.  On the NP-Completeness of Some Graph Cluster Measures , 2005, SOFSEM.

[7]  Xin Liu,et al.  No Place to Hide: Catching Fraudulent Entities in Tensors , 2018, WWW.

[8]  Philip S. Yu,et al.  GraphScope: parameter-free mining of large time-evolving graphs , 2007, KDD '07.

[9]  Jingrui He,et al.  A Randomized Approach for Crowdsourcing in the Presence of Multiple Views , 2017, 2017 IEEE International Conference on Data Mining (ICDM).

[10]  David F. Gleich,et al.  The Spacey Random Walk: A Stochastic Process for Higher-Order Data , 2016, SIAM Rev..

[11]  Jure Leskovec,et al.  Higher-order organization of complex networks , 2016, Science.

[12]  Philip S. Yu,et al.  Proximity Tracking on Time-Evolving Bipartite Graphs , 2008, SDM.

[13]  Ken-ichi Kawarabayashi,et al.  Efficient PageRank Tracking in Evolving Networks , 2015, KDD.

[14]  Fan Chung Graham,et al.  Local Graph Partitioning using PageRank Vectors , 2006, 2006 47th Annual IEEE Symposium on Foundations of Computer Science (FOCS'06).

[15]  Charu C. Aggarwal,et al.  Temporally Evolving Community Detection and Prediction in Content-Centric Networks , 2018, ECML/PKDD.

[16]  Jingrui He,et al.  Convolution-Consistent Collective Matrix Completion , 2019, CIKM.

[17]  S. Shen-Orr,et al.  Networks Network Motifs : Simple Building Blocks of Complex , 2002 .

[18]  Inderjit S. Dhillon,et al.  Semi-supervised graph clustering: a kernel approach , 2005, Machine Learning.

[19]  Mark S. Granovetter The Strength of Weak Ties , 1973, American Journal of Sociology.

[20]  David F. Gleich,et al.  Multilinear PageRank , 2014, SIAM J. Matrix Anal. Appl..

[21]  C. Hoofnagle Identity Theft: Making the Known Unknowns Known , 2007 .

[22]  Jingrui He,et al.  Motif-Preserving Dynamic Local Graph Cut , 2018, 2018 IEEE International Conference on Big Data (Big Data).

[23]  Jingrui He,et al.  A Local Algorithm for Structure-Preserving Graph Cut , 2017, KDD.

[24]  Jure Leskovec,et al.  Local Higher-Order Graph Clustering , 2017, KDD.

[25]  Deepayan Chakrabarti,et al.  Evolutionary clustering , 2006, KDD '06.

[26]  Yihong Gong,et al.  Incremental spectral clustering by efficiently updating the eigen-system , 2010, Pattern Recognit..

[27]  Shang-Hua Teng,et al.  A Local Clustering Algorithm for Massive Graphs and Its Application to Nearly Linear Time Graph Partitioning , 2008, SIAM J. Comput..

[28]  Jingrui He,et al.  MultiC2: an Optimization Framework for Learning from Task and Worker Dual Heterogeneity , 2017, SDM.

[29]  Edsger W. Dijkstra,et al.  A note on two problems in connexion with graphs , 1959, Numerische Mathematik.

[30]  Michael I. Jordan,et al.  On Spectral Clustering: Analysis and an algorithm , 2001, NIPS.

[31]  Tamara G. Kolda,et al.  Degree relations of triangles in real-world networks and graph models , 2012, CIKM.

[32]  Martin Rosvall,et al.  Memory in network flows and its effects on community detection, ranking, and spreading , 2014 .

[33]  S. Shen-Orr,et al.  Network motifs: simple building blocks of complex networks. , 2002, Science.

[34]  Martin Rosvall,et al.  Memory in network flows and its effects on spreading dynamics and community detection , 2013, Nature Communications.

[35]  Christos Faloutsos,et al.  Edge Weight Prediction in Weighted Signed Networks , 2016, 2016 IEEE 16th International Conference on Data Mining (ICDM).

[36]  Jure Leskovec,et al.  Tensor Spectral Clustering for Partitioning Higher-order Network Structures , 2015, SDM.

[37]  Christos Faloutsos,et al.  REV2: Fraudulent User Prediction in Rating Platforms , 2018, WSDM.

[38]  Ryan A. Rossi,et al.  The Network Data Repository with Interactive Graph Analytics and Visualization , 2015, AAAI.

[39]  Shang-Hua Teng,et al.  Spectral affinity in protein networks , 2009, BMC Systems Biology.

[40]  Josep-Lluís Larriba-Pey,et al.  Shaping communities out of triangles , 2012, CIKM.