Finding events in temporal networks: segmentation meets densest subgraph discovery

In this paper we study the problem of discovering a timeline of events in a temporal network. We model events as dense subgraphs that occur within intervals of network activity. We formulate the event-discovery task as an optimization problem: we search for a partition of the network timeline into k non-overlapping intervals such that the intervals span subgraphs with maximum total density. The output is a sequence of dense subgraphs with their corresponding time intervals, capturing the most interesting events during the network lifetime. A naive solution to this optimization problem runs in polynomial time, but that running time is prohibitively high. We adapt recent work on dynamic densest-subgraph discovery and approximate dynamic programming to design a fast approximation algorithm. Next, to obtain richer structure, we adjust the problem formulation to encourage coverage of a larger set of nodes. This problem is NP-hard even for static graphs. On static graphs, however, a simple greedy algorithm yields an approximate solution due to submodularity. We extend this greedy approach to temporal networks. In this setting the approximation guarantee no longer holds; nevertheless, our experiments indicate that the algorithm finds good-quality solutions.
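To make the first formulation concrete, the following is a minimal sketch of the naive baseline the abstract alludes to, not the fast approximation algorithm of the paper. It rests on several assumptions that are not stated in the abstract: density is taken to be the number of edges divided by the number of nodes, an interval's graph is the union of the edges whose timestamps fall inside it, and the per-interval densest subgraph is estimated with Charikar-style greedy peeling (a known 1/2-approximation) rather than an exact method. The names `peel_density` and `segment_events` are hypothetical.

```python
from collections import defaultdict


def peel_density(edges):
    """Greedy peeling: repeatedly drop a minimum-degree node and keep the
    densest intermediate subgraph. Returns (density, node set)."""
    adj = defaultdict(set)
    for u, v in edges:
        if u != v:  # ignore self-loops; parallel edges collapse in the sets
            adj[u].add(v)
            adj[v].add(u)
    nodes = set(adj)
    if not nodes:
        return 0.0, set()
    m = sum(len(nb) for nb in adj.values()) // 2
    best_density, best_nodes = m / len(nodes), set(nodes)
    while len(nodes) > 1:
        u = min(nodes, key=lambda x: len(adj[x]))
        for w in adj[u]:
            adj[w].discard(u)
        m -= len(adj[u])
        del adj[u]
        nodes.remove(u)
        density = m / len(nodes)
        if density > best_density:
            best_density, best_nodes = density, set(nodes)
    return best_density, best_nodes


def segment_events(temporal_edges, k):
    """Split the timeline into k consecutive intervals that maximize the sum
    of (approximate) densest-subgraph densities.

    temporal_edges: iterable of (u, v, t) triples.
    Returns (total density, [(t_start, t_end, node set), ...])."""
    by_time = defaultdict(list)
    for u, v, t in temporal_edges:
        by_time[t].append((u, v))
    times = sorted(by_time)
    n = len(times)
    if not 1 <= k <= n:
        raise ValueError("k must be between 1 and the number of distinct timestamps")

    # score[i, j] = (density, nodes) for the interval times[i] .. times[j].
    score = {}
    for i in range(n):
        edges = []
        for j in range(i, n):
            edges.extend(by_time[times[j]])
            score[i, j] = peel_density(edges)

    # dp[p][e] = best total density when the first e timestamps form p intervals.
    neg_inf = float("-inf")
    dp = [[neg_inf] * (n + 1) for _ in range(k + 1)]
    back = [[0] * (n + 1) for _ in range(k + 1)]
    dp[0][0] = 0.0
    for parts in range(1, k + 1):
        for end in range(parts, n + 1):
            for start in range(parts - 1, end):
                if dp[parts - 1][start] == neg_inf:
                    continue
                cand = dp[parts - 1][start] + score[start, end - 1][0]
                if cand > dp[parts][end]:
                    dp[parts][end] = cand
                    back[parts][end] = start

    # Walk the back-pointers to recover the intervals in chronological order.
    events, end = [], n
    for parts in range(k, 0, -1):
        start = back[parts][end]
        events.append((times[start], times[end - 1], score[start, end - 1][1]))
        end = start
    events.reverse()
    return dp[k][n], events


# Toy timeline: a triangle forms during t = 1..3, then a 4-clique during t = 4..9.
toy_edges = [
    ("a", "b", 1), ("b", "c", 2), ("a", "c", 3),
    ("w", "x", 4), ("w", "y", 5), ("w", "z", 6),
    ("x", "y", 7), ("x", "z", 8), ("y", "z", 9),
]
total, events = segment_events(toy_edges, k=2)
```

The sketch evaluates a densest-subgraph routine on every one of the quadratically many candidate intervals before running the dynamic program, which illustrates why such a baseline, although polynomial, becomes impractical on long timelines.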
