Mining Heavy Subgraphs in Time-Evolving Networks

Networks from different genres are not static entities, but exhibit dynamic behavior. The congestion level of links in transportation networks varies in time depending on the traffic. Similarly, social and communication links are employed at varying rates as information cascades unfold. In recent years there has been an increase of interest in modeling and mining dynamic networks. However, limited attention has been placed in high-scoring sub graph discovery in time-evolving networks. We define the problem of finding the highest-scoring temporal sub graph in a dynamic network, termed Heaviest Dynamic Sub graph (HDS). We show that HDS is NP-hard even with edge weights in {-1,1} and devise an efficient approach for large graph instances that evolve over long time periods. While a naive approach would enumerate all O(t^2) sub-intervals, our solution performs an effective pruning of the sub-interval space by considering O(t*log(t)) groups of sub-intervals and computing an aggregate of each group in logarithmic time. We also define a fast heuristic and a tight upper bound for approximating the static version of HDS, and use them for further pruning the sub-interval space and quickly verifying candidate sub-intervals. We perform an extensive experimental evaluation of our algorithm on transportation, communication and social media networks for discovering sub graphs that correspond to traffic congestions, communication overflow and localized social discussions. Our method is two orders of magnitude faster than a naive approach and scales well with network size and time length.

[1]  Chen Avin,et al.  How to Explore a Fast-Changing World (Cover Time of a Simple Random Walk on Evolving Graphs) , 2008, ICALP.

[2]  Yang Xiang,et al.  Identifying Dynamic Network Modules with Temporal and Spatial Constraints , 2007, Pacific Symposium on Biocomputing.

[3]  Yun Chi,et al.  Facetnet: a framework for analyzing communities and their evolutions in dynamic networks , 2008, WWW.

[4]  Jiawei Han,et al.  A Particle-and-Density Based Evolutionary Clustering Method for Dynamic Networks , 2009, Proc. VLDB Endow..

[5]  Jaimyoung Kwon Modeling Freeway Traffic with Coupled HMMs , 2000 .

[6]  George Michailidis,et al.  Global Modeling and Prediction of Computer Network Traffic , 2010, ArXiv.

[7]  Ambuj K. Singh,et al.  Content-based Modeling and Prediction of Information Dissemination , 2011, 2011 International Conference on Advances in Social Networks Analysis and Mining.

[8]  Hans-Peter Kriegel,et al.  Pattern Mining in Frequent Dynamic Subgraphs , 2006, Sixth International Conference on Data Mining (ICDM'06).

[9]  Andrea E. F. Clementi,et al.  Information Spreading in Stationary Markovian Evolving Graphs , 2011 .

[10]  Tina Eliassi-Rad,et al.  Detecting Novel Discrepancies in Communication Networks , 2010, 2010 IEEE International Conference on Data Mining.

[11]  Joan Feigenbaum,et al.  Sharing the Cost of Multicast Transmissions , 2001, J. Comput. Syst. Sci..

[12]  Masatoshi Yoshikawa,et al.  Time Graph Pattern Mining for Web Analysis and Information Retrieval , 2010, WAIM.

[13]  Terrill L. Frantz,et al.  Communication Networks from the Enron Email Corpus “It's Always About the People. Enron is no Different” , 2005, Comput. Math. Organ. Theory.

[14]  Pravin Varaiya,et al.  Traffic Flow on a Freeway Network , 2003 .

[15]  Tobias Müller,et al.  Identifying functional modules in protein–protein interaction networks: an integrated exact approach , 2008, ISMB.

[16]  Christian Böhm,et al.  Frequent subgraph discovery in dynamic networks , 2010, MLG '10.

[17]  Yann LeCun,et al.  Predictive network modeling of the high-resolution dynamic plant transcriptome in response to nitrate , 2010, Genome Biology.

[18]  Hosung Park,et al.  What is Twitter, a social network or a news media? , 2010, WWW '10.

[19]  David S. Johnson,et al.  The prize collecting Steiner tree problem: theory and practice , 2000, SODA '00.

[20]  Sung-Bae Cho,et al.  Structure evolution of dynamic Bayesian network for traffic accident detection , 2011, 2011 IEEE Congress of Evolutionary Computation (CEC).

[21]  Jure Leskovec,et al.  Microscopic evolution of social networks , 2008, KDD.

[22]  Aristides Gionis,et al.  Mining Graph Evolution Rules , 2009, ECML/PKDD.

[23]  Matteo Fischetti,et al.  An Algorithmic Framework for the Exact Solution of the Prize-Collecting Steiner Tree Problem , 2006, Math. Program..

[24]  Daniel R. Dooly,et al.  Algorithms for the constrained maximum-weight connected graph problem , 1996 .

[25]  Jure Leskovec,et al.  Patterns of temporal variation in online media , 2011, WSDM '11.

[26]  C. Faloutsos,et al.  EVENT DETECTION IN TIME SERIES OF MOBILE COMMUNICATION GRAPHS , 2010 .