Efficient Processing of Growing Temporal Graphs

Temporal graphs are useful in modeling real-world networks. For example, in a phone call network, people may communicate with each other in multiple time periods, which can be modeled as multiple temporal edges. However, the size of real-world temporal graphs keeps increasing rapidly (e.g., considering the number of phone calls recorded each day), which makes it difficult to efficiently store and analyze the complete temporal graphs. We propose a new model, called equal-weight damped time window model, to efficiently manage temporal graphs. In this model, each time window is assigned a unified weight. This model is flexible as it allows users to control the tradeoff between the required storage space and the information loss. It also supports efficient maintenance of the windows as new data come in. We then discuss applications that use the model for analyzing temporal graphs. Our experiments demonstrated that we can handle massive temporal graphs efficiently with limited space.

[1]  Fan Yang,et al.  Husky: Towards a More Efficient and Expressive Distributed Computing Framework , 2016, Proc. VLDB Endow..

[2]  Afonso Ferreira,et al.  Computing Shortest, Fastest, and Foremost Journeys in Dynamic Networks , 2003, Int. J. Found. Comput. Sci..

[3]  Jari Saramäki,et al.  Path lengths, correlations, and centrality in temporal networks , 2011, Physical review. E, Statistical, nonlinear, and soft matter physics.

[4]  Amit Kumar,et al.  Connectivity and inference problems for temporal networks , 2000, Symposium on the Theory of Computing.

[5]  Haixun Wang,et al.  Landmarks: a new model for similarity-based pattern querying in time series databases , 2000, Proceedings of 16th International Conference on Data Engineering (Cat. No.00CB37073).

[6]  Christos D. Zaroliagis,et al.  Timetable Information: Models and Algorithms , 2004, ATMOS.

[7]  Jari Saramäki,et al.  Temporal Networks , 2011, Encyclopedia of Social Network Analysis and Mining.

[8]  Etienne Huens,et al.  Data for Development: the D4D Challenge on Mobile Phone Data , 2012, ArXiv.

[9]  Cecilia Mascolo,et al.  Temporal distance metrics for social network analysis , 2009, WOSN '09.

[10]  Hejun Wu,et al.  Core decomposition in large temporal graphs , 2015, 2015 IEEE International Conference on Big Data (Big Data).

[11]  Fan Yang,et al.  A General-Purpose Query-Centric Framework for Querying Big Graphs , 2016, Proc. VLDB Endow..

[12]  Nieves R. Brisaboa,et al.  A Compressed Suffix-Array Strategy for Temporal-Graph Indexing , 2014, SPIRE.

[13]  Jinfeng Li,et al.  Reachability and time-based path queries in temporal graphs , 2016, 2016 IEEE 32nd International Conference on Data Engineering (ICDE).

[14]  Philip S. Yu,et al.  Dynamic Community Detection in Weighted Graph Streams , 2013, SDM.

[15]  Jon M. Kleinberg,et al.  The structure of information pathways in a social communication network , 2008, KDD.

[16]  Hejun Wu,et al.  Efficient Algorithms for Temporal Path Computation , 2016, IEEE Transactions on Knowledge and Data Engineering.

[17]  Yi Lu,et al.  Path Problems in Temporal Graphs , 2014, Proc. VLDB Endow..

[18]  Nieves R. Brisaboa,et al.  Compact Data Structures for Temporal Graphs , 2013, 2013 Data Compression Conference.

[19]  Wilfred Ng,et al.  Blogel: A Block-Centric Framework for Distributed Computation on Real-World Graphs , 2014, Proc. VLDB Endow..

[20]  V Latora,et al.  Small-world behavior in time-varying graphs. , 2009, Physical review. E, Statistical, nonlinear, and soft matter physics.

[21]  Yi Yang,et al.  Diversified Temporal Subgraph Pattern Mining , 2016, KDD.

[22]  Piotr Indyk,et al.  Maintaining Stream Statistics over Sliding Windows , 2002, SIAM J. Comput..

[23]  Jean Vuillemin,et al.  A unifying look at data structures , 1980, CACM.

[24]  Wilfred Ng,et al.  Effective Techniques for Message Reduction and Load Balancing in Distributed Graph Computation , 2015, WWW.

[25]  Nieves R. Brisaboa,et al.  Data structures for temporal graphs based on compact sequence representations , 2015, Inf. Syst..

[26]  Cecilia Mascolo,et al.  Characterising temporal distance and reachability in mobile and online social networks , 2010, CCRV.

[27]  Peter J. Haas,et al.  Dynamic interaction graphs with probabilistic edge decay , 2015, 2015 IEEE 31st International Conference on Data Engineering.

[28]  Yixin Chen,et al.  Multi-Dimensional Regression Analysis of Time-Series Data Streams , 2002, VLDB.