Time-Constrained Graph Pattern Matching in a Large Temporal Graph

Graph pattern matching (GPM) is an important operation on graph computation. Most existing work assumes that query graph or data graph is static, which is contrary to the fact that graphs in real life are intrinsically dynamic. Therefore, in this paper, we propose a new problem of Time-Constrained Graph Pattern Matching (TCGPM) in a large temporal graph. Different from traditional work, our work deals with temporal graphs rather than a series of snapshots. Besides, the query graph in TCGPM contains two types of time constraints which are helpful for finding more useful subgraphs. To address the problem of TCGPM, a baseline method and an improved method are proposed. Besides, to further improve the efficiency, two pruning rules are proposed. The improved method runs several orders of magnitude faster than the baseline method. The effectiveness of TCGPM is several orders of magnitude better than that of GPM. Extensive experiments on three real and semi-real datasets demonstrate high performance of our proposed methods.

[1]  C. Lee Giles,et al.  Updating Graph Indices with a One-Pass Algorithm , 2015, SIGMOD Conference.

[2]  C. Lee Giles,et al.  Iterative Graph Feature Mining for Graph Indexing , 2012, 2012 IEEE 28th International Conference on Data Engineering.

[3]  Jeong-Hoon Lee,et al.  Turboiso: towards ultrafast and robust subgraph isomorphism search in large graph databases , 2013, SIGMOD '13.

[4]  Ron Shamir,et al.  Faster subtree isomorphism , 1997, Proceedings of the Fifth Israeli Symposium on Theory of Computing and Systems.

[5]  Yang Xiang,et al.  Identifying Dynamic Network Modules with Temporal and Spatial Constraints , 2007, Pacific Symposium on Biocomputing.

[6]  Xin Wang,et al.  Distributed Graph Simulation: Impossibility and Possibility , 2014, Proc. VLDB Endow..

[7]  Yi Yang,et al.  Diversified Temporal Subgraph Pattern Mining , 2016, KDD.

[8]  Suh-Yin Lee,et al.  Mining Temporal Patterns in Time Interval-Based Data , 2015, IEEE Transactions on Knowledge and Data Engineering.

[9]  Ambuj K. Singh,et al.  Graphs-at-a-time: query language and access methods for graph databases , 2008, SIGMOD Conference.

[10]  Jie Wang,et al.  Event Pattern Matching over Graph Streams , 2014, Proc. VLDB Endow..

[11]  Thomas A. Henzinger,et al.  Computing simulations on finite and infinite graphs , 1995, Proceedings of IEEE 36th Annual Foundations of Computer Science.

[12]  Jinfeng Li,et al.  Reachability and time-based path queries in temporal graphs , 2016, 2016 IEEE 32nd International Conference on Data Engineering (ICDE).

[13]  Julian R. Ullmann,et al.  An Algorithm for Subgraph Isomorphism , 1976, J. ACM.

[14]  Philip S. Yu,et al.  Towards Graph Containment Search and Indexing , 2007, VLDB.

[15]  Jiawei Han,et al.  On graph query optimization in large networks , 2010, Proc. VLDB Endow..

[16]  Jeffrey Xu Yu,et al.  Taming verification hardness: an efficient algorithm for testing subgraph isomorphism , 2008, Proc. VLDB Endow..

[17]  Ada Wai-Chee Fu,et al.  Minimum Spanning Trees in Temporal Graphs , 2015, SIGMOD Conference.

[18]  Jeong-Hoon Lee,et al.  An In-depth Comparison of Subgraph Isomorphism Algorithms in Graph Databases , 2012, Proc. VLDB Endow..

[19]  Xin Wang,et al.  Diversified Top-k Graph Pattern Matching , 2013, Proc. VLDB Endow..

[20]  James F. Allen Maintaining knowledge about temporal intervals , 1983, CACM.

[21]  Jianzhong Li,et al.  Graph pattern matching , 2010, Proc. VLDB Endow..

[22]  Yi Lu,et al.  Path Problems in Temporal Graphs , 2014, Proc. VLDB Endow..

[23]  Mario Vento,et al.  A (sub)graph isomorphism algorithm for matching large graphs , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[24]  Xin Wang,et al.  Incremental graph pattern matching , 2013, TODS.

[25]  Lei Chen,et al.  Continuous Subgraph Pattern Search over Certain and Uncertain Graph Streams , 2010, IEEE Transactions on Knowledge and Data Engineering.

[26]  Udayan Khurana,et al.  Storing and Analyzing Historical Graph Data at Scale , 2015, EDBT.

[27]  Jon M. Kleinberg,et al.  The structure of information pathways in a social communication network , 2008, KDD.

[28]  Tianyu Wo,et al.  Strong simulation , 2014, ACM Trans. Database Syst..

[29]  Jignesh M. Patel,et al.  SAGA: a subgraph matching tool for biological graphs , 2007, Bioinform..

[30]  Divesh Srivastava,et al.  Holistic twig joins: optimal XML pattern matching , 2002, SIGMOD '02.