Speeding Up GED Verification for Graph Similarity Search

Graph similarity search retrieves from a database all graphs whose edit distance (GED) to a query graph is within a threshold. As GED computation is NP-hard, the existing works adopt the filtering-and-verification paradigm to reduce the number of GED verifications, and they mainly focus on designing filtering techniques while using the now out-dated algorithm A∗GED for verification. In this paper, we aim to speed up GED verification, which is orthogonal to the index structures used in the filtering phase. We propose a bestfirst search algorithm AStar+-LSa which improves A∗GED by (1) reducing memory consumption, (2) tightening lower bound estimation, and (3) improving the time complexity for lower bound computation. We formally show that AStar+-LSa has a lower space and time complexity than A∗GED. We further modify AStar+-LSa into a depth-first search algorithm to contrast these two search paradigms, and we extend our algorithms for exact GED computation. We conduct extensive empirical studies on real graph datasets, and show that our algorithm AStar+-LSa outperforms the state-of-the-art algorithms by several orders of magnitude for both GED verification and GED computation.

[1]  King-Sun Fu,et al.  A distance measure between attributed relational graphs for pattern recognition , 1983, IEEE Transactions on Systems, Man, and Cybernetics.

[2]  Kaspar Riesen,et al.  A Novel Software Toolkit for Graph Edit Distance Computation , 2013, GbRPR.

[3]  M. Bernard,et al.  Functional brain imaging by EEG graph-matching , 2005, 2005 IEEE Engineering in Medicine and Biology 27th Annual Conference.

[4]  Anthony K. H. Tung,et al.  Comparing Stars: On Approximating Graph Edit Distance , 2009, Proc. VLDB Endow..

[5]  Lijun Chang,et al.  Efficient Graph Edit Distance Computation and Verification via Anchor-aware Lower Bound Estimation , 2017, ArXiv.

[6]  Anthony K. H. Tung,et al.  An Efficient Graph Indexing Method , 2012, 2012 IEEE 28th International Conference on Data Engineering.

[7]  Peixiang Zhao,et al.  Similarity Search in Graph Databases: A Multi-Layered Indexing Approach , 2017, 2017 IEEE 33rd International Conference on Data Engineering (ICDE).

[8]  Yang Wang,et al.  Efficient structure similarity searches: a partition-based approach , 2018, The VLDB Journal.

[9]  Jean-Yves Ramel,et al.  An Exact Graph Edit Distance Algorithm for Solving Pattern Recognition Problems , 2015, ICPRAM.

[10]  Karam Gouda,et al.  CSI_GED: An efficient approach for graph edit similarity computation , 2016, 2016 IEEE 32nd International Conference on Data Engineering (ICDE).

[11]  Qing Liu,et al.  A Partition-Based Approach to Structure Similarity Search , 2013, Proc. VLDB Endow..

[12]  David B. Blumenthal,et al.  Exact Computation of Graph Edit Distance for Uniform and Non-uniform Metric Edit Costs , 2017, GbRPR.

[13]  Xuemin Lin,et al.  Efficient processing of graph similarity queries with edit distance constraints , 2013, The VLDB Journal.

[14]  Horst Bunke,et al.  A graph distance metric based on the maximal common subgraph , 1998, Pattern Recognit. Lett..

[15]  Gabriel Valiente,et al.  A graph distance metric combining maximum common subgraph and minimum common supergraph , 2001, Pattern Recognit. Lett..

[16]  M. Kanehisa,et al.  A heuristic graph comparison algorithm and its application to detect functionally related enzyme clusters. , 2000, Nucleic acids research.

[17]  Kaspar Riesen,et al.  Speeding Up Graph Edit Distance Computation with a Bipartite Heuristic , 2007, MLG.

[18]  Jeffrey Xu Yu,et al.  TreeSpan: efficiently computing similarity all-matching , 2012, SIGMOD Conference.

[19]  Horst Bunke,et al.  Edit distance-based kernel functions for structural pattern classification , 2006, Pattern Recognit..

[20]  Alfred O. Hero,et al.  A binary linear programming formulation of the graph edit distance , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[21]  Jenny Benois-Pineau,et al.  Retrieval of objects in video by similarity based on graph matching , 2007, Pattern Recognit. Lett..

[22]  Dong-Hoon Choi,et al.  Inves: Incremental Partitioning-Based Verification for Graph Similarity Search , 2019, EDBT.

[23]  Lei Zou,et al.  Efficient Graph Similarity Search Over Large Graph Databases , 2015, IEEE Transactions on Knowledge and Data Engineering.