HGED: A Hybrid Search Algorithm for Efficient Parallel Graph Edit Distance Computation

Graph edit distance (GED) is a measure for quantifying the similarity between two graphs. Because of its flexibility and versatility, GED is widely used in many real applications. However, the main disadvantage of GED is its high computational cost. Many solutions have been proposed to speed up GED computation, but most of them focus on developing serial algorithms and only very few solutions consider parallel computing. In this paper, we study a parallel GED computation to elaborate a fast and precise algorithm. Unlike existing solutions that utilize either a depth-first or a best-first search, we propose a hybrid approach that combines depth-first and best-first search paradigms. Our approach can quickly find tighter GED upper bounds, and effectively prune the search space using the upper bounds. Based on the approach, we develop an efficient parallel GED computation algorithm named <inline-formula> <tex-math notation="LaTeX">${\sf HGED}$ </tex-math></inline-formula>. To maximize thread utility, <inline-formula> <tex-math notation="LaTeX">${\sf HGED}$ </tex-math></inline-formula> is also equipped with a novel dynamic load balancing scheme whose main focus is on reducing the overhead of thread synchronization. Experimental results on widely used real datasets show that, on average, <inline-formula> <tex-math notation="LaTeX">${\sf HGED}$ </tex-math></inline-formula> outperforms the state-of-the art serial algorithm <inline-formula> <tex-math notation="LaTeX">${\sf AStar}^{+}$ </tex-math></inline-formula>-<inline-formula> <tex-math notation="LaTeX">${\sf LSa}$ </tex-math></inline-formula> by 6 times and the state-of-the parallel algorithm <inline-formula> <tex-math notation="LaTeX">${\sf PGED}$ </tex-math></inline-formula> by 4 times.

[1]  Yixiang Fang,et al.  Efficient Parallel Computing of Graph Edit Distance , 2019, 2019 IEEE 35th International Conference on Data Engineering Workshops (ICDEW).

[2]  Jeffrey Xu Yu,et al.  Connected substructure similarity search , 2010, SIGMOD Conference.

[3]  Dong-Hoon Choi,et al.  Inves: Incremental Partitioning-Based Verification for Graph Similarity Search , 2019, EDBT.

[4]  Jeffrey Xu Yu,et al.  TreeSpan: efficiently computing similarity all-matching , 2012, SIGMOD Conference.

[5]  Wei Jin,et al.  SAPPER: Subgraph Indexing and Approximate Matching in Large Graphs , 2010, Proc. VLDB Endow..

[6]  Karam Gouda,et al.  A novel edge-centric approach for graph edit similarity computation , 2019, Inf. Syst..

[7]  Jean-Yves Ramel,et al.  An Exact Graph Edit Distance Algorithm for Solving Pattern Recognition Problems , 2015, ICPRAM.

[8]  Horst Bunke,et al.  A graph distance metric based on the maximal common subgraph , 1998, Pattern Recognit. Lett..

[9]  Kaspar Riesen,et al.  A Survey on Applications of Bipartite Graph Edit Distance , 2017, GbRPR.

[10]  Jignesh M. Patel,et al.  SAGA: a subgraph matching tool for biological graphs , 2007, Bioinform..

[11]  Romain Raveaux,et al.  Exact Graph Edit Distance Computation Using a Binary Linear Program , 2016, S+SSPR.

[12]  King-Sun Fu,et al.  A distance measure between attributed relational graphs for pattern recognition , 1983, IEEE Transactions on Systems, Man, and Cybernetics.

[13]  Anthony K. H. Tung,et al.  Comparing Stars: On Approximating Graph Edit Distance , 2009, Proc. VLDB Endow..

[14]  Xuemin Lin,et al.  Efficient Graph Similarity Joins with Edit Distance Constraints , 2012, 2012 IEEE 28th International Conference on Data Engineering.

[15]  Karam Gouda,et al.  CSI_GED: An efficient approach for graph edit similarity computation , 2016, 2016 IEEE 32nd International Conference on Data Engineering (ICDE).

[16]  Lu Qin,et al.  Speeding Up GED Verification for Graph Similarity Search , 2020, 2020 IEEE 36th International Conference on Data Engineering (ICDE).

[17]  Qing Liu,et al.  A Partition-Based Approach to Structure Similarity Search , 2013, Proc. VLDB Endow..

[18]  Xuemin Lin,et al.  Efficient processing of graph similarity queries with edit distance constraints , 2013, The VLDB Journal.

[19]  Kaspar Riesen,et al.  Speeding Up Graph Edit Distance Computation with a Bipartite Heuristic , 2007, MLG.

[20]  Jean-Yves Ramel,et al.  A parallel graph edit distance algorithm , 2018, Expert Syst. Appl..