Improved Lower Bounds for Graph Edit Distance

The problem of deriving lower and upper bounds for the edit distance between undirected, labeled graphs has recently received increasing attention. However, only one algorithm has been proposed that allegedly computes not only an upper but also a lower bound for non-uniform edit costs and incorporates information about both node and edge labels. In this paper, we demonstrate that this algorithm is incorrect. We present a corrected version <inline-formula> <tex-math notation="LaTeX">$\mathsf {B\scriptstyle{RANCH}}$</tex-math><alternatives> <inline-graphic xlink:href="blumenthal-ieq1-2772243.gif"/></alternatives></inline-formula> that runs in <inline-formula> <tex-math notation="LaTeX">$\mathcal{O}(n^2\Delta ^3+n^3)$</tex-math><alternatives> <inline-graphic xlink:href="blumenthal-ieq2-2772243.gif"/></alternatives></inline-formula> time, where <inline-formula> <tex-math notation="LaTeX">$\Delta$</tex-math><alternatives><inline-graphic xlink:href="blumenthal-ieq3-2772243.gif"/> </alternatives></inline-formula> is the maximum of the maximum degrees of input graphs <inline-formula> <tex-math notation="LaTeX">$G$</tex-math><alternatives><inline-graphic xlink:href="blumenthal-ieq4-2772243.gif"/> </alternatives></inline-formula> and <inline-formula><tex-math notation="LaTeX">$H$</tex-math><alternatives> <inline-graphic xlink:href="blumenthal-ieq5-2772243.gif"/></alternatives></inline-formula>. We also develop a speed-up <inline-formula><tex-math notation="LaTeX">$\mathsf {B\scriptstyle{RANCH}}\mathsf{F\scriptstyle{AST}}$</tex-math> <alternatives><inline-graphic xlink:href="blumenthal-ieq6-2772243.gif"/></alternatives></inline-formula> that runs in <inline-formula><tex-math notation="LaTeX">$\mathcal{O}(n^2\Delta ^2+n^3)$</tex-math><alternatives> <inline-graphic xlink:href="blumenthal-ieq7-2772243.gif"/></alternatives></inline-formula> time and computes an only slightly less accurate lower bound. The lower bounds produced by <inline-formula><tex-math notation="LaTeX">$\mathsf {B\scriptstyle{RANCH}}$</tex-math><alternatives><inline-graphic xlink:href="blumenthal-ieq8-2772243.gif"/> </alternatives></inline-formula> and <inline-formula><tex-math notation="LaTeX">$\mathsf {B\scriptstyle{RANCH}}\mathsf{F\scriptstyle{AST}}$</tex-math><alternatives> <inline-graphic xlink:href="blumenthal-ieq9-2772243.gif"/></alternatives></inline-formula> are shown to be pseudo-metrics on a collection of graphs. Finally, we suggest an anytime algorithm <inline-formula> <tex-math notation="LaTeX">$\mathsf {B\scriptstyle{RANCH}}\mathsf{T\scriptstyle{IGHT}}$</tex-math><alternatives> <inline-graphic xlink:href="blumenthal-ieq10-2772243.gif"/></alternatives></inline-formula> that iteratively improves <inline-formula><tex-math notation="LaTeX">$\mathsf {B\scriptstyle{RANCH}}$</tex-math><alternatives> <inline-graphic xlink:href="blumenthal-ieq11-2772243.gif"/></alternatives></inline-formula>’s lower bound. <inline-formula><tex-math notation="LaTeX">$\mathsf {B\scriptstyle{RANCH}}\mathsf{T\scriptstyle{IGHT}}$</tex-math> <alternatives><inline-graphic xlink:href="blumenthal-ieq12-2772243.gif"/></alternatives></inline-formula> runs in <inline-formula><tex-math notation="LaTeX">$\mathcal{O}(n^3\Delta ^2+I(n^2\Delta ^3+n^3))$</tex-math><alternatives> <inline-graphic xlink:href="blumenthal-ieq13-2772243.gif"/></alternatives></inline-formula> time, where the number of iterations <inline-formula><tex-math notation="LaTeX">$I$</tex-math><alternatives> <inline-graphic xlink:href="blumenthal-ieq14-2772243.gif"/></alternatives></inline-formula> is controlled by the user. A detailed experimental evaluation shows that all suggested algorithms are Pareto optimal, that they are very effective when used as filters for edit distance range queries, and that they perform excellently when used within classification frameworks.

[1]  Qing Liu,et al.  A Partition-Based Approach to Structure Similarity Search , 2013, Proc. VLDB Endow..

[2]  Lei Zou,et al.  Graph similarity search with edit distance constraint in large graph databases , 2013, CIKM.

[3]  Kaspar Riesen,et al.  Approximate graph edit distance computation by means of bipartite graph matching , 2009, Image Vis. Comput..

[4]  David B. Blumenthal,et al.  Exact Computation of Graph Edit Distance for Uniform and Non-uniform Metric Edit Costs , 2017, GbRPR.

[5]  Francesc Serratosa,et al.  Speeding up Fast Bipartite Graph Matching Through a New Cost Matrix , 2015, Int. J. Pattern Recognit. Artif. Intell..

[6]  H. Kuhn The Hungarian method for the assignment problem , 1955 .

[7]  Brendan D. McKay,et al.  Practical graph isomorphism, II , 2013, J. Symb. Comput..

[8]  Kaspar Riesen,et al.  Approximate Graph Edit Distance Guided by Bipartite Matching of Bags of Walks , 2014, S+SSPR.

[9]  Xuemin Lin,et al.  Efficient Graph Similarity Joins with Edit Distance Constraints , 2012, 2012 IEEE 28th International Conference on Data Engineering.

[10]  Kaspar Riesen,et al.  Speeding Up Graph Edit Distance Computation with a Bipartite Heuristic , 2007, MLG.

[11]  Kaspar Riesen,et al.  Learning Heuristics to Reduce the Overestimation of Bipartite Graph Edit Distance Approximation , 2015, MLDM.

[12]  Anthony K. H. Tung,et al.  An Efficient Graph Indexing Method , 2012, 2012 IEEE 28th International Conference on Data Engineering.

[13]  Lei Zou,et al.  Efficient Graph Similarity Search Over Large Graph Databases , 2015, IEEE Transactions on Knowledge and Data Engineering.

[14]  Jean-Yves Ramel,et al.  An Exact Graph Edit Distance Algorithm for Solving Pattern Recognition Problems , 2015, ICPRAM.

[15]  Romain Raveaux,et al.  New binary linear programming formulation to compute the graph edit distance , 2017, Pattern Recognit..

[16]  Kaspar Riesen,et al.  A Survey on Applications of Bipartite Graph Edit Distance , 2017, GbRPR.

[17]  Kaspar Riesen,et al.  Combining Bipartite Graph Matching and Beam Search for Graph Edit Distance Approximation , 2014, ANNPR.

[18]  Kaspar Riesen,et al.  Estimating Graph Edit Distance Using Lower and Upper Bounds of Bipartite Approximations , 2015, Int. J. Pattern Recognit. Artif. Intell..

[19]  Benoit Gaüzère,et al.  Approximate Graph Edit Distance Computation Combining Bipartite Matching and Exact Neighborhood Substructure Distance , 2015, GbRPR.

[20]  David B. Blumenthal,et al.  Correcting and Speeding-Up Bounds for Non-Uniform Graph Edit Distance , 2017, 2017 IEEE 33rd International Conference on Data Engineering (ICDE).

[21]  Francesc Serratosa,et al.  Fast computation of Bipartite graph matching , 2014, Pattern Recognit. Lett..

[22]  Benoit Gaüzère,et al.  Graph edit distance as a quadratic assignment problem , 2017, Pattern Recognit. Lett..

[23]  Tyler Seacrest Packings and realizations of degree sequences with specified substructures , 2011 .

[24]  Kaspar Riesen,et al.  Approximation of Graph Edit Distance in Quadratic Time , 2015, GbRPR.

[25]  Mario Vento,et al.  Graph Matching and Learning in Pattern Recognition in the Last 10 Years , 2014, Int. J. Pattern Recognit. Artif. Intell..

[26]  Henk Meijer,et al.  An algorithm for computing simple k-factors , 2009, Inf. Process. Lett..

[27]  Kaspar Riesen,et al.  Graph-Based Keyword Spotting in Historical Handwritten Documents , 2016, S+SSPR.

[28]  Ge Yu,et al.  Efficiently Indexing Large Sparse Graphs for Similarity Search , 2012, IEEE Transactions on Knowledge and Data Engineering.

[29]  Kaspar Riesen,et al.  IAM Graph Database Repository for Graph Based Pattern Recognition and Machine Learning , 2008, SSPR/SPR.

[30]  Karam Gouda,et al.  CSI_GED: An efficient approach for graph edit similarity computation , 2016, 2016 IEEE 32nd International Conference on Data Engineering (ICDE).

[31]  J. Munkres ALGORITHMS FOR THE ASSIGNMENT AND TRANSIORTATION tROBLEMS* , 1957 .

[32]  Francesc Serratosa,et al.  Graph Edit Distance: Moving from global to local structure to solve the graph-matching problem , 2015, Pattern Recognit. Lett..

[33]  Kaspar Riesen,et al.  Fast Suboptimal Algorithms for the Computation of Graph Edit Distance , 2006, SSPR/SPR.

[34]  Alfred O. Hero,et al.  A binary linear programming formulation of the graph edit distance , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[35]  Kaspar Riesen,et al.  Computing Upper and Lower Bounds of Graph Edit Distance in Cubic Time , 2014, ANNPR.