Predicting the correctness of node assignments in bipartite graph matching

We define various features that characterize individual node assignments.Based on the features we train a classifier to predict the correctness of assignments.We confirm that our methodology predicts assignment correctness with high accuracy. Due to their high representational power and flexibility, graphs have emerged as a widely used data structure for complex pattern representation. The advantages of graphs are, however, at the expense of a substantial increase of the computational complexity of the associated algorithms. For instance, exact graph similarity or distance can be computed in exponential time only. An algorithmic framework based on bipartite graph matching (allowing graph dissimilarity computation in cubic time) has been presented recently. However, this fast computation leads in general to some incorrect node assignments with respect to a globally optimal graph matching. In this paper we present a novel methodology for predicting which of the node assignments are actually incorrectly assigned by this approximation. To this end, a comprehensive set of features - which numerically characterize the assignments - is defined and extracted from the underlying graphs. These features are in turn used for training a statistical classifier, which can eventually be employed for predicting unseen assignments. The power and applicability of our novel approach is empirically verified on eight real world data sets.

[1]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[2]  Benoit Gaüzère,et al.  Two New Graph Kernels and Applications to Chemoinformatics , 2011, GbRPR.

[3]  Kaspar Riesen,et al.  Greedy Graph Edit Distance , 2015, MLDM.

[4]  Horst Bunke,et al.  Automatic learning of cost functions for graph edit distance , 2007, Inf. Sci..

[5]  Horst Bunke,et al.  Optimal quadratic-time isomorphism of ordered graphs , 1999, Pattern Recognit..

[6]  Mario Vento,et al.  Symbolic Learning vs. Graph Kernels: An Experimental Comparison in a Chemical Application , 2010, ADBIS.

[7]  Kaspar Riesen,et al.  Approximate Graph Edit Distance Guided by Bipartite Matching of Bags of Walks , 2014, S+SSPR.

[8]  Orestis Kostakis,et al.  Classy: fast clustering streams of call-graphs , 2014, Data Mining and Knowledge Discovery.

[9]  Benoit Gaüzère,et al.  Approximate Graph Edit Distance Computation Combining Bipartite Matching and Exact Neighborhood Substructure Distance , 2015, GbRPR.

[10]  Fuhui Long,et al.  Feature selection based on mutual information criteria of max-dependency, max-relevance, and min-redundancy , 2003, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11]  Xuelong Li,et al.  A survey of graph edit distance , 2010, Pattern Analysis and Applications.

[12]  Horst Bunke,et al.  A Graph Matching Based Approach to Fingerprint Classification Using Directional Variance , 2005, AVBPA.

[13]  Alfred O. Hero,et al.  A binary linear programming formulation of the graph edit distance , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[14]  Francesc Serratosa,et al.  Learning graph-matching edit-costs based on the optimality of the oracle's node correspondences , 2015, Pattern Recognit. Lett..

[15]  Mario Vento,et al.  Thirty Years Of Graph Matching In Pattern Recognition , 2004, Int. J. Pattern Recognit. Artif. Intell..

[16]  Kaspar Riesen,et al.  IAM Graph Database Repository for Graph Based Pattern Recognition and Machine Learning , 2008, SSPR/SPR.

[17]  Andrea Torsello,et al.  Polynomial-time metrics for attributed trees , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[18]  Kaspar Riesen,et al.  Approximate graph edit distance computation by means of bipartite graph matching , 2009, Image Vis. Comput..

[19]  Kaspar Riesen,et al.  Graph Similarity Features for HMM-Based Handwriting Recognition in Historical Documents , 2010, 2010 12th International Conference on Frontiers in Handwriting Recognition.

[20]  Kaspar Riesen,et al.  Improving bipartite graph matching by assessing the assignment confidence , 2015, Pattern Recognit. Lett..

[21]  Horst Bunke,et al.  Inexact graph matching for structural pattern recognition , 1983, Pattern Recognit. Lett..

[22]  Alberto Sanfeliu,et al.  On the Graph Edit Distance Cost: Properties and Applications , 2012, Int. J. Pattern Recognit. Artif. Intell..

[23]  Mario Vento,et al.  Graph Matching and Learning in Pattern Recognition in the Last 10 Years , 2014, Int. J. Pattern Recognit. Artif. Intell..

[24]  King-Sun Fu,et al.  A distance measure between attributed relational graphs for pattern recognition , 1983, IEEE Transactions on Systems, Man, and Cybernetics.

[25]  Christine Solnon,et al.  Reactive Tabu Search for Measuring Graph Similarity , 2005, GbRPR.

[26]  Horst Bunke,et al.  On Graphs with Unique Node Labels , 2003, GbRPR.

[27]  Horst Bunke,et al.  An Error-Tolerant Approximate Matching Algorithm for Attributed Planar Graphs and Its Application to Fingerprint Classification , 2004, SSPR/SPR.