Combination of product graph and random walk kernel for symbol spotting in graphical documents

This paper explores the utilization of product graph for spotting symbols on graphical documents. Product graph is intended to find the candidate subgraphs or components in the input graph containing the paths similar to the query graph. The acute angle between two edges and their length ratio are considered as the node labels. In a second step, each of the candidate subgraphs in the input graph is assigned with a distance measure computed by a random walk kernel. Actually it is the minimum of the distances of the component to all the components of the model graph. This distance measure is then used to eliminate dissimilar components. The remaining neighboring components are grouped and the grouped zone is considered as a retrieval zone of a symbol similar to the queried one. The entire method works online, i.e., it doesn't need any preprocessing step. The present paper reports the initial results of the method, which are very encouraging.

[1]  Jean-Yves Ramel,et al.  A Content Spotting System for Line Drawing Graphic Document Images , 2010, 2010 20th International Conference on Pattern Recognition.

[2]  Josep Lladós,et al.  A performance evaluation protocol for symbol spotting systems in terms of recognition and location indices , 2009, International Journal on Document Analysis and Recognition (IJDAR).

[3]  Jean-Yves Ramel,et al.  Subgraph Spotting through Explicit Graph Embedding: An Application to Content Spotting in Graphic Document Images , 2011, 2011 International Conference on Document Analysis and Recognition.

[4]  Josep Lladós,et al.  Symbol Recognition by Error-Tolerant Subgraph Matching between Region Adjacency Graphs , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[5]  Horst Bunke,et al.  Automatic Learning and Recognition of Graphical Symbols in Engineering Drawings , 1995, GREC.

[6]  Thomas Gärtner,et al.  On Graph Kernels: Hardness Results and Efficient Alternatives , 2003, COLT.

[7]  Umapada Pal,et al.  Symbol Spotting in Line Drawings through Graph Paths Hashing , 2011, 2011 International Conference on Document Analysis and Recognition.

[8]  Tony P. Pridmore,et al.  Building Synthetic Graphical Documents for Performance Evaluation , 2007, GREC.

[9]  Bart Lamiroy,et al.  Pattern Recognition Methods for Querying and Browsing Technical Documentation , 2008, CIARP.

[10]  Josep Lladós,et al.  Symbol spotting in vectorized technical drawings through a lookup table of region strings , 2009, Pattern Analysis and Applications.

[11]  Hisashi Kashima,et al.  Marginalized Kernels Between Labeled Graphs , 2003, ICML.