Efficient symbol retrieval by building a symbol index from a collection of line drawings

Symbol retrieval is important for content-based search in digital libraries and for automatic interpretation of line drawings. In this work, we present a complete symbol retrieval system. The proposed system has an off-line content-analysis stage, where the contents of a database of line drawings are represented as a symbol index, which is a compact indexable representation of the database. Such representation allows efficient on-line query retrieval. Within the retrieval system, three methods are presented. First, a feature grouping method for identifying local regions of interest (ROIs) in the drawings. The found ROIs represent symbols' parts. Second, a clustering method based on geometric matching, is used to cluster the similar parts from all the drawings together. A symbol index is then constructed from the clusters' representatives. Finally, the ROIs of a query symbol are matched to the clusters' representatives. The matching symbols' parts are retrieved from the clusters, and spatial verification is performed on the matching parts. By using the symbol index we are able to achieve a query look-up time that is independent of the database size, and dependent on the size of the symbol index. The retrieval system achieves higher recall and precision than state-of-the-art methods.

[1]  O. J. Vrieze,et al.  Kohonen Network , 1995, Artificial Neural Networks.

[2]  Vincent Lepetit,et al.  Gradient Response Maps for Real-Time Detection of Textureless Objects , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3]  Michael Isard,et al.  Total Recall: Automatic Query Expansion with a Generative Feature Model for Object Retrieval , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[4]  Allen Gersho,et al.  Vector quantization and signal compression , 1991, The Kluwer international series in engineering and computer science.

[5]  David G. Lowe,et al.  Object recognition from local scale-invariant features , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[6]  Jean-Yves Ramel,et al.  A Content Spotting System for Line Drawing Graphic Document Images , 2010, 2010 20th International Conference on Pattern Recognition.

[7]  Donald E. Knuth,et al.  The art of computer programming, volume 3: (2nd ed.) sorting and searching , 1998 .

[8]  Gabriela Csurka,et al.  Visual categorization with bags of keypoints , 2002, eccv 2004.

[9]  Josep Lladós,et al.  A performance evaluation protocol for symbol spotting systems in terms of recognition and location indices , 2009, International Journal on Document Analysis and Recognition (IJDAR).

[10]  D. Jacobs Grouping for Recognition , 1989 .

[11]  Vincent Lepetit,et al.  Multimodal templates for real-time detection of texture-less objects in heavily cluttered scenes , 2011, 2011 International Conference on Computer Vision.

[12]  Josep Lladós,et al.  Symbol spotting in vectorized technical drawings through a lookup table of region strings , 2009, Pattern Analysis and Applications.

[13]  Lei Wang,et al.  Handling Significant Scale Difference for Object Retrieval in a Supermarket , 2009, 2009 Digital Image Computing: Techniques and Applications.

[14]  David W. Jacobs,et al.  Robust and Efficient Detection of Salient Convex Groups , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[15]  S. P. Lloyd,et al.  Least squares quantization in PCM , 1982, IEEE Trans. Inf. Theory.

[16]  Thomas M. Breuel,et al.  On the Use of Geometric Matching for Both: Isolated Symbol Recognition and Symbol Spotting , 2011, GREC.

[17]  Robert M. Gray,et al.  An Algorithm for Vector Quantizer Design , 1980, IEEE Trans. Commun..

[18]  Michael Isard,et al.  Object retrieval with large vocabularies and fast spatial matching , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[19]  Frédéric Jurie,et al.  Creating efficient codebooks for visual recognition , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[20]  Tsuhan Chen,et al.  Image retrieval with geometry-preserving visual phrases , 2011, CVPR 2011.

[21]  Thomas M. Breuel,et al.  Statistical Grouping for Segmenting Symbols Parts from Line Drawings, with Application to Symbol Spotting , 2011, 2011 International Conference on Document Analysis and Recognition.

[22]  Marcal Rusi Relational Indexing of Vectorial Primitives for Symbol Spotting in Line-Drawing Images , 2009 .

[23]  Josep Lladós,et al.  Vectorial Signatures for Symbol Discrimination , 2003, GREC.

[24]  Salvatore Tabbone,et al.  A Symbol Spotting Approach Based on the Vector Model and a Visual Vocabulary , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[25]  Thomas M. Breuel,et al.  Building a Symbol Library from Technical Drawings by Identifying Repeating Patterns , 2011, GREC.

[26]  Andrew Zisserman,et al.  Video Google: Efficient Visual Search of Videos , 2006, Toward Category-Level Object Recognition.