Towards Searchable Line Drawings, a Content-Based Symbol Retrieval Approach with Variable Query Complexity

Current symbol spotting and retrieval methods are not yet able to achieve the goal of both high accuracy and efficiency on large databases of line drawings. This paper presents an approach for focused symbol retrieval as step towards achieving such a goal by using concepts from image retrieval. During the off-line learning phase of the proposed approach, regions of interest are extracted from the drawings based on feature grouping. The regions are then described using an off-the-shelf descriptor. The similar descriptors are clustered, and finally a visual symbol vocabulary is learned by an SVM classifier. The vocabulary is constructed assuming no knowledge of the contents of the drawings. During on-line retrieval, the classifier recognizes the descriptors of query regions. A query can be a partial or a complete symbol, can contain contextual noise around a symbol or more than one symbol. Experimental results are presented for a database of architectural floor plans.

[1]  Ernest Valveny,et al.  Report on the Symbol Recognition and Spotting Contest , 2011, GREC.

[2]  O. J. Vrieze,et al.  Kohonen Network , 1995, Artificial Neural Networks.

[3]  Jean-Yves Ramel,et al.  A Content Spotting System for Line Drawing Graphic Document Images , 2010, 2010 20th International Conference on Pattern Recognition.

[4]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[5]  Frédéric Jurie,et al.  Creating efficient codebooks for visual recognition , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[6]  R. Fedorak,et al.  Trends and challenges , 1996, Proceedings of Nonvolatile Memory Technology Conference.

[7]  Allen Gersho,et al.  Vector quantization and signal compression , 1991, The Kluwer international series in engineering and computer science.

[8]  Su Yang Symbol Recognition via Statistical Integration of Pixel-Level Constraint Histograms: A New Descriptor , 2005, IEEE Trans. Pattern Anal. Mach. Intell..

[9]  Thomas M. Breuel,et al.  Statistical Grouping for Segmenting Symbols Parts from Line Drawings, with Application to Symbol Spotting , 2011, 2011 International Conference on Document Analysis and Recognition.

[10]  Tony P. Pridmore,et al.  Generation of synthetic documents for performance evaluation of symbol recognition & spotting systems , 2010, International Journal on Document Analysis and Recognition (IJDAR).

[11]  Salvatore Tabbone,et al.  A Symbol Spotting Approach Based on the Vector Model and a Visual Vocabulary , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[12]  Thomas M. Breuel,et al.  Building a Symbol Library from Technical Drawings by Identifying Repeating Patterns , 2011, GREC.

[13]  Thomas M. Breuel,et al.  Efficient symbol retrieval by building a symbol index from a collection of line drawings , 2013, Electronic Imaging.

[14]  Gabriela Csurka,et al.  Visual categorization with bags of keypoints , 2002, eccv 2004.

[15]  Robert M. Gray,et al.  An Algorithm for Vector Quantizer Design , 1980, IEEE Trans. Commun..

[16]  Tony P. Pridmore,et al.  Building Synthetic Graphical Documents for Performance Evaluation , 2007, GREC.

[17]  David G. Lowe,et al.  Object recognition from local scale-invariant features , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[18]  Andrew Zisserman,et al.  Video Google: Efficient Visual Search of Videos , 2006, Toward Category-Level Object Recognition.

[19]  S. P. Lloyd,et al.  Least squares quantization in PCM , 1982, IEEE Trans. Inf. Theory.

[20]  Cordelia Schmid,et al.  Toward Category-Level Object Recognition , 2006, Toward Category-Level Object Recognition.

[21]  David W. Jacobs,et al.  Robust and Efficient Detection of Salient Convex Groups , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[22]  D. Jacobs Grouping for Recognition , 1989 .