Building a Symbol Library from Technical Drawings by Identifying Repeating Patterns

This paper describes a novel approach for extracting a library of symbols from a large collection of line drawings. This symbol library is a compact and indexable representation of the line drawings. Such a representation is important for efficient symbol retrieval. The proposed approach first identifies the candidate patterns in all images, and then it clusters the similar ones together to create a set of clusters. A representative pattern is chosen from each cluster, and these representative patterns form a library of symbols. We have tested our approach on a database of line drawings, and it achieved high accuracy in capturing and representing the contents of the line drawings.

[1]  Salvatore Tabbone,et al.  A Symbol Spotting Approach Based on the Vector Model and a Visual Vocabulary , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[2]  Tony P. Pridmore,et al.  Generation of synthetic documents for performance evaluation of symbol recognition & spotting systems , 2010, International Journal on Document Analysis and Recognition (IJDAR).

[3]  Jiri Matas,et al.  Image Matching and Retrieval by Repetitive Patterns , 2010, 2010 20th International Conference on Pattern Recognition.

[4]  Jitendra Malik,et al.  Detecting, localizing and grouping repeated scene elements from an image , 1996, ECCV.

[5]  Roberto Cipolla,et al.  Computer Vision — ECCV '96 , 1996, Lecture Notes in Computer Science.

[6]  Thomas M. Breuel,et al.  Statistical Grouping for Segmenting Symbols Parts from Line Drawings, with Application to Symbol Spotting , 2011, 2011 International Conference on Document Analysis and Recognition.

[7]  Jean-Yves Ramel,et al.  A Content Spotting System for Line Drawing Graphic Document Images , 2010, 2010 20th International Conference on Pattern Recognition.

[8]  Thomas M. Breuel,et al.  Implementation techniques for geometric branch-and-bound matching methods , 2003, Comput. Vis. Image Underst..

[9]  Tony P. Pridmore,et al.  Building Synthetic Graphical Documents for Performance Evaluation , 2007, GREC.

[10]  David A. Forsyth,et al.  Shape, Contour and Grouping in Computer Vision , 1999, Lecture Notes in Computer Science.

[11]  Josep Lladós,et al.  A graph grammar to recognize textured symbols , 2001, Proceedings of Sixth International Conference on Document Analysis and Recognition.

[12]  Gabriela Csurka,et al.  Visual categorization with bags of keypoints , 2002, eccv 2004.

[13]  Luc Van Gool,et al.  Noncombinatorial Detection of Regular Repetitions under Perspective Skew , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[14]  Yanxi Liu,et al.  A computational model for repeated pattern perception using frieze and wallpaper groups , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[15]  Andrew Zisserman,et al.  Geometric Grouping of Repeated Elements within Images , 1999, Shape, Contour and Grouping in Computer Vision.