Document Retrieval Based on Logo Spotting Using Key-Point Matching

In this paper, we present an approach to retrieve documents based on logo spotting and recognition. A document retrieval system is proposed inspired from our previous method for logo spotting and recognition. First, the key-points from both the query logo images and a given set of document images are extracted and described by SIFT descriptor, and are matched in the SIFT feature space. They are filtered by the nearest neighbor matching rule based on the two nearest neighbors and are then post-filtered with BRIEF descriptor. Secondly, logo segmentation is performed using spatial density-based clustering, and homography is used to filter the matched key-points as a post processing. Finally, for ranking, we use two measures which are calculated based on the number of matched key-points. Tested on a well-known benchmark database of real world documents containing logos Tobacco-800, our approach achieves better performance than the state-of-the-art methods.

[1]  David S. Doermann,et al.  Logo Retrieval in Document Images , 2012, 2012 10th IAPR International Workshop on Document Analysis Systems.

[2]  Francesca Cesarini,et al.  A neural-based architecture for spot-noisy logo recognition , 1997, Proceedings of the Fourth International Conference on Document Analysis and Recognition.

[3]  Muriel Visani,et al.  Logo spotting for document categorization , 2012, Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012).

[4]  Mathieu Delalandre,et al.  A Contour-Based Method for Logo Detection , 2011, 2011 International Conference on Document Analysis and Recognition.

[5]  Josep Lladós,et al.  Efficient logo retrieval through hashing shape context descriptors , 2010, DAS '10.

[6]  David G. Lowe,et al.  Object recognition from local scale-invariant features , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[7]  Shlomo Argamon,et al.  Building a test collection for complex document information processing , 2006, SIGIR.

[8]  Alberto Del Bimbo,et al.  Context-Dependent Logo Matching and Recognition , 2013, IEEE Transactions on Image Processing.

[9]  Tuan D. Pham Unconstrained logo detection in document images , 2003, Pattern Recognit..

[10]  David Doermann,et al.  Automatic Document Logo Detection , 2007 .

[11]  Josep Lladós,et al.  Logo Spotting by a Bag-of-words Approach for Document Categorization , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[12]  Youbin Chen,et al.  Logo Detection in Document Images Based on Boundary Extension of Feature Rectangles , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[13]  David S. Doermann,et al.  Logo Matching for Document Image Retrieval , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[14]  David S. Doermann,et al.  Multi-scale Structural Saliency for Signature Detection , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[15]  Sandy Irani,et al.  LOGO DETECTION IN DOCUMENT IMAGES , 1997 .

[16]  Matthijs C. Dorst Distinctive Image Features from Scale-Invariant Keypoints , 2011 .

[17]  Vincent Lepetit,et al.  BRIEF: Binary Robust Independent Elementary Features , 2010, ECCV.

[18]  Hans-Peter Kriegel,et al.  A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise , 1996, KDD.

[19]  Ehud Rivlin,et al.  Logo recognition using geometric invariants , 1993, Proceedings of 2nd International Conference on Document Analysis and Recognition (ICDAR '93).

[20]  Muriel Visani,et al.  Improving Logo Spotting and Matching for Document Categorization by a Post-Filter Based on Homography , 2013, 2013 12th International Conference on Document Analysis and Recognition.