Finding Logo and Seal in Historical Document Images - An Object Detection Based Approach

Logos and seals serve to authenticate a document and identify its source, a practice that was already prevalent in the medieval period. Various algorithms exist for detecting logos and seals in document images. A close look at the current state-of-the-art methods reveals that they focus on contemporary document images, and such methods are likely to underperform on historical documents. This is because historical documents pose additional challenges such as extra noise, bleed-through, blurred foreground elements, and low contrast. The proposed method frames logo and seal detection as an object detection problem. Using a deep-learning technique, it counters the problems mentioned above and removes the need for any pre-processing stage, such as layout analysis and/or binarization, in the system pipeline. The experiments were conducted on historical images from the 12th to the 16th century, and the results obtained were very encouraging for detecting logos in historical document images. To the best of our knowledge, this is the first attempt at logo detection in historical document images using an object-detection-based approach.
