Logo and seal based administrative document image retrieval: A survey

Abstract With the advance of technology, business offices and organizations, together with their clients, create a massive number of administrative documents every day. Administrative documents commonly contain salient entities such as logos, stamps or seals, which serve as a means of authentication and proof of ownership. These entities carry highly discriminative information that can be used effectively for document image retrieval, classification and recognition in document-based applications. Accurate detection and recognition of these entities in document images therefore improves the performance of such applications. To present the state of the art in administrative document image retrieval, this paper surveys the research on retrieval based on seals and logos. All available datasets, feature extraction techniques and classification techniques for logo and seal detection/recognition are discussed systematically. The shortcomings of present technologies for logo and seal based document processing are also highlighted, and avenues for future work are outlined for the benefit of readers. To the best of the authors’ knowledge, there is no existing survey on administrative document image retrieval, and the authors hope that this work will be helpful to researchers in the document analysis community.
