Stamps Detection and Classification Using Simple Features Ensemble

The paper addresses a problem of detection and classification of rubber stamp instances in scanned documents. A variety of methods from the field of image processing, pattern recognition, and some heuristic are utilized. Presented method works on typical stamps of different colors and shapes. For color images, color space transformation is applied in order to find potential color stamps. Monochrome stamps are detected through shape specific algorithms. Following feature extraction stage, identified candidates are subjected to classification task using a set of shape descriptors. Selected elementary properties form an ensemble of features which is rotation, scale, and translation invariant; hence this approach is document size and orientation independent. We perform two-tier classification in order to discriminate between stamps and no-stamps and then classify stamps in terms of their shape. The experiments carried out on a considerable set of real documents gathered from the Internet showed high potential of the proposed method.

[1]  N. Otsu A threshold selection method from gray level histograms , 1979 .

[2]  Mohan S. Kankanhalli,et al.  Shape Measures for Content Based Image Retrieval: A Comparison , 1997, Inf. Process. Manag..

[3]  Santiago T. Pérez,et al.  Image Processing for Pollen Classification , 2012 .

[4]  Ashok Kumar,et al.  Neural Networks for Fast Estimation of Social Network Centrality Measures , 2015 .

[5]  Umapada Pal,et al.  Seal Detection and Recognition: An Approach for Document Indexing , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[6]  Barbora Micenková,et al.  Stamp Detection in Color Document Images , 2011, 2011 International Conference on Document Analysis and Recognition.

[7]  Mohammad Khajehzadeh,et al.  Economic Design of Foundation Using Harmony Search Algorithm , 2011 .

[8]  Liang Cai,et al.  A Robust Registration and Detection Method for Color Seal Verification , 2005, ICIC.

[9]  Hossein Pourghassem,et al.  A Novel Logo Detection and Recognition Framework for Separated Part Logos in Document Images , 2011 .

[10]  Miroslaw Bober,et al.  MPEG-7 visual shape descriptors , 2001, IEEE Trans. Circuits Syst. Video Technol..

[11]  Dariusz Frejlichowski An Algorithm for the Automatic Analysis of Characters Located on Car License Plates , 2013, ICIAR.

[12]  Arno Formella,et al.  Pollen classification using brightness-based and shape-based descriptors , 2004, ICPR 2004.

[13]  Sandy Irani,et al.  LOGO DETECTION IN DOCUMENT IMAGES , 1997 .

[14]  Pawel Forczmanski,et al.  General Shape Analysis Applied to Stamps Retrieval from Scanned Documents , 2010, AIMSA.

[15]  Arno Formella,et al.  Pollen classification using brightness-based and shape-based descriptors , 2004, Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004..

[16]  Zygmunt Pizlo,et al.  Shape Perception in Human and Computer Vision: An Interdisciplinary Perspective , 2013 .

[17]  Pawel Forczmanski,et al.  Low-Level Image Features for Stamps Detection and Classification , 2013, CORES.

[18]  Andy C. Downton,et al.  Configurable Text Stamp Identification Tool with Application of Fuzzy Logic , 2004, Document Analysis Systems.

[19]  Josef Kittler,et al.  A survey of the hough transform , 1988, Comput. Vis. Graph. Image Process..

[20]  Sethu Vijayakumar,et al.  2D Shape Classification and Retrieval , 2005, IJCAI.

[21]  Richard O. Duda,et al.  Use of the Hough transformation to detect lines and curves in pictures , 1972, CACM.

[22]  Xing Xie,et al.  Spatial pyramid mining for logo detection in natural scenes , 2008, 2008 IEEE International Conference on Multimedia and Expo.

[23]  Shamik Sural,et al.  Colored Rubber Stamp Removal from Document Images , 2013, PReMI.

[24]  Guojun Lu,et al.  Review of shape representation and description techniques , 2004, Pattern Recognit..

[25]  F. Albregtsen Statistical Texture Measures Computed from Gray Level Coocurrence Matrices , 2008 .

[26]  Paul L. Rosin Measuring shape: ellipticity, rectangularity, and triangularity , 2003, Machine Vision and Applications.

[27]  Tuan D. Pham Unconstrained logo detection in document images , 2003, Pattern Recognit..

[28]  Dariusz Frejlichowski An Experimental Comparison of Seven Shape Descriptors in the General Shape Analysis Problem , 2010, ICIAR.

[29]  J. IIVARINENHelsinki Efficiency of Simple Shape Descriptors , 1997 .

[30]  Pawel Forczmanski,et al.  Efficient Stamps Classification by Means of Point Distance Histogram and Discrete Cosine Transform , 2011, Computer Recognition Systems 4.

[31]  Pawel Forczmanski,et al.  Robust Stamps Detection and Classification by Means of General Shape Analysis , 2010, ICCVG.

[32]  David S. Doermann,et al.  Logo Retrieval in Document Images , 2012, 2012 10th IAPR International Workshop on Document Analysis Systems.

[33]  Umapada Pal,et al.  Document seal detection using GHT and character proximity graphs , 2011, Pattern Recognit..

[34]  David S. Doermann,et al.  A robust stamp detection framework on degraded documents , 2006, Electronic Imaging.

[35]  Robert M. Haralick,et al.  Textural Features for Image Classification , 1973, IEEE Trans. Syst. Man Cybern..

[36]  Kanghun Jeong,et al.  Object Detection Using FAST Corner Detector Based on Smartphone Platforms , 2011, 2011 First ACIS/JNU International Conference on Computers, Networks, Systems and Industrial Engineering.

[37]  David Doermann,et al.  Automatic Document Logo Detection , 2007 .