An Adaptive Zoning Technique for Word Spotting Using Dynamic Time Warping

Zoning features have been proved one of the most efficient statistical features which provide high speed and low complexity word matching. They are calculated by the density of pixels or pattern characteristics in several zones that the pattern frame is divided. In this paper, an adaptive zoning technique for efficient word spotting is introduced. The main idea is that the zoning features are extracted after cutting the query word in vertical zones, according to its length and pixel distribution along the horizontal axis, and adjusting these boundaries optimally with the corresponding zones in the candidate match-word using Dynamic Time Warping (DTW). This adjustment is performed by coupling every zone of the query word to the corresponding zone of each candidate match-word with the use of the corresponding warping matrix. This process absorbs the ambiguities between the query and the candidate match words and due to this fact it can be applied to both machine-printed and handwritten document images. The proposed word spotting technique is tested using the pixel density as a characteristic feature in every zone and an improvement is recorded compared to other state-of-the-art methods.

[1]  F. Perronnin,et al.  Local gradient histogram features for word spotting in unconstrained handwritten documents , 2008 .

[2]  Basilios Gatos,et al.  Efficient Word Recognition Using a Pixel-Based Dissimilarity Measure , 2011, 2011 International Conference on Document Analysis and Recognition.

[3]  R. Manmatha,et al.  Word image matching using dynamic time warping , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[4]  Sebastiano Impedovo,et al.  Optimal zoning design by genetic algorithms , 2006, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[5]  A. Papandreou,et al.  Adaptive Zoning Features for Character and Word Recognition , 2011, 2011 International Conference on Document Analysis and Recognition.

[6]  Sebastiano Impedovo,et al.  Feature Membership Functions in Voronoi-Based Zoning , 2009, AI*IA.

[7]  Jianchang Mao,et al.  A comparative study of different classifiers for handprinted character recognition , 1994 .

[8]  Ioannis Pratikakis,et al.  Adaptive degraded document image binarization , 2006, Pattern Recognit..

[9]  V.L. Lajish,et al.  Handwritten Character Recognition using Perceptual Fuzzy-Zoning and Class Modular Neural Networks , 2007, 2007 Innovations in Information Technologies (IIT).

[10]  Jeremy Bentham,et al.  The manuscripts of Jeremy Bentham : a chronological index to the collection in the Library of University College London , 1981 .

[11]  Sergios Theodoridis,et al.  Keyword-guided word spotting in historical printed documents using synthetic data and user feedback , 2007, International Journal of Document Analysis and Recognition (IJDAR).

[12]  Edward M. Riseman,et al.  Word spotting: a new approach to indexing handwriting , 1996, Proceedings CVPR IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[13]  Nikos Papamarkos,et al.  Image retrieval systems based on compact shape descriptor and relevance feedback information , 2011, J. Vis. Commun. Image Represent..

[14]  Madasu Hanmandlu,et al.  Unconstrained handwritten character recognition based on fuzzy logic , 2003, Pattern Recognit..

[15]  Alicia Fornés,et al.  On the Influence of Word Representations for Handwritten Word Spotting in Historical Documents , 2012, Int. J. Pattern Recognit. Artif. Intell..

[16]  A. Papandreou,et al.  Slant estimation and core-region detection for handwritten Latin words , 2014, Pattern Recognit. Lett..

[17]  R. Manmatha,et al.  Holistic word recognition for handwritten historical documents , 2004, First International Workshop on Document Image Analysis for Libraries, 2004. Proceedings..

[18]  Ioannis Pratikakis,et al.  Hybrid Off-Line Cursive Handwriting Word Recognition , 2006, 18th International Conference on Pattern Recognition (ICPR'06).