Holistic Comparison of Text Images for Content-Based Retrieval

The accurate recognition of text that appears in images/videos using analytical character recognition methods is often very difficult, despite the fact that the text might be correctly localized, segmented and binarized. This is mainly due to changing features of the text such as various fonts, or noise factors embedded in the image which are inherited from the complex background. In this paper, we treat the problem of comparing text images for content-based retrieval purposes, by presenting a holistic approach to this issue. First, the shape of text is represented by estimating the salient points in the text image. Then, alignment shape methods are used to establish the correspondence of the salient points. Finally, a measure is suggested to compute the dissimilarity between two text images based on the generated correspondence. Empirical evaluation of the proposed holistic comparison method has demonstrated its very good performance

[1]  Jitendra Malik,et al.  Recognizing objects in adversarial clutter: breaking a visual CAPTCHA , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[2]  R. Manmatha,et al.  Word image matching using dynamic time warping , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[3]  N. Otsu A threshold selection method from gray level histograms , 1979 .

[4]  Graham Rawlinson,et al.  The Significance of Letter Position in Word Recognition , 2007, IEEE Aerospace and Electronic Systems Magazine.

[5]  Allen R. Hanson,et al.  Automatic Sign Detection and Recognition in Natural Scenes , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Workshops.

[6]  H. C. Longuet-Higgins,et al.  An algorithm for associating the features of two images , 1991, Proceedings of the Royal Society of London. Series B: Biological Sciences.

[7]  Allen R. Hanson,et al.  A Hierarchical Approach to Sign Recognition , 2005, 2005 Seventh IEEE Workshops on Applications of Computer Vision (WACV/MOTION'05) - Volume 1.

[8]  Aaas News,et al.  Book Reviews , 1893, Buffalo Medical and Surgical Journal.

[9]  Giovanni Soda,et al.  Indexing and retrieval of words in old documents , 2003, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings..

[10]  Edward M. Riseman,et al.  Word spotting: a new approach to indexing handwriting , 1996, Proceedings CVPR IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[11]  Shaolei Feng,et al.  Using Corner Feature Correspondences to Rank Word Images by Similarity , 2003, 2003 Conference on Computer Vision and Pattern Recognition Workshop.

[12]  Jitendra Malik,et al.  Efficient shape matching using shape contexts , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[13]  G. Moy,et al.  Distortion estimation techniques in solving visual CAPTCHAs , 2004, CVPR 2004.

[14]  Seiichi Uchida,et al.  A Survey of Elastic Matching Techniques for Handwritten Character Recognition , 2005, IEICE Trans. Inf. Syst..

[15]  Rangachar Kasturi,et al.  Video object detection and matching , 2003 .

[16]  Christopher G. Harris,et al.  A Combined Corner and Edge Detector , 1988, Alvey Vision Conference.

[17]  Akira Nakamura,et al.  Caption text recognition in video frames by MAP matching , 2003, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings..

[18]  R. Manmatha,et al.  Holistic word recognition for handwritten historical documents , 2004, First International Workshop on Document Image Analysis for Libraries, 2004. Proceedings..

[19]  Bernd Freisleben,et al.  Adaptive Fuzzy Text Segmentation in Images with Complex Backgrounds Using Color and Texture , 2005, CAIP.

[20]  Xian-Sheng Hua,et al.  Automatic performance evaluation for video text detection , 2001, Proceedings of Sixth International Conference on Document Analysis and Recognition.

[21]  Bernd Freisleben,et al.  Unsupervised Text Segmentation Using Color and Wavelet Features , 2004, CIVR.

[22]  Nicu Sebe,et al.  Evaluation of Salient Point Techniques , 2002, CIVR.

[23]  Stan Sclaroff,et al.  Online and offline character recognition using alignment to prototypes , 2005, Eighth International Conference on Document Analysis and Recognition (ICDAR'05).