Locating text in images using matched wavelets

In this paper we have proposed a novel scheme for locating text regions in an image. The method is based on multiresolution wavelet analysis. We used matched wavelets to capture textural characteristics of image regions. A clustering based approach has been proposed for estimating globally matched wavelets (GMWs) for a given collection of images. Using these GMWs, we generate feature vectors for segmentation and identification of text regions in an image. Our method, unlike most of the other methods, does not require any a priori information about the font, font size, scripts, geometric transformation, distortion or background texture. We have tested our method on various categories of images like license plates, posters, hand written documents and document images etc. The results show proposed method to be a robust, versatile and effective tool for text extraction from images.

[1]  Po-Yueh Chen,et al.  DWT Based Text Localization , 2004 .

[2]  Alexander J. Smola,et al.  Learning with Kernels: support vector machines, regularization, optimization, and beyond , 2001, Adaptive computation and machine learning series.

[3]  Bernhard Schölkopf,et al.  Learning with Kernels: Support Vector Machines, Regularization, Optimization, and Beyond , 2005, IEEE Transactions on Neural Networks.

[4]  Edward M. Riseman,et al.  TextFinder: An Automatic System to Detect and Recognize Text In Images , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[5]  Anil K. Jain,et al.  Text segmentation using gabor filters for automatic document processing , 1992, Machine Vision and Applications.

[6]  Anil K. Jain,et al.  A multi-channel filtering approach to texture segmentation , 1991, Proceedings. 1991 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[7]  A. Gupta,et al.  A new approach for estimation of wavelets with non-separable kernel from a given image , 2004, 2004 International Conference on Signal Processing and Communications, 2004. SPCOM '04..

[8]  Jean-Philippe Thiran,et al.  Text identification in complex background using SVM , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[9]  Rama Chellappa,et al.  Separability based tree structured local basis selection for texture classification , 1994, Proceedings of 1st International Conference on Image Processing.

[10]  Kyung-Ae Moon,et al.  An efficient extraction of character string positions using morphological operator , 2000, Smc 2000 conference proceedings. 2000 ieee international conference on systems, man and cybernetics. 'cybernetics evolving to systems, humans, organizations, and their complex interactions' (cat. no.0.

[11]  Anil K. Jain,et al.  Locating text in complex color images , 1995, Pattern Recognit..

[12]  Mausumi Acharyya,et al.  Multiscale Segmentation of Document Images Using M -Band Wavelets , 2001, CAIP.