Locating characters in scene images using frequency features

This paper presents a language-independent method for locating rectangular text regions in natural scene images. The method relies on two frequency features, which can be applied in succession or independently: the frequency of edge pixels along vertical and horizontal scan lines, and the fundamental frequency in the Fourier domain. The research focuses on these frequency features because they capture an intuitive property of text images. The detection of rectangles using a Hough transform is also addressed. Text that is meaningful to many viewers usually appears inside rectangles whose colours contrast strongly with the background, so detecting rectangles can reasonably be expected to help locate the desired text correctly in natural outdoor scene images.
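The two frequency features named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names `edge_counts` and `fundamental_frequency`, the gradient threshold of 50, and the use of a simple neighbour difference as the edge detector are all assumptions made for illustration. The fundamental frequency is approximated here by the strongest non-DC peak of the one-dimensional Fourier spectrum of a scan line.

```python
import numpy as np


def edge_counts(gray, threshold=50.0):
    """Count edge pixels crossed by each horizontal and vertical scan line.

    An "edge" is assumed to be a neighbour-to-neighbour intensity jump
    larger than `threshold` (a hypothetical value, not from the paper).
    """
    g = np.asarray(gray, dtype=float)
    h_edges = np.abs(np.diff(g, axis=1)) > threshold  # jumps along each row
    v_edges = np.abs(np.diff(g, axis=0)) > threshold  # jumps along each column
    row_counts = h_edges.sum(axis=1)  # one count per horizontal scan line
    col_counts = v_edges.sum(axis=0)  # one count per vertical scan line
    return row_counts, col_counts


def fundamental_frequency(signal):
    """Return the strongest non-DC frequency of a 1-D signal, in cycles per pixel."""
    s = np.asarray(signal, dtype=float)
    s = s - s.mean()                      # remove the DC component
    spectrum = np.abs(np.fft.rfft(s))     # magnitude spectrum of the scan line
    freqs = np.fft.rfftfreq(s.size)       # matching frequency axis
    k = int(np.argmax(spectrum[1:])) + 1  # skip bin 0 (DC)
    return freqs[k]


# Synthetic example: a band of vertical stripes (period 8 pixels) standing in for text strokes.
image = np.zeros((32, 64))
image[8:24, :] = (np.arange(64) // 4) % 2 * 255

rows, cols = edge_counts(image)
candidate_rows = np.flatnonzero(rows > 0)      # scan lines that cross many edges
stroke_freq = fundamental_frequency(image[10])  # dominant frequency inside the band
```

Scan lines with a high edge count, and a pronounced spectral peak at a plausible stroke spacing, would mark candidate text rows and columns. How the paper thresholds or combines the two features is not stated in the abstract, so that step is left out.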
