Locating text in complex color images

There is a substantial interest in retrieving images from a large database using the textual information contained in the images. An algorithm which will automatically locate the textual regions in the input image will facilitate this task; the optical character recognizer can then be applied to only those regions of the image which contain text. We present a method for automatically locating text in complex color images. The algorithm first finds the approximate locations of text lines using horizontal spatial variance, and then extracts text components in these boxes using color segmentation. The proposed method has been used to locate text in compact disc (CD) and book cover images, as well as in the images of traffic scenes captured by a video camera. Initial results are encouraging and suggest that these algorithms can be used in image retrieval applications.

[1]  Gian Antonio Mian,et al.  Trademark shapes description by string-matching techniques , 1994, Pattern Recognit..

[2]  Shigeru Akamatsu,et al.  Recognizing Characters in Scene Images , 1994, IEEE Trans. Pattern Anal. Mach. Intell..

[3]  K. Wakimoto,et al.  Efficient and Effective Querying by Image Content , 1994 .

[4]  Mohan S. Kankanhalli,et al.  Color matching for image retrieval , 1995, Pattern Recognit. Lett..

[5]  C. H. Chen,et al.  Handbook of Pattern Recognition and Computer Vision , 1993 .

[6]  Jiangying Zhou,et al.  Page segmentation and classification , 1992, CVGIP Graph. Model. Image Process..