Detection and recognition of text superimposed in images base on layered method

Detection and recognition of text superimposed in complex background has been considered as a challenging problem. Most of the existing methods first locate the text regions and then feed them into OCR package for recognition. However, these methods cannot achieve good recognition performance due to the complex background. For this purpose, this paper proposes a novel text detection and recognition method by using color clustering to divide images into multiple layers according to main color class. In the proposed method, we exploited a connected component analysis to obtain the candidate text regions from each color layer, and then a cascade Adaboost classifier is adopted to determine whether the candidate text regions is real text regions in the corresponding image layer. Because the monochrome color exists in each layer, the interference of the background can be effectively reduced, which can significantly improve the accuracy of text regions localization. Afterwards, an OCR package is used to recognize the text regions which have been located by the cascade Adaboost classifier. Since the text region has a monochrome color, it helps to greatly improve the recognition rate. Finally, the relationship between different layers is used to verify the recognition results by the text location. The experimental results show that the proposed approach significantly outperforms the existing methods.

[1]  Jiri Matas,et al.  On Combining Classifiers , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[2]  Miin-Shen Yang A survey of fuzzy clustering , 1993 .

[3]  Paul A. Viola,et al.  Robust Real-Time Face Detection , 2001, International Journal of Computer Vision.

[4]  J. C. Dunn,et al.  A Fuzzy Relative of the ISODATA Process and Its Use in Detecting Compact Well-Separated Clusters , 1973 .

[5]  James C. Bezdek,et al.  Pattern Recognition with Fuzzy Objective Function Algorithms , 1981, Advanced Applications in Pattern Recognition.

[6]  Ralph Ewerth,et al.  A robust algorithm for text detection in images , 2003, 3rd International Symposium on Image and Signal Processing and Analysis, 2003. ISPA 2003. Proceedings of the.

[7]  Edward M. Riseman,et al.  TextFinder: An Automatic System to Detect and Recognize Text In Images , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[8]  Xinbo Gao,et al.  Chinese text location under complex background using Gabor filter and SVM , 2011, Neurocomputing.

[9]  Cheng-Lin Liu,et al.  Text Localization in Natural Scene Images Based on Conditional Random Field , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[10]  Huitao Luo,et al.  Optimization design of cascaded classifiers , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[11]  Hyeran Byun,et al.  Scene text extraction in natural scene images using hierarchical feature combining and verification , 2004, ICPR 2004.

[12]  Kai Wang,et al.  End-to-end scene text recognition , 2011, 2011 International Conference on Computer Vision.

[13]  Kai Wang,et al.  Word Spotting in the Wild , 2010, ECCV.

[14]  Adam Krzyżak,et al.  Methods of combining multiple classifiers and their applications to handwriting recognition , 1992, IEEE Trans. Syst. Man Cybern..

[15]  Andreas Dengel,et al.  ICDAR 2011 Robust Reading Competition Challenge 2: Reading Text in Scene Images , 2011, 2011 International Conference on Document Analysis and Recognition.

[16]  Kongqiao Wang,et al.  Character location in scene images from digital camera , 2003, Pattern Recognit..

[17]  George Nagy,et al.  Recognition of Printed Chinese Characters , 1966, IEEE Trans. Electron. Comput..

[18]  Simon M. Lucas,et al.  ICDAR 2003 robust reading competitions , 2003, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings..

[19]  Chitra Dorai,et al.  Automatic text extraction from video for content-based annotation and retrieval , 1998, Proceedings. Fourteenth International Conference on Pattern Recognition (Cat. No.98EX170).

[20]  Miin-Shen Yang,et al.  On cluster-wise fuzzy regression analysis , 1997, IEEE Trans. Syst. Man Cybern. Part B.

[21]  Robert E. Schapire,et al.  A Brief Introduction to Boosting , 1999, IJCAI.

[22]  Alain Trémeau,et al.  A region growing and merging algorithm to color segmentation , 1997, Pattern Recognit..

[23]  Datong Chen,et al.  Text enhancement with asymmetric filter for video OCR , 2001, Proceedings 11th International Conference on Image Analysis and Processing.

[24]  Nevenka Dimitrova,et al.  Text detection for video analysis , 1999, Proceedings IEEE Workshop on Content-Based Access of Image and Video Libraries (CBAIVL'99).

[25]  Alan L. Yuille,et al.  Detecting and reading text in natural scenes , 2004, CVPR 2004.

[26]  Taizo Iijima,et al.  A Theory of Character Recognition by Pattern Matching Method , 1974 .

[27]  James C. Bezdek,et al.  On cluster validity for the fuzzy c-means model , 1995, IEEE Trans. Fuzzy Syst..

[28]  S.M. Lucas,et al.  ICDAR 2005 text locating competition results , 2005, Eighth International Conference on Document Analysis and Recognition (ICDAR'05).

[29]  Yoshiyuki Yamashita,et al.  Classification of handprinted Kanji characters by the structured segment matching method , 1983, Pattern Recognit. Lett..

[30]  Yonatan Wexler,et al.  Detecting text in natural scenes with stroke width transform , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[31]  Robert E. Schapire,et al.  The Boosting Approach to Machine Learning An Overview , 2003 .