Automatic character detection and segmentation in natural scene images

We present a robust connected-component (CC) based method for automatic detection and segmentation of text in real-scene images. This technique can be applied in robot vision, sign recognition, meeting processing and video indexing. First, a Non-Linear Niblack method (NLNiblack) is proposed to decompose the image into candidate CCs. Then, all these CCs are fed into a cascade of classifiers trained by Adaboost algorithm. Each classifier in the cascade responds to one feature of the CC. Proposed here are 12 novel features which are insensitive to noise, scale, text orientation and text language. The classifier cascade allows non-text CCs of the image to be rapidly discarded while more computation is spent on promising text-like CCs. The CCs passing through the cascade are considered as text components and are used to form the segmentation result. A prototype system was built, with experimental results proving the effectiveness and efficiency of the proposed method.

[1]  Bernard Gosselin,et al.  From Picture to Speech: an Innovative Application for Embedded Environment , 2003 .

[2]  David S. Doermann,et al.  Progress in camera-based document image analysis , 2003, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings..

[3]  Anil K. Jain,et al.  Automatic Caption Localization in Compressed Video , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[4]  Qing Yang,et al.  Feature based visualization of geophysical data , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[5]  Tai-Yun Kim,et al.  Automatic text extraction in digital videos using FFT and neural network , 1999, FUZZ-IEEE'99. 1999 IEEE International Fuzzy Systems. Conference Proceedings (Cat. No.99CH36315).

[6]  C. S. Shin,et al.  Support vector machine-based text detection in digital video , 2000, Neural Networks for Signal Processing X. Proceedings of the 2000 IEEE Signal Processing Society Workshop (Cat. No.00TH8501).

[7]  Lina J. Karam,et al.  Morphological text extraction from images , 2000, IEEE Trans. Image Process..

[8]  James M. Rehg,et al.  Automatic cascade training with perturbation bias , 2004, CVPR 2004.

[9]  David S. Doermann,et al.  Automatic text detection and tracking in digital video , 2000, IEEE Trans. Image Process..

[10]  Paul A. Viola,et al.  Robust Real-Time Face Detection , 2001, International Journal of Computer Vision.

[11]  Kongqiao Wang,et al.  Character location in scene images from digital camera , 2003, Pattern Recognit..

[12]  Ismail Haritaoglu Scene text extraction and translation for handheld devices , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[13]  Lowell L. Winger,et al.  Low-Complexity Character Extraction in Low-Contrast Scene Images , 2000, Int. J. Pattern Recognit. Artif. Intell..

[14]  Majid Mirmehdi,et al.  Finding Text Regions using Localised Statistical Measures , 2000, British Machine Vision Conference.

[15]  Majid Mirmehdi,et al.  Finding Text Regions Using Localised Measures , 2000 .

[16]  Jiang Gao,et al.  An adaptive algorithm for text detection from natural scenes , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.