A Novel Method for Embedded Text Segmentation Based on Stroke and Color

In this paper, a novel method for embedded text segmentation is proposed. The basic idea of our method is based on two properties of embedded texts: a) the color of text pixels is subject to gaussian distribution, b) the locaal part and the global part of embedded text shares the same color distribution. Inspired by this two characteristics, we develop a two-step text segmentation approach: in the coarse segmentation step, a 1-D gaussian function is adopted to model the color distribution of text pixels. To get the model parameters, a stroke operator is utilized to extract confident text region, and then a heuristic process is developed to estimate the parameters. The coarse segmentation can be carried out by the color model. In the noise elimination step, a color distribution homogeneity based method with connected omponent analysis is introduced. Preliminary experimental results show that our method performs well on complex background.

[1]  Rainer Lienhart,et al.  Localizing and segmenting text in images and videos , 2002, IEEE Trans. Circuits Syst. Video Technol..

[2]  Wen Gao,et al.  Automatic text segmentation from complex background , 2004, 2004 International Conference on Image Processing, 2004. ICIP '04..

[3]  Anil K. Jain,et al.  Text information extraction in images and video: a survey , 2004, Pattern Recognit..

[4]  Wen Gao,et al.  A hybrid text segmentation approach , 2009, 2009 IEEE International Conference on Multimedia and Expo.

[5]  Jean-Marc Odobez,et al.  Text detection, recognition in images and video frames , 2004, Pattern Recognit..

[6]  Rainer Lienhart,et al.  VIDEO OCR: A SURVEY AND PRACTITIONER'S GUIDE , 2003 .

[7]  N. Otsu A threshold selection method from gray level histograms , 1979 .

[8]  Wayne Niblack,et al.  An introduction to digital image processing , 1986 .

[9]  Weiqiang Wang,et al.  A Robust Text Segmentation Approach in Complex Background Based on Multiple Constraints , 2005, PCM.

[10]  Ching Y. Suen,et al.  Stroke-model-based character extraction from gray-level document images , 2001, IEEE Trans. Image Process..