Conditional random field for text segmentation from images with complex background

Text contained in images and video frames provide important clues for information indexing and retrieval. But it is difficult to segment text from images, especially those images with complex background. This paper presents a new conditional random field approach, in which contextual features are introduced into text segmentation. Local visual information and contextual label information are integrated into a conditional random field by several components. Some components focus on visual image information to predict the category within the image sites, while others focus on contextual label information to determine the patterns within the label field. Integrating contextual label information in conditional random field can effectively resolve local ambiguities and improve text segmentation performance in complex background. The comparing results demonstrate that the proposed method outperforms other methods for text segmentation from complex background.

[1]  Changsong Liu,et al.  Gabor filters-based feature extraction for character recognition , 2005, Pattern Recognit..

[2]  Wayne Niblack,et al.  An introduction to digital image processing , 1986 .

[3]  Anil K. Jain,et al.  Automatic text location in images and video frames , 1998, Proceedings. Fourteenth International Conference on Pattern Recognition (Cat. No.98EX170).

[4]  Jiang Gao,et al.  An adaptive algorithm for text detection from natural scenes , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[5]  Martial Hebert,et al.  Discriminative random fields: a discriminative framework for contextual interaction in classification , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[6]  Andrew McCallum,et al.  Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data , 2001, ICML.

[7]  Yang Wang,et al.  A dynamic conditional random field model for foreground and shadow segmentation , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  Wen Gao,et al.  Automatic text segmentation from complex background , 2004, 2004 International Conference on Image Processing, 2004. ICIP '04..

[9]  Yongdong Zhang,et al.  A Novel Image Text Extraction Method Based on K-Means Clustering , 2008, Seventh IEEE/ACIS International Conference on Computer and Information Science (icis 2008).

[10]  Trevor Darrell,et al.  Conditional Random Fields for Object Recognition , 2004, NIPS.

[11]  R. Zemel,et al.  Multiscale conditional random fields for image labeling , 2004, CVPR 2004.

[12]  Anil K. Jain,et al.  Unsupervised texture segmentation using Gabor filters , 1990, 1990 IEEE International Conference on Systems, Man, and Cybernetics Conference Proceedings.

[13]  Tomer Hertz,et al.  Learning and inferring image segmentations using the GBP typical cut algorithm , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[14]  Bernard Gosselin,et al.  Color text extraction from camera-based images: the impact of the choice of the clustering distance , 2005, Eighth International Conference on Document Analysis and Recognition (ICDAR'05).

[15]  Bernd Freisleben,et al.  Unsupervised Text Segmentation Using Color and Wavelet Features , 2004, CIVR.

[16]  Ellen K. Hughes,et al.  Video OCR for digital news archive , 1998, Proceedings 1998 IEEE International Workshop on Content-Based Access of Image and Video Database.

[17]  Osamu Hasegawa,et al.  Random Field Model for Integration of Local Information and Global Information , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[18]  Jean-Marc Odobez,et al.  Text segmentation and recognition in complex background based on Markov random field , 2002, Object recognition supported by user interaction for service robots.

[19]  Zhuowen Tu,et al.  Auto-context and its application to high-level vision tasks , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[20]  Mark W. Schmidt,et al.  Accelerated training of conditional random fields with stochastic gradient methods , 2006, ICML.

[21]  Martial Hebert,et al.  Discriminative Random Fields , 2006, International Journal of Computer Vision.

[22]  Anat Levin,et al.  Learning to Combine Bottom-Up and Top-Down Segmentation , 2006, ECCV.

[23]  Kevin P. Murphy,et al.  Figure-ground segmentation using a hierarchical conditional random field , 2007, Fourth Canadian Conference on Computer and Robot Vision (CRV '07).

[24]  Martial Hebert,et al.  Discriminative Fields for Modeling Spatial Dependencies in Natural Images , 2003, NIPS.

[25]  Pat Langley,et al.  Editorial: On Machine Learning , 1986, Machine Learning.

[26]  Ping Zhong,et al.  A Multiple Conditional Random Fields Ensemble Model for Urban Area Detection in Remote Sensing Optical Images , 2007, IEEE Transactions on Geoscience and Remote Sensing.

[27]  Yang Wang,et al.  A dynamic conditional random field model for object segmentation in image sequences , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[28]  William T. Freeman,et al.  Understanding belief propagation and its generalizations , 2003 .

[29]  Xilin Chen,et al.  Automatic detection and recognition of signs from natural scenes , 2004, IEEE Transactions on Image Processing.

[30]  N. Otsu A threshold selection method from gray level histograms , 1979 .