A Weighted Hybrid Thresholding Approach for Text Binarization

Text extraction in real images taken in unconstrained environments remains surprisingly challenging in Computer Vision due to language characteristics, complex backgrounds and the text color. Extraction of text and caption from images and videos is important and in great demand for video retrieval, annotation, indexing and content analysis. In this paper we propose a weighted hybrid thresholding approach. It is demonstrated that the proposed method achieved reasonable accuracy of the text extraction for moderately difficult examples.

[1]  D. Manjula,et al.  Sliding window approach based Text Binarisation from Complex Textual images , 2010, ArXiv.

[2]  Jean-Michel Jolion,et al.  Text localization, enhancement and binarization in multimedia documents , 2002, Object recognition supported by user interaction for service robots.

[3]  Nikolaos Ntogas,et al.  A binarization algorithm for historical manuscripts , 2008, ICC 2008.

[4]  Alain Trémeau,et al.  Extreme value theory based text binarization in documents and natural scenes , 2010 .

[5]  Chew Lim Tan,et al.  Edge Based Binarization for Video Text Images , 2010, 2010 20th International Conference on Pattern Recognition.

[6]  David S. Doermann,et al.  Binarization of low quality text using a Markov random field model , 2002, Object recognition supported by user interaction for service robots.

[7]  Bernard Gosselin,et al.  Color text extraction from camera-based images: the impact of the choice of the clustering distance , 2005, Eighth International Conference on Document Analysis and Recognition (ICDAR'05).

[8]  Yaakov Navon Layer-based binarization for textual images , 2008, 2008 19th International Conference on Pattern Recognition.

[9]  Ioannis Pratikakis,et al.  An Objective Evaluation Methodology for Document Image Binarization Techniques , 2008, 2008 The Eighth IAPR International Workshop on Document Analysis Systems.

[10]  Li Linlin,et al.  Edge Based Binarization for Video Text Images , 2010, ICPR 2010.

[11]  Matti Pietikäinen,et al.  Adaptive document binarization , 1997, Proceedings of the Fourth International Conference on Document Analysis and Recognition.

[12]  Jong-Hyun Park,et al.  Binarization of Text Region based on Fuzzy Clustering and Histogram Distribution in Signboards , 2008 .

[13]  C. V. Jawahar,et al.  An MRF Model for Binarization of Natural Scene Text , 2011, 2011 International Conference on Document Analysis and Recognition.

[14]  Jorge Stolfi,et al.  Text detection and recognition in urban scenes , 2011, 2011 IEEE International Conference on Computer Vision Workshops (ICCV Workshops).