Edge Based Binarization for Video Text Images

This paper introduces a binarization method based on edge for video text images, especially for images with complex background or low contrast. The binarization method first detects the contour of the text, and utilizes a local thresholding method to decide the inner side of the contour, and then fills up the contour to form characters that are recognizable to OCR software. Experiment results show that our method is especially effective on complex background and low contrast images.

[1]  Xinbo Gao,et al.  A spatial-temporal approach for video caption detection and recognition , 2002, IEEE Trans. Neural Networks.

[2]  Jean-Marc Odobez,et al.  Robust video text segmentation and recognition with multiple hypotheses , 2002, Proceedings. International Conference on Image Processing.

[3]  Larry D. Hostetler,et al.  The estimation of the gradient of a density function, with applications in pattern recognition , 1975, IEEE Trans. Inf. Theory.

[4]  Palaiahnakote Shivakumara,et al.  A Laplacian Method for Video Text Detection , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[5]  David Doermann,et al.  Text enhancement in digital video , 1999, Electronic Imaging.

[6]  Xian-Sheng Hua,et al.  Efficient video text recognition using multiple frame integration , 2002, Proceedings. International Conference on Image Processing.

[7]  N. Otsu A threshold selection method from gray level histograms , 1979 .