Layer-based binarization for textual images

We developed a binarization approach to handle a large variety of images, from scanned flatbed images to images acquired by mobile phone cameras. The binarization is targeted at creating layers of binary images for processing by OCR engines. The layers are classified spatially and by intensity and color. First textual pixels are classified by a text operator. The text kernel is then segmented by intensity/color levels and layout analysis techniques to create regions of similar text. Finally, adaptive binarization is applied to each region to obtain superior binary images. Our experimental results show the advantages of our method over local binarization methods.

[1]  N. Otsu A threshold selection method from gray level histograms , 1979 .

[2]  Matti Pietikäinen,et al.  Adaptive document image binarization , 2000, Pattern Recognit..

[3]  A.W.M. Smeulders,et al.  An introduction to image processing , 1991 .

[4]  André Marion,et al.  Introduction to Image Processing , 1990, Springer US.

[5]  Josef Kittler,et al.  Minimum error thresholding , 1986, Pattern Recognit..

[6]  Dorin Comaniciu,et al.  Mean shift analysis and applications , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.