A New and Efficient Algorithm to Binarize Document Images Removing Back-to-Front Interference

"Back-to-front interference", "bleeding", and "show-through" are the names given to the phenomenon that occurs when a document is written on both sides of translucent paper and the print on one side is visible on the other. Binarizing documents with back-to-front interference using standard algorithms yields unreadable documents. This paper presents a fast entropy-based segmentation method for generating high-quality binarized images of documents with back-to-front interference.
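The abstract does not spell out the method itself. As a rough illustration of what entropy-based thresholding means in general, the sketch below uses a generic maximum-entropy criterion in the style of Kapur et al.: it picks the gray level that maximizes the summed entropies of the two resulting classes. This is a swapped-in textbook technique, not the algorithm of this paper, and it does nothing specific about back-to-front interference. The names `max_entropy_threshold` and `binarize` are hypothetical, and the input is assumed to be a 256-bin grayscale histogram.

```python
import math


def max_entropy_threshold(histogram):
    """Return the gray level t maximizing the summed entropies of the
    classes [0..t] and [t+1..end] (generic Kapur-style criterion,
    assumed here for illustration only)."""
    total = sum(histogram)
    if total == 0:
        raise ValueError("empty histogram")
    probs = [count / total for count in histogram]

    def class_entropy(ps, mass):
        # Entropy of the distribution ps after renormalizing by its mass.
        return -sum((p / mass) * math.log(p / mass) for p in ps if p > 0)

    best_t, best_score = 0, float("-inf")
    cum_count = 0
    for t in range(len(histogram) - 1):
        cum_count += histogram[t]
        if cum_count == 0 or cum_count == total:
            continue  # one of the two classes would be empty
        lower_mass = cum_count / total
        score = (class_entropy(probs[: t + 1], lower_mass)
                 + class_entropy(probs[t + 1:], 1.0 - lower_mass))
        if score > best_score:
            best_t, best_score = t, score
    return best_t


def binarize(pixels, threshold):
    """Map gray levels <= threshold to 0 (ink) and the rest to 255 (paper)."""
    return [[0 if v <= threshold else 255 for v in row] for row in pixels]


if __name__ == "__main__":
    # Toy histogram: a dark cluster (ink) and a bright cluster (paper).
    hist = [0] * 256
    for level in (10, 11, 12, 200, 201, 202):
        hist[level] = 100
    t = max_entropy_threshold(hist)
    print(t, binarize([[11, 201]], t))
```

A single global threshold like this one is the kind of standard approach that tends to struggle when show-through ink is darker than the paper but lighter than the foreground text, which is the case the paper targets.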
