A spatially adaptive statistical method for the binarization of historical manuscripts and degraded document images

In this paper, we present an adaptive method for the binarization of historical manuscripts and degraded document images. The proposed approach is based on maximum likelihood (ML) classification and uses a priori information and the spatial relationship on the image domain. In contrast with many conventional methods that use a decision based on thresholding, the proposed method performs a soft decision based on a probabilistic model. The main idea is that, from an initialization map (under-binarization) containing only the darkest part of the text, the method is able to recover the main text in the document image, including low-intensity and weak strokes. To do so, fast and robust local estimation of text and background features is obtained using grid-based modeling and inpainting techniques; then, the ML classification is performed to classify pixels into black and white classes. The advantage of the proposed method is that it preserves weak connections and provides smooth and continuous strokes, thanks to its correlation-based nature. Performance is evaluated both subjectively and objectively against standard databases. The proposed method outperforms the state-of-the-art methods presented in the DIBCO'09 binarization contest, although those other methods provide performance close to it.

[1]  R. F. Moghaddam,et al.  Low quality document image modeling and enhancement , 2009, International Journal of Document Analysis and Recognition (IJDAR).

[2]  Guillermo Sapiro,et al.  Image inpainting , 2000, SIGGRAPH.

[3]  Fadoua Drira Contribution à la restauration des images de documents anciens , 2007 .

[4]  Ingemar J. Cox,et al.  Predicting and Estimating the Accuracy of a Subpixel Registration Algorithm , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[5]  Maria Luísa Carvalho,et al.  Application of spectroscopic techniques for the study of paper documents: A survey ☆ , 2009 .

[6]  Mohamed Cheriet,et al.  RSLDI: Restoration of single-sided low-quality document images , 2009, Pattern Recognit..

[7]  Ioannis Pratikakis,et al.  Improved document image binarization by using a combination of multiple binarization techniques and adapted edge information , 2008, 2008 19th International Conference on Pattern Recognition.

[8]  Ioannis Pratikakis,et al.  ICDAR 2009 Document Image Binarization Contest (DIBCO 2009) , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[9]  Mohamed Cheriet,et al.  Restoration and Segmentation of Highly Degraded Characters Using a Shape-Independent Level Set Approach and Multi-level Classifiers , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[10]  Matti Pietikäinen,et al.  Adaptive document image binarization , 2000, Pattern Recognit..

[11]  Rae-Hong Park,et al.  Document image binarization based on topographic analysis using a water flow model , 2002, Pattern Recognit..

[12]  Ali Mohammad-Djafari,et al.  Bayesian separation of document images with hidden markov model , 2007, VISAPP.

[13]  Thomas M. Breuel,et al.  Efficient implementation of local adaptive thresholding techniques using integral images , 2008, Electronic Imaging.

[14]  R. Deriche,et al.  Les EDP en traitement des images et vision par ordinateur , 1995 .

[15]  N. Otsu A threshold selection method from gray level histograms , 1979 .

[16]  Donald Geman,et al.  Stochastic Relaxation, Gibbs Distributions, and the Bayesian Restoration of Images , 1984, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[17]  Luciano da Fontoura Costa,et al.  2D Euclidean distance transform algorithms: A comparative survey , 2008, CSUR.

[18]  Todd R. Golub,et al.  BRAF mutation predicts sensitivity to MEK inhibition , 2006, Nature.

[19]  Christian Wolf,et al.  Document Ink Bleed-Through Removal with Two Hidden Markov Random Fields and a Single Observation Field , 2010, IEEE Trans. Pattern Anal. Mach. Intell..

[20]  Mohamed Cheriet,et al.  Application of Multi-Level Classifiers and Clustering for Automatic Word Spotting in Historical Document Images , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[21]  Mohamed Cheriet,et al.  A multi-scale framework for adaptive binarization of degraded document images , 2010, Pattern Recognit..

[22]  Cedric Nishan Canagarajah,et al.  A gaze prediction technique for open signed video content using a track before detect algorithm , 2008, 2008 15th IEEE International Conference on Image Processing.

[23]  Chew Lim Tan,et al.  Removal of interfering strokes in double-sided document images , 2000, Proceedings Fifth IEEE Workshop on Applications of Computer Vision.

[24]  Sankaran Mahadevan,et al.  Bayesian risk-based decision method for model validation under uncertainty , 2007, Reliab. Eng. Syst. Saf..

[25]  Amar Mitiche,et al.  Bayes Classification of Online Arabic Characters by Gibbs Modeling of Class Conditional Densities , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[26]  Rosita Wachenchauzer,et al.  A New and Efficient Algorithm to Binarize Document Images Removing Back-to-Front Interference , 2008, J. Univers. Comput. Sci..

[27]  Ineke Joosten,et al.  Applications of microanalysis in the cultural heritage field , 2008 .

[28]  Bhabatosh Chanda,et al.  Multiscale morphological segmentation of gray-scale images , 2003, IEEE Trans. Image Process..

[29]  Frédéric Bouchara,et al.  Document Image Binarisation Using Markov Field Model , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[30]  Robert Sablatnig,et al.  Higher order MRF for foreground-background separation in multi-spectral images of historical manuscripts , 2010, DAS '10.

[31]  Nam Ik Cho,et al.  MAP-MRF approach for binarization of degraded document image , 2008, 2008 15th IEEE International Conference on Image Processing.

[32]  David G. Stork,et al.  Pattern Classification , 1973 .

[33]  Shijian Lu,et al.  Binarization of historical document images using the local maximum and minimum , 2010, DAS '10.

[34]  John A. Richards,et al.  Efficient maximum likelihood classification for imaging spectrometer data sets , 1994, IEEE Trans. Geosci. Remote. Sens..

[35]  Anna Tonazzini,et al.  Fast correction of bleed-through distortion in grayscale documents by a blind source separation technique , 2007, International Journal of Document Analysis and Recognition (IJDAR).

[36]  Apostolos Antonacopoulos,et al.  Special issue on the analysis of historical documents , 2007, International Journal of Document Analysis and Recognition (IJDAR).

[37]  Pheng-Ann Heng,et al.  A double-threshold image binarization method based on edge detector , 2008, Pattern Recognit..

[38]  Bülent Sankur,et al.  Survey over image thresholding techniques and quantitative performance evaluation , 2004, J. Electronic Imaging.

[39]  Ioannis Pratikakis,et al.  Adaptive degraded document image binarization , 2006, Pattern Recognit..

[40]  Ehud Rivlin,et al.  A SPECIAL SKELETONIZATION ALGORITHM FOR CURSIVE WORDS , 2004 .

[41]  Ching Y. Suen,et al.  A recursive thresholding technique for image segmentation , 1998, IEEE Trans. Image Process..

[42]  Kenji Nakayama,et al.  Handwritten alphabet and digit character recognition using skeleton pattern mapping with structural constraints , 1993 .

[43]  Giovanni Soda,et al.  Artificial neural networks for document analysis and recognition , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.