Influence of Color-to-Gray Conversion on the Performance of Document Image Binarization: Toward a Novel Optimization Problem

This paper presents a novel preprocessing method of color-to-gray document image conversion. In contrast to the conventional methods designed for natural images that aim to preserve the contrast between different classes in the converted gray image, the proposed conversion method reduces as much as possible the contrast (i.e., intensity variance) within the text class. It is based on learning a linear filter from a predefined data set of text and background pixels that: 1) when applied to background pixels, minimizes the output response and 2) when applied to text pixels, maximizes the output response, while minimizing the intensity variance within the text class. Our proposed method (called learning-based color-to-gray) is conceived to be used as preprocessing for document image binarization. A data set of 46 historical document images is created and used to evaluate subjectively and objectively the proposed method. The method demonstrates drastically its effectiveness and impact on the performance of state-of-the-art binarization methods. Four other Web-based image data sets are created to evaluate the scalability of the proposed method.

[1]  Syed Saqib Bukhari,et al.  Adaptive Binarization of Unconstrained Hand-Held Camera-Captured Document Images , 2009, J. Univers. Comput. Sci..

[2]  N. Otsu A threshold selection method from gray level histograms , 1979 .

[3]  Ioannis Pratikakis,et al.  Improved document image binarization by using a combination of multiple binarization techniques and adapted edge information , 2008, 2008 19th International Conference on Pattern Recognition.

[4]  Jean-Michel Jolion,et al.  Text localization, enhancement and binarization in multimedia documents , 2002, Object recognition supported by user interaction for service robots.

[5]  Frédéric Bouchara,et al.  FAIR: A Fast Algorithm for Document Image Restoration , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[6]  Shijian Lu,et al.  Binarization of historical document images using the local maximum and minimum , 2010, DAS '10.

[7]  Anil K. Jain,et al.  Goal-Directed Evaluation of Binarization Methods , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[8]  Shijian Lu,et al.  Document image binarization using background estimation and stroke edges , 2010, International Journal on Document Analysis and Recognition (IJDAR).

[9]  Mohamed Cheriet,et al.  Phase-Based Binarization of Ancient Document Images: Model and Applications , 2014, IEEE Transactions on Image Processing.

[10]  Mohamed Cheriet,et al.  A multi-scale framework for adaptive binarization of degraded document images , 2010, Pattern Recognit..

[11]  Chew Lim Tan,et al.  Directional wavelet approach to remove document image interference , 2003, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings..

[12]  Ioannis Pratikakis,et al.  ICFHR 2012 Competition on Handwritten Document Image Binarization (H-DIBCO 2012) , 2012, 2012 International Conference on Frontiers in Handwriting Recognition.

[13]  Mohamed Cheriet,et al.  Text extraction from degraded document images , 2010, 2010 2nd European Workshop on Visual Information Processing (EUVIP).

[14]  Reiner Eschbach,et al.  Spatial Color-to-Grayscale Transform Preserving Chrominance Edge Information , 2004, CIC.

[15]  Guy Lapalme,et al.  A systematic analysis of performance measures for classification tasks , 2009, Inf. Process. Manag..

[16]  Adel M. Alimi,et al.  A Methodology for the Separation of Foreground/Background in Arabic Historical Manuscripts using Hybrid Methods , 2008, J. Univers. Comput. Sci..

[17]  Raúl Rojas,et al.  An analysis of the transition proportion for binarization in handwritten historical documents , 2014, Pattern Recognit..

[18]  Neil A. Dodgson,et al.  Decolorize: Fast, contrast enhancing, color to grayscale conversion , 2007, Pattern Recognit..

[19]  Venu Govindaraju,et al.  Separating text and background in degraded document images - a comparison of global thresholding techniques for multi-stage thresholding , 2002, Proceedings Eighth International Workshop on Frontiers in Handwriting Recognition.

[20]  Ioannis Pratikakis,et al.  ICDAR 2009 Document Image Binarization Contest (DIBCO 2009) , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[21]  Wen-Hsiang Tsai,et al.  Moment-preserving thresolding: A new approach , 1985, Comput. Vis. Graph. Image Process..

[22]  Christian Wolf,et al.  Document Ink Bleed-Through Removal with Two Hidden Markov Random Fields and a Single Observation Field , 2010, IEEE Trans. Pattern Anal. Mach. Intell..

[23]  Ahmed S. Abutableb Automatic thresholding of gray-level pictures using two-dimensional entropy , 1989 .

[24]  Karol Myszkowski,et al.  Apparent Greyscale: A Simple and Fast Conversion to Perceptually Accurate Images and Video , 2008, Comput. Graph. Forum.

[25]  Wayne Niblack,et al.  An introduction to digital image processing , 1986 .

[26]  Bruce Gooch,et al.  Color2Gray: salience-preserving color removal , 2005, SIGGRAPH 2005.

[27]  Wen-Hsiang Tsai,et al.  Moment-preserving thresholding: a new approach , 1995 .

[28]  Paolo Cignoni,et al.  Machine Vision and Applications Manuscript No , 2022 .

[29]  Mohamed Cheriet,et al.  A spatially adaptive statistical method for the binarization of historical manuscripts and degraded document images , 2011, Pattern Recognit..

[30]  Matti Pietikäinen,et al.  Adaptive document image binarization , 2000, Pattern Recognit..

[31]  Mohamed Cheriet,et al.  AdOtsu: An adaptive and parameterless generalization of Otsu's method for document image binarization , 2012, Pattern Recognit..

[32]  Ioannis Pratikakis,et al.  Adaptive degraded document image binarization , 2006, Pattern Recognit..

[33]  Mohamed Cheriet,et al.  A local linear level set method for the binarization of degraded historical document images , 2012, International Journal on Document Analysis and Recognition (IJDAR).

[34]  Chew Lim Tan,et al.  Binarization of Badly Illuminated Document Images through Shading Estimation and Compensation , 2007 .

[35]  V.,et al.  A Spatial Thresholding Method for Image Segmentation , 2022 .

[36]  Haiping Lu,et al.  Distance-reciprocal distortion measure for binary document images , 2004, IEEE Signal Processing Letters.

[37]  Ioannis Pratikakis,et al.  ICDAR 2011 Document Image Binarization Contest (DIBCO 2011) , 2011, 2011 International Conference on Document Analysis and Recognition.

[38]  Frank Lebourgeois,et al.  Restoring Ink Bleed-Through Degraded Document Images Using a Recursive Unsupervised Classification Technique , 2006, Document Analysis Systems.

[39]  A. Papandreou,et al.  ICDAR 2013 Competition on Writer Identification , 2013, 2013 12th International Conference on Document Analysis and Recognition.