Degradation enhancement for the captured document image using retinex theory

The state-of-arts global thresholding techniques are fast and efficient to convert the gray scale document image into a binary image. However, they are unsuitable for complex and degraded documents. Moreover, global thresholding techniques produce border noise when the illumination of the document is not uniform. Other methods that depend on local thresholding techniques are efficient in the case of degraded document images, but have common disadvantages include the dependence on the region size and the image characteristics, and the computational time. In this paper we propose a method to overcome the limitations of the related global and local threshold techniques by using the concept of Retinex theory based on Median filter which can effectively enhance the degraded and poor quality document image. High quality results in terms of visual criteria and OCR performance is produced compared to the previous works.

[1]  Nam Ik Cho,et al.  2009 10th International Conference on Document Analysis and Recognition Feature Based Binarization of Document Images Degraded by Uneven Light Condition , 2022 .

[2]  Dayang Rohaya,et al.  Document image skew detection and correction method based on extreme points , 2014, 2014 International Conference on Computer and Information Sciences (ICCOINS).

[3]  Rae-Hong Park,et al.  Document image binarization based on topographic analysis using a water flow model , 2002, Pattern Recognit..

[4]  Edwin Herbert Land,et al.  The Retinex Theory of Color Vision SCIENTIFIC AMERICAN , 2009 .

[5]  Lawrence O'Gorman Binarization and Multithresholding of Document Images Using Connectivity , 1994, CVGIP Graph. Model. Image Process..

[6]  Anil K. Jain,et al.  Segmentation of document images , 1989, SMC.

[7]  Shijian Lu,et al.  Binarization of historical document images using the local maximum and minimum , 2010, DAS '10.

[8]  Wayne Niblack,et al.  An introduction to digital image processing , 1986 .

[9]  Raúl Rojas,et al.  Local Contrast Segmentation to Binarize Images , 2009, 2009 Third International Conference on Digital Society.

[10]  Soumen Kanrar,et al.  Enhancement of Image Resolution by Binarization , 2010, ArXiv.

[11]  Sung-Il Chien,et al.  An improved binarization algorithm based on a water flow model for document image with inhomogeneous backgrounds , 2005, Pattern Recognit..

[12]  Thomas M. Breuel,et al.  Efficient implementation of local adaptive thresholding techniques using integral images , 2008, Electronic Imaging.

[13]  N. Otsu A threshold selection method from gray level histograms , 1979 .

[14]  Ching Y. Suen,et al.  Ternary Entropy-Based Binarization of Degraded Document Images Using Morphological Operators , 2011, 2011 International Conference on Document Analysis and Recognition.

[15]  Chien-Hsing Chou,et al.  A binarization method with learning-built rules for document images produced by cameras , 2010, Pattern Recognit..

[16]  A. Abutaleb,et al.  Automatic Thresholding Of Gray-Level Pictures Using 2-D Entropy , 1988, Optics & Photonics.

[17]  Shijian Lu,et al.  Document image binarization using background estimation and stroke edges , 2010, International Journal on Document Analysis and Recognition (IJDAR).

[18]  R. Kohler A segmentation system based on thresholding , 1981 .

[19]  Sudipta Roy,et al.  A New Local Adaptive Thresholding Technique in Binarization , 2012, ArXiv.

[20]  Ioannis Pratikakis,et al.  Adaptive degraded document image binarization , 2006, Pattern Recognit..

[21]  Venu Govindaraju,et al.  Binarization of camera-captured document using A MAP approach , 2011, Electronic Imaging.

[22]  E. Land The retinex theory of color vision. , 1977, Scientific American.

[23]  Matti Pietikäinen,et al.  Adaptive document image binarization , 2000, Pattern Recognit..

[24]  Hong Yan,et al.  An adaptive logical method for binarization of degraded document images , 2000, Pattern Recognit..

[25]  Matti Pietikäinen,et al.  Adaptive document binarization , 1997, Proceedings of the Fourth International Conference on Document Analysis and Recognition.

[26]  Naoki Tanaka,et al.  Robust extraction of characters from color scene image using mathematical morphology , 1998, Proceedings. Fourteenth International Conference on Pattern Recognition (Cat. No.98EX170).

[27]  Azriel Rosenfeld,et al.  Threshold Evaluation Techniques , 1978, IEEE Transactions on Systems, Man, and Cybernetics.

[28]  Yan Solihin,et al.  Integral Ratio: A New Class of Global Thresholding Techniques for Handwriting Images , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[29]  Robert M. Haralick,et al.  Automatic multithreshold selection , 1984, Comput. Vis. Graph. Image Process..

[30]  I. Faye,et al.  Fast and efficient document image clean up and binarization based on retinex theory , 2013, 2013 IEEE 9th International Colloquium on Signal Processing and its Applications.

[31]  Dayang Rohaya,et al.  Border Noise Removal and Clean Up Based on Retinex Theory , 2013, DaEng.