Evaluation of human perception of degradation in document images

Large degradations in document images impede their readability as well as substantially deteriorating the performance of automated document processing systems. Image quality metrics have been defined to correlate with OCR accuracy. However, this does not always correlate with human perception of image quality. When enhancing document images with the goal of improving readability, it is important to understand human perception of quality. The goal of this work is to evaluate human perception of degradation and correlate it to known degradation parameters and existing image quality metrics. The information captured enables the learning and estimation of human perception of document image quality.

[1]  Ioannis Pratikakis,et al.  Improved document image binarization by using a combination of multiple binarization techniques and adapted edge information , 2008, 2008 19th International Conference on Pattern Recognition.

[2]  S. Sitharama Iyengar,et al.  Automated system for numerically rating document image quality , 1997, Electronic Imaging.

[3]  Ophir Frieder,et al.  Degraded document image enhancement , 2007, Electronic Imaging.

[4]  Ophir Frieder,et al.  Ensemble LUT classification for degraded document enhancement , 2008, Electronic Imaging.

[5]  Satoshi Naoi,et al.  Automatic filter selection using image quality assessment , 2003, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings..

[6]  Patrick Kelly,et al.  Quality assessment and restoration of typewritten document images , 1999, International Journal on Document Analysis and Recognition.

[7]  Tin Kam Ho,et al.  Large-Scale Simulation Studies in Image Pattern Recognition , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[8]  Nadia Bali,et al.  Automatic accurate broken character restoration for patrimonial documents , 2006, International Journal of Document Analysis and Recognition (IJDAR).

[9]  Elisa H. Barney Smith,et al.  Correlating degradation models and image quality metrics , 2008, Electronic Imaging.

[10]  P. Stubberud,et al.  Adaptive image restoration of text images that contain touching or broken characters , 1995, Proceedings of 3rd International Conference on Document Analysis and Recognition.

[11]  Thomas A. Nartker,et al.  Prediction of OCR accuracy using simple image features , 1995, Proceedings of 3rd International Conference on Document Analysis and Recognition.

[12]  Ophir Frieder,et al.  Interactive degraded document enhancement and ground truth generation , 2008, Electronic Imaging.

[13]  Henry S. Baird,et al.  Document image defect models and their uses , 1993, Proceedings of 2nd International Conference on Document Analysis and Recognition (ICDAR '93).

[14]  Henry S. Baird,et al.  Using synthetic data safely in classification , 2009, Electronic Imaging.

[15]  C. Hale,et al.  Human Image Preference and Document Degradation Models , 2007 .

[16]  Apostolos Antonacopoulos,et al.  Flexible Text Recovery from Degraded Typewritten Historical Documents , 2006, 18th International Conference on Pattern Recognition (ICPR'06).

[17]  Bülent Sankur,et al.  Survey over image thresholding techniques and quantitative performance evaluation , 2004, J. Electronic Imaging.

[18]  Ioannis Pratikakis,et al.  Adaptive degraded document image binarization , 2006, Pattern Recognit..