Document image quality assessment based on improved gradient magnitude similarity deviation

Digitization of business processes and the use of mobile devices as portable scanner lead to a massive production of document images that is beyond manual handling. In such a scenario, automatic estimation of document image quality is a concern in order to adapt as early as possible document image analysis methods. In this paper, a method for full reference document image quality assessment (DIQA) using mainly foreground information is proposed. In the proposed method, a segmentation technique is employed on a reference document image to approximately separate foreground and background information. Foreground information of the document image are then considered in the form of foreground patches for computing image quality. For each foreground patch, corresponding gradient maps, obtained from the reference and distorted gradient magnitude maps, are used to compute a gradient magnitude similarity map of the patch. Gradient magnitude similarity deviation of the patch is then calculated by the means of standard deviation over all the values in the gradient magnitude similarity map obtained for the patch. An average pooling is finally performed on all the standard deviations obtained for all the foreground patches to obtain the final image quality metric of the distorted document image. To evaluate the proposed method, we used 3 different datasets. The first dataset was a dataset composed of 377 document images of which 29 were reference images and 348 were distorted images. The other datasets were LIVE and CSIQ datasets composed of scene images with MHOS as ground truth. The results obtained from the proposed system are encouraging.

[1]  Eero P. Simoncelli,et al.  Image quality assessment: from error visibility to structural similarity , 2004, IEEE Transactions on Image Processing.

[2]  Haiping Lu,et al.  Distance-reciprocal distortion measure for binary document images , 2004, IEEE Signal Processing Letters.

[3]  Zhou Wang,et al.  Structural Approaches to Image Quality Assessment , 2005 .

[4]  David S. Doermann,et al.  Document Image Quality Assessment: A Brief Survey , 2013, 2013 12th International Conference on Document Analysis and Recognition.

[5]  David Zhang,et al.  FSIM: A Feature Similarity Index for Image Quality Assessment , 2011, IEEE Transactions on Image Processing.

[6]  David S. Doermann,et al.  Unsupervised feature learning framework for no-reference image quality assessment , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[7]  David S. Doermann,et al.  Learning features for predicting OCR accuracy , 2012, Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012).

[8]  Hanghang Tong,et al.  No-reference quality assessment for JPEG2000 compressed images , 2004, 2004 International Conference on Image Processing, 2004. ICIP '04..

[9]  Alireza Alaei,et al.  A new scheme for unconstrained handwritten text-line segmentation , 2011, Pattern Recognit..

[10]  C. Hale,et al.  Human Image Preference and Document Degradation Models , 2007 .

[11]  Lei Zhang,et al.  Gradient Magnitude Similarity Deviation: A Highly Efficient Perceptual Image Quality Index , 2013, IEEE Transactions on Image Processing.

[12]  Gady Agam,et al.  Character-Based Automated Human Perception Quality Assessment in Document Images , 2012, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[13]  David S. Doermann,et al.  Sharpness estimation for document and scene images , 2012, Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012).

[14]  Joydeep Ghosh,et al.  Blind Image Quality Assessment Without Human Training Using Latent Quality Factors , 2012, IEEE Signal Processing Letters.