Document blur detection using edge profile mining

We present an algorithm for automatic blur detection of document images using a novel approach based on edge intensity profiles. Our main insight is that the edge profiles are a strong indicator of the blur present in the image, with steep profiles implying sharper regions and gradual profiles implying blurred regions. Our approach first retrieves the profiles for each point of intensity transition (each edge point) along the gradient and then uses them to output a quantitative measure indicating the extent of blur in the input image. The real time performance of the proposed approach makes it suitable for most applications. Additionally, our method works for both hand written and digital documents and is agnostic to the font types and sizes, which gives it a major advantage over the currently prevalent learning based approaches. Extensive quantitative and qualitative experiments over two different datasets show that our method outperforms almost all algorithms in current state of the art by a significant margin, especially in cross dataset experiments.

[1]  Jean-Marc Ogier,et al.  Combining Focus Measure Operators to Predict OCR Accuracy in Mobile-Captured Document Images , 2014, 2014 11th IAPR International Workshop on Document Analysis Systems.

[2]  Lina J. Karam,et al.  A No-Reference Objective Image Sharpness Metric Based on the Notion of Just Noticeable Blur (JNB) , 2009, IEEE Transactions on Image Processing.

[3]  David S. Doermann,et al.  Document Image Quality Assessment: A Brief Survey , 2013, 2013 12th International Conference on Document Analysis and Recognition.

[4]  Xujun Peng,et al.  Automated image quality assessment for camera-captured OCR , 2011, 2011 18th IEEE International Conference on Image Processing.

[5]  Yonatan Wexler,et al.  Detecting text in natural scenes with stroke width transform , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[6]  R. Smith,et al.  An Overview of the Tesseract OCR Engine , 2007, Ninth International Conference on Document Analysis and Recognition (ICDAR 2007).

[7]  Guigang Zhang,et al.  Deep Learning , 2016, Int. J. Semantic Comput..

[8]  David S. Doermann,et al.  Learning features for predicting OCR accuracy , 2012, Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012).

[9]  David S. Doermann,et al.  A Dataset for Quality Assessment of Camera Captured Document Images , 2013, CBDAR.

[10]  Stephen V. Rice,et al.  The Fourth Annual Test of OCR Accuracy , 1995 .

[11]  Francine Chen,et al.  SmartDCap: semi-automatic capture of higher quality document images from a smartphone , 2013, IUI '13.

[12]  Li Xu,et al.  Discriminative Blur Detection Features , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[13]  Thomas A. Nartker,et al.  Prediction of OCR accuracy using simple image features , 1995, Proceedings of 3rd International Conference on Document Analysis and Recognition.

[14]  Alan C. Bovik,et al.  Blind Image Quality Assessment: From Natural Scene Statistics to Perceptual Quality , 2011, IEEE Transactions on Image Processing.

[15]  David S. Doermann,et al.  Sharpness estimation for document and scene images , 2012, Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012).

[16]  Alan C. Bovik,et al.  No-Reference Image Quality Assessment in the Spatial Domain , 2012, IEEE Transactions on Image Processing.

[17]  Lina J. Karam,et al.  A No-Reference Image Blur Metric Based on the Cumulative Probability of Blur Detection (CPBD) , 2011, IEEE Transactions on Image Processing.

[18]  Zhou Wang,et al.  No-reference image sharpness assessment based on local phase coherence measurement , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[19]  David S. Doermann,et al.  Unsupervised feature learning framework for no-reference image quality assessment , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[20]  Le Kang,et al.  A deep learning approach to document image quality assessment , 2014, 2014 IEEE International Conference on Image Processing (ICIP).

[21]  David S. Doermann,et al.  Real-Time No-Reference Image Quality Assessment Based on Filter Learning , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.