Learning Surrogate Models of Document Image Quality Metrics for Automated Document Image Processing

Computation of document image quality metrics often depends upon the availability of a ground truth image corresponding to the document. This limits the applicability of quality metrics in applications such as hyperparameter optimization of image processing algorithms that operate on-the-fly on unseen documents. This work proposes the use of surrogate models to learn the behavior of a given document quality metric on existing datasets where ground truth images are available. The trained surrogate model can later be used to predict the metric value on previously unseen document images without requiring access to ground truth images. The surrogate model is empirically evaluated on the Document Image Binarization Competition (DIBCO) and the Handwritten Document Image Binarization Competition (H-DIBCO) datasets.

[1]  Gady Agam,et al.  Character-Based Automated Human Perception Quality Assessment in Document Images , 2012, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[2]  Simon Haykin,et al.  Neural Networks and Learning Machines , 2010 .

[3]  C. Hale,et al.  Human Image Preference and Document Degradation Models , 2007 .

[4]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[5]  Martin T. Hagan,et al.  Neural network design , 1995 .

[6]  Anders Hast,et al.  Automatic Document Image Binarization using Bayesian Optimization , 2017, HIP@ICDAR.

[7]  Yong Liu,et al.  No-reference document image quality assessment based on high order image statistics , 2016, 2016 IEEE International Conference on Image Processing (ICIP).

[8]  Haiping Lu,et al.  Distance-reciprocal distortion measure for binary document images , 2004, IEEE Signal Processing Letters.

[9]  Elisa H. Barney Smith,et al.  Human Image Preference and Document Degradation Models , 2007, Ninth International Conference on Document Analysis and Recognition (ICDAR 2007).

[10]  Jean-Marc Ogier,et al.  Metric-based no-reference quality assessment of heterogeneous document images , 2015, Electronic Imaging.

[11]  J. Platt Sequential Minimal Optimization : A Fast Algorithm for Training Support Vector Machines , 1998 .

[12]  Ioannis Pratikakis,et al.  ICDAR 2013 Document Image Binarization Contest (DIBCO 2013) , 2013, 2013 12th International Conference on Document Analysis and Recognition.

[13]  Ioannis Pratikakis,et al.  ICDAR 2011 Document Image Binarization Contest (DIBCO 2011) , 2011, 2011 International Conference on Document Analysis and Recognition.

[14]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[15]  David S. Doermann,et al.  Sharpness estimation for document and scene images , 2012, Proceedings of the 21st International Conference on Pattern Recognition (ICPR2012).

[16]  Alireza Alaei,et al.  Document Image Quality Assessment Based on Texture Similarity Index , 2016, 2016 12th IAPR Workshop on Document Analysis Systems (DAS).

[17]  Ioannis Pratikakis,et al.  ICDAR 2009 Document Image Binarization Contest (DIBCO 2009) , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[18]  Konstantinos Zagoris,et al.  ICFHR2016 Handwritten Document Image Binarization Contest (H-DIBCO 2016) , 2016, 2016 15th International Conference on Frontiers in Handwriting Recognition (ICFHR).

[19]  Nicholas R. Howe,et al.  Document binarization with automatic parameter tuning , 2013, International Journal on Document Analysis and Recognition (IJDAR).

[20]  Alireza Alaei,et al.  Document image quality assessment based on improved gradient magnitude similarity deviation , 2015, 2015 13th International Conference on Document Analysis and Recognition (ICDAR).

[21]  Le Kang,et al.  A deep learning approach to document image quality assessment , 2014, 2014 IEEE International Conference on Image Processing (ICIP).

[22]  D. Basak,et al.  Support Vector Regression , 2008 .

[23]  Piet Demeester,et al.  A Surrogate Modeling and Adaptive Sampling Toolbox for Computer Based Design , 2010, J. Mach. Learn. Res..

[24]  Basilios Gatos,et al.  A survey of document image word spotting techniques , 2017, Pattern Recognit..

[25]  David S. Doermann,et al.  Document Image Quality Assessment: A Brief Survey , 2013, 2013 12th International Conference on Document Analysis and Recognition.

[26]  Tom Dhaene,et al.  Shape optimization of a cyclone separator using multi-objective surrogate-based optimization , 2016 .

[27]  Ioannis Pratikakis,et al.  H-DIBCO 2010 - Handwritten Document Image Binarization Competition , 2010, 2010 12th International Conference on Frontiers in Handwriting Recognition.

[28]  Ioannis Pratikakis,et al.  ICFHR 2012 Competition on Handwritten Document Image Binarization (H-DIBCO 2012) , 2012, 2012 International Conference on Frontiers in Handwriting Recognition.

[29]  Carl E. Rasmussen,et al.  Gaussian processes for machine learning , 2005, Adaptive computation and machine learning.

[30]  Bogdan Trawinski,et al.  Comparative Analysis of Premises Valuation Models Using KEEL, RapidMiner, and WEKA , 2009, ICCCI.

[31]  Ioannis Pratikakis,et al.  ICFHR2014 Competition on Handwritten Document Image Binarization (H-DIBCO 2014) , 2014, 2014 14th International Conference on Frontiers in Handwriting Recognition.

[32]  Jasper Snoek,et al.  Practical Bayesian Optimization of Machine Learning Algorithms , 2012, NIPS.

[33]  Bernhard Schölkopf,et al.  A tutorial on support vector regression , 2004, Stat. Comput..