FAIR: A Fast Algorithm for Document Image Restoration

This paper presents FAIR, a fast algorithm for document image restoration. The algorithm has been submitted to several contests, where it performed well compared with the state of the art. The method is also scale invariant and fast enough for real-time applications. It is based on a double-threshold edge detection approach that detects small details while remaining robust to noise. We evaluate the proposed method on several types of degraded document images that exhibit considerable background noise or variations in contrast and illumination.
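The abstract does not spell out the FAIR pipeline, so the following is only a minimal sketch of the general double-threshold (hysteresis) idea it refers to, in the style of the Canny detector, and not the authors' implementation. The function name `double_threshold`, the parameters `low` and `high`, and the toy `strength` array are all assumptions made for illustration. Pixels at or above `high` are accepted outright; pixels at or above `low` are accepted only if they are 8-connected to an accepted pixel, which is how faint details survive while isolated noise is discarded.

```python
from collections import deque

import numpy as np


def double_threshold(strength, low, high):
    """Generic hysteresis thresholding (illustrative, not the FAIR code).

    Keeps pixels with strength >= high, plus pixels with strength >= low
    that are 8-connected to an already kept pixel.
    """
    strength = np.asarray(strength, dtype=float)
    strong = strength >= high        # accepted unconditionally
    candidate = strength >= low      # accepted only if linked to a strong pixel
    keep = strong.copy()
    rows, cols = strength.shape

    # Breadth-first growth from every strong pixel through candidate pixels.
    queue = deque(zip(*np.nonzero(strong)))
    while queue:
        r, c = queue.popleft()
        for dr in (-1, 0, 1):
            for dc in (-1, 0, 1):
                nr, nc = r + dr, c + dc
                inside = 0 <= nr < rows and 0 <= nc < cols
                if inside and candidate[nr, nc] and not keep[nr, nc]:
                    keep[nr, nc] = True
                    queue.append((nr, nc))
    return keep


# Toy edge-strength map: a fading stroke (9, 5, 4) and one isolated weak response (5).
strength = np.array([
    [0, 0, 0, 0, 0, 0],
    [0, 9, 5, 4, 0, 0],
    [0, 0, 0, 0, 0, 5],
    [0, 0, 0, 0, 0, 0],
])
edges = double_threshold(strength, low=3, high=8)
```

In this toy input the weak values 5 and 4 are kept because they chain back to the strong value 9, while the isolated 5 in the third row is rejected even though it also exceeds `low`. A single threshold could not do both: set at 8 it would lose the faint part of the stroke, and set at 3 it would admit the isolated response.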
