DCNet: Noise-Robust Convolutional Neural Networks for Degradation Classification on Ancient Documents

Analysis of degraded ancient documents is challenging due to the severity and combination of degradation present in a single image. Ancient documents also suffer from additional noise during the digitalization process, particularly when digitalization is done using low-specification devices and/or under poor illumination conditions. The noises over the degraded ancient documents certainly cause a troublesome document analysis. In this paper, we propose a new noise-robust convolutional neural network (CNN) architecture for degradation classification of noisy ancient documents, which is called a degradation classification network (DCNet). DCNet was constructed based on the ResNet101, MobileNetV2, and ShuffleNet architectures. Furthermore, we propose a new self-transition layer following DCNet. We trained the DCNet using (1) noise-free document images and (2) heavy-noise (zero mean Gaussian noise (ZMGN) and speckle) document images. Then, we tested the resulted models with document images containing different levels of ZMGN and speckle noise. We compared our results to three CNN benchmarking architectures, namely MobileNet, ShuffleNet, and ResNet101. In general, the proposed architecture performed better than MobileNet, ShuffleNet, ResNet101, and conventional machine learning (support vector machine and random forest), particularly for documents with heavy noise.

[1]  Khairuddin Omar,et al.  Degraded Historical Document Binarization: A Review on Issues, Challenges, Techniques, and Future Directions , 2019, J. Imaging.

[2]  Khairul Munadi,et al.  A database of printed Jawi character image , 2015, 2015 Third International Conference on Image Information Processing (ICIIP).

[3]  Ioannis Pratikakis,et al.  ICDAR 2013 Document Image Binarization Contest (DIBCO 2013) , 2013, 2013 12th International Conference on Document Analysis and Recognition.

[4]  Konstantinos Zagoris,et al.  ICFHR2016 Handwritten Document Image Binarization Contest (H-DIBCO 2016) , 2016, 2016 15th International Conference on Frontiers in Handwriting Recognition (ICFHR).

[5]  Mohamed Cheriet,et al.  Subjective and objective quality assessment of degraded document images , 2017 .

[6]  Anna Tonazzini,et al.  Fast correction of bleed-through distortion in grayscale documents by a blind source separation technique , 2007, International Journal of Document Analysis and Recognition (IJDAR).

[7]  Xiangyu Zhang,et al.  ShuffleNet: An Extremely Efficient Convolutional Neural Network for Mobile Devices , 2017, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[8]  Konstantinos Zagoris,et al.  ICDAR2017 Competition on Document Image Binarization (DIBCO 2017) , 2017, 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR).

[9]  R. F. Moghaddam,et al.  Low quality document image modeling and enhancement , 2009, International Journal of Document Analysis and Recognition (IJDAR).

[10]  Khairul Munadi,et al.  Improved Thresholding Method for Enhancing Jawi Binarization Performance , 2017, 2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR).

[11]  Ioannis Pratikakis,et al.  A combined approach for the binarization of handwritten document images , 2014, Pattern Recognit. Lett..

[12]  Terrance E. Boult,et al.  Long-Range Facial Image Acquisition and Quality , 2009, Handbook of Remote Biometrics.

[13]  Shiguang Shan,et al.  Fully Learnable Group Convolution for Acceleration of Deep Neural Networks , 2019, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[14]  Nello Cristianini,et al.  Kernel Methods for Pattern Analysis , 2003, ICTAI.

[15]  Qiang Chen,et al.  Network In Network , 2013, ICLR.

[16]  Khairul Munadi,et al.  Effective and fast binarization method for combined degradation on ancient documents , 2019, Heliyon.

[17]  Anil K. Jain,et al.  Document Structure and Layout Analysis , 2007 .

[18]  Shahaboddin Shamshirband,et al.  Using SVM-RSM and ELM-RSM Approaches for Optimizing the Production Process of Methyl and Ethyl Esters , 2018, Energies.

[19]  Quoc V. Le,et al.  Searching for Activation Functions , 2018, arXiv.

[20]  Raúl Rojas,et al.  Transition thresholds and transition operators for binarization and edge detection , 2010, Pattern Recognit..

[21]  Quoc V. Le,et al.  Searching for MobileNetV3 , 2019, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[22]  Khairul Munadi,et al.  DHJ: A database of handwritten Jawi for recognition research , 2017, 2017 International Conference on Electrical Engineering and Informatics (ICELTICs).

[23]  Shijian Lu,et al.  Robust Document Image Binarization Technique for Degraded Document Images , 2013, IEEE Transactions on Image Processing.

[24]  Ioannis Pratikakis,et al.  ICDAR 2009 Document Image Binarization Contest (DIBCO 2009) , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[25]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[26]  Thomas F. La Porta,et al.  Modeling the Resource Requirements of Convolutional Neural Networks on Mobile Devices , 2017, ACM Multimedia.

[27]  Bo Chen,et al.  MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications , 2017, ArXiv.

[28]  Ioannis Pratikakis,et al.  H-DIBCO 2010 - Handwritten Document Image Binarization Competition , 2010, 2010 12th International Conference on Frontiers in Handwriting Recognition.

[29]  Rafael Dueire Lins,et al.  Assessing Binarization Techniques for Document Images , 2017, DocEng.

[30]  Mohamed Cheriet,et al.  Blind quality assessment metric and degradation classification for degraded document images , 2019, Signal Process. Image Commun..

[31]  Joonki Paik,et al.  Low-Light Image Enhancement Using Adaptive Digital Pixel Binning , 2015, Sensors.

[32]  Simone Marinai,et al.  Deep Learning for Historical Document Analysis and Recognition—A Survey , 2020, J. Imaging.

[33]  Jian Sun,et al.  Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[34]  K. Munadi,et al.  Degradation Classification on Ancient Document Image Based on Deep Neural Networks , 2020, 2020 3rd International Conference on Information and Communications Technology (ICOIACT).

[35]  Zhuowen Tu,et al.  Aggregated Residual Transformations for Deep Neural Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[36]  Mark Sandler,et al.  MobileNetV2: Inverted Residuals and Linear Bottlenecks , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[37]  Ioannis Pratikakis,et al.  ICFHR2014 Competition on Handwritten Document Image Binarization (H-DIBCO 2014) , 2014, 2014 14th International Conference on Frontiers in Handwriting Recognition.

[38]  Jimmy Ba,et al.  Adam: A Method for Stochastic Optimization , 2014, ICLR.

[39]  James K. Lein,et al.  Fundamentals of Image Processing , 2012 .

[40]  Weiguo Fan,et al.  A new image classification method using CNN transfer learning and web data augmentation , 2018, Expert Syst. Appl..

[41]  S. Esakkirajan,et al.  A microcontroller based machine vision approach for tomato grading and sorting using SVM classifier , 2020, Microprocess. Microsystems.

[42]  Ioannis Pratikakis,et al.  ICFHR 2012 Competition on Handwritten Document Image Binarization (H-DIBCO 2012) , 2012, 2012 International Conference on Frontiers in Handwriting Recognition.

[43]  Xiangyu Zhang,et al.  ShuffleNet V2: Practical Guidelines for Efficient CNN Architecture Design , 2018, ECCV.

[44]  N. Otsu A threshold selection method from gray level histograms , 1979 .

[45]  Khairuddin Omar,et al.  Adaptive Thresholding Methods for Documents Image Binarization , 2011, MCPR.

[46]  Hossein Ziaei Nafchi,et al.  Persian heritage image binarization competition (PHIBC 2012) , 2013, 2013 First Iranian Conference on Pattern Recognition and Image Analysis (PRIA).

[47]  Ioannis Pratikakis,et al.  ICDAR 2011 Document Image Binarization Contest (DIBCO 2011) , 2011, 2011 International Conference on Document Analysis and Recognition.

[48]  Rafael Dueire Lins,et al.  Automatically detecting and classifying noises in document images , 2010, SAC '10.

[49]  Matti Pietikäinen,et al.  Adaptive document image binarization , 2000, Pattern Recognit..

[50]  Sergey Ioffe,et al.  Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift , 2015, ICML.