Nonlinear model identification and see-through cancelation from recto–verso data

The problem of see-through cancelation in digital images of double-sided documents is addressed. We show that a nonlinear convolutional data model proposed elsewhere for moderate show-through can also be effective on strong back-to-front interferences, provided that the recto and verso pure patterns are estimated jointly. To this end, we propose a restoration algorithm that does not need any classification of the pixels. The see-through PSFs are estimated off-line, and an iterative procedure is then employed for a joint estimation of the pure patterns. This simple and fast algorithm can be used on both grayscale and color images and has proved to be very effective in real-world cases. The experimental results we report in this paper demonstrate that our algorithm outperforms the ones based on linear models with no need to tune free parameters and remains computationally inexpensive despite the nonlinear model and the iterative solution adopted. Strategies to overcome some of the residual difficulties are also envisaged.

[1]  Chew Lim Tan,et al.  Document image enhancement using directional wavelet , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[2]  Anna Tonazzini,et al.  Bleed-Through Removal from Degraded Documents Using a Color Decorrelation Method , 2004, Document Analysis Systems.

[3]  Gaurav Sharma,et al.  Show-through cancellation in scans of duplex printed documents , 2001, IEEE Trans. Image Process..

[4]  Petteri Pajunen,et al.  Nonlinear Blind Source Separation by Self-Organizing Maps , 1996 .

[5]  Anna Tonazzini,et al.  Multichannel Blind Separation and Deconvolution of Images for Document Analysis , 2010, IEEE Transactions on Image Processing.

[6]  Farnood Merrikh-Bayat,et al.  Using Non-Negative Matrix Factorization for Removing Show-Through , 2010, LVA/ICA.

[7]  B. R. Hunt,et al.  Digital Image Restoration , 1977 .

[8]  Aggelos K. Katsaggelos,et al.  Digital image restoration , 2012, IEEE Signal Process. Mag..

[9]  Yannick Deville,et al.  Recurrent networks for separating extractable-target nonlinear mixtures. Part I: Non-blind configurations , 2009, Signal Process..

[10]  Boaz Ophir,et al.  Show-Through Cancellation in Scanned Images using Blind Source Separation Techniques , 2007, 2007 IEEE International Conference on Image Processing.

[11]  Aapo Hyvärinen,et al.  Nonlinear independent component analysis: Existence and uniqueness results , 1999, Neural Networks.

[12]  Anna Tonazzini,et al.  Registration and Enhancement of Double-Sided Degraded Manuscripts Acquired in Multispectral Modality , 2009, 2009 10th International Conference on Document Analysis and Recognition.

[13]  Antti Honkela,et al.  Advances in variational Bayesian nonlinear blind source separation , 2005 .

[14]  David G. Luenberger,et al.  Linear and Nonlinear Programming: Second Edition , 2003 .

[15]  Anna Tonazzini,et al.  Nonlinear model and constrained ML for removing back-to-front interferences from recto-verso documents , 2012, Pattern Recognit..

[16]  R. F. Moghaddam,et al.  Low quality document image modeling and enhancement , 2009, International Journal of Document Analysis and Recognition (IJDAR).

[17]  Farnood Merrikh-Bayat,et al.  Linear-quadratic blind source separating structure for removing show-through in scanned documents , 2011, International Journal on Document Analysis and Recognition (IJDAR).

[18]  Luís B. Almeida,et al.  Separating a Real-Life Nonlinear Image Mixture , 2005, J. Mach. Learn. Res..

[19]  Chew Lim Tan,et al.  Matching of double-sided document images to remove interference , 2001, Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition. CVPR 2001.

[20]  Erkki Oja,et al.  Independent Component Analysis , 2001 .

[21]  Luís B. Almeida,et al.  Wavelet-based separation of nonlinear show-through and bleed-through image mixtures , 2008, Neurocomputing.

[22]  Yannick Deville,et al.  Recurrent networks for separating extractable-target nonlinear mixtures. Part II. Blind configurations , 2013, Signal Process..

[23]  Anna Tonazzini,et al.  Fast correction of bleed-through distortion in grayscale documents by a blind source separation technique , 2007, International Journal of Document Analysis and Recognition (IJDAR).

[24]  Anna Tonazzini,et al.  Independent component analysis for document restoration , 2004, Document Analysis and Recognition.

[25]  Mariana S. C. Almeida,et al.  Separating Nonlinear Image Mixtures using a Physical Model Trained with ICA , 2006, 2006 16th IEEE Signal Processing Society Workshop on Machine Learning for Signal Processing.