An inpainting technique based on regularization to remove bleed-through from ancient documents

In the techniques proposed so far to remove bleed-through from digital images of ancient documents, two critical aspects are the identification of the occlusion areas, i.e. those pixels where the bleed-through pattern overlaps with the main fore-ground text, and the in painting of the areas to be removed with a pattern that is in continuity with the surrounding background, often inhomogeneous due to paper texture or noise. In this paper we propose a new method for bleed-through removal that aims at solving both the aforementioned issues. The method first exploits information from the accurately registered images of the manuscript recto and verso to locate, in each side, the pixels corresponding to the interfering text, no matter if they are pure bleed-through or occlusion pixels. Then, processing separately the two sides, the identified areas are filled in by interpolating, through a suitable regularization model, the surrounding regions. We show the promising results obtained with this method on manuscripts affected by a very strong bleed-through.

[1]  Anna Tonazzini,et al.  Nonlinear model and constrained ML for removing back-to-front interferences from recto-verso documents , 2012, Pattern Recognit..

[2]  Mohamed Cheriet,et al.  A Variational Approach to Degraded Document Enhancement , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3]  Anna Tonazzini,et al.  A non-stationary density model to separate overlapped texts in degraded documents , 2015, Signal Image Video Process..

[4]  Anil C. Kokaram,et al.  Bleed-through removal in degraded documents , 2012, Electronic Imaging.

[5]  Michael S. Brown,et al.  User-Assisted Ink-Bleed Reduction , 2010, IEEE Transactions on Image Processing.

[6]  Anna Tonazzini,et al.  Fast correction of bleed-through distortion in grayscale documents by a blind source separation technique , 2007, International Journal of Document Analysis and Recognition (IJDAR).

[7]  Anil C. Kokaram,et al.  A Non-parametric Framework for Document Bleed-through Removal , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[8]  Anna Tonazzini,et al.  Nonlinear model identification and see-through cancelation from recto–verso data , 2012, International Journal on Document Analysis and Recognition (IJDAR).

[9]  Andrew Blake,et al.  Visual Reconstruction , 1987, Deep Learning for EEG-Based Brain–Computer Interfaces.

[10]  Anna Tonazzini,et al.  Digital restoration of ancient color manuscripts from geometrically misaligned recto-verso pairs , 2016 .

[11]  Boaz Ophir,et al.  Show-Through Cancellation in Scanned Images using Blind Source Separation Techniques , 2007, 2007 IEEE International Conference on Image Processing.

[12]  Michael S. Brown,et al.  Accurate Alignment of Double-Sided Manuscripts for Bleed-Through Removal , 2008, 2008 The Eighth IAPR International Workshop on Document Analysis Systems.

[13]  Farnood Merrikh-Bayat,et al.  Using Non-Negative Matrix Factorization for Removing Show-Through , 2010, LVA/ICA.