RSLDI: Restoration of single-sided low-quality document images

This paper addresses the problem of enhancing and restoring single-sided low-quality single-sided document images. Initially, a series of multi-level classifiers is introduced covering several levels, including the regional and content levels. These classifiers can then be integrated into any enhancement or restoration method to generalize or improve them. Based on these multi-level classifiers, we first propose a novel PDE-based method for the restoration of the degradations in single-sided document images. To reduce the local nature of PDE-based methods, we empower our method with two flow fields to play the role of regional classifiers and help in preserving meaningful pixels. Also, the new method further diffuses the background information by using a content classifier, which provides an efficient and accurate restoration of the degraded backgrounds. The performance of the method is tested on both real samples, from the Google Book Search dataset, UNESCO's Memory of the World Programme, and the Juma Al Majid (Dubai) datasets, and synthesized samples provided by our degradation model. The results are promising. The method-independent nature of the classifiers is illustrated by modifying the ICA method to make it applicable to single-sided documents, and also by providing a Bayesian binarization model.

[1]  Beat Kleiner,et al.  Graphical Methods for Data Analysis , 1983 .

[2]  Eric Dubois,et al.  Joint Compression and Restoration of Documents with Bleed-through , 2005 .

[3]  Ingrid Daubechies,et al.  Variational image restoration by means of wavelets: simultaneous decomposition , 2005 .

[4]  Leo Grady,et al.  Random Walks for Image Segmentation , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[5]  Ji-zhou Sun,et al.  Physical Modeling of "Xuan" Paper in the Simulation of Chinese Ink-Wash Drawing , 2005, CGIV.

[6]  Jean-Michel Morel,et al.  A non-local algorithm for image denoising , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[7]  Chew Lim Tan,et al.  Removal of interfering strokes in double-sided document images , 2000, Proceedings Fifth IEEE Workshop on Applications of Computer Vision.

[8]  Anna Tonazzini,et al.  Fast correction of bleed-through distortion in grayscale documents by a blind source separation technique , 2007, International Journal of Document Analysis and Recognition (IJDAR).

[9]  Jérôme Monteil,et al.  A New Interpretation and improvement of the Nonlinear Anisotropic Diffusion for Image Enhancement , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[10]  Frank Lebourgeois,et al.  Restoring Ink Bleed-Through Degraded Document Images Using a Recursive Unsupervised Classification Technique , 2006, Document Analysis Systems.

[11]  Norishige Chiba,et al.  Simple cellular automaton-based simulation of ink behaviour and its application to suibokuga-like 3d rendering of trees , 1999 .

[12]  Frank Lebourgeois,et al.  OCR Accuracy Improvement through a PDE-Based Approach , 2007, Ninth International Conference on Document Analysis and Recognition (ICDAR 2007).

[13]  Antonin Chambolle,et al.  Nonlinear wavelet image processing: variational problems, compression, and noise removal through wavelet shrinkage , 1998, IEEE Trans. Image Process..

[14]  Anil K. Jain,et al.  Goal-Directed Evaluation of Binarization Methods , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[15]  Hsieh S. Hou,et al.  Digital document processing , 1983 .

[16]  R. Hersch,et al.  Reflectance and transmittance model for recto-verso halftone prints. , 2006, Journal of the Optical Society of America. A, Optics, image science, and vision.

[17]  Eric V. Slud Graphical Models: Methods for Data Analysis and Mining , 2003, Technometrics.

[18]  Michael S. Brown,et al.  User-assisted ink-bleed correction for handwritten documents , 2008, JCDL '08.

[19]  Wang Xiu Graphical Simulator for Chinese Ink-Wash Drawing , 2002 .

[20]  Der-Lor Way,et al.  Physical-based Model of Ink Diffusion in Chinese Paintings , 2003, WSCG.

[21]  Mohamed Cheriet,et al.  Degradation modeling and enhancement of low quality documents , 2008 .

[22]  Thomas S. Huang,et al.  Image processing , 1971 .

[23]  Henry S. Baird,et al.  The State of the Art of Document Image Degradation Modelling , 2007 .

[24]  Hwan-Gue Cho,et al.  Interactive Rendering Technique for Realistic Oriental Painting , 2003, WSCG.

[25]  Boaz Ophir,et al.  Show-Through Cancellation in Scanned Images using Blind Source Separation Techniques , 2007, 2007 IEEE International Conference on Image Processing.

[26]  Rosita Wachenchauzer,et al.  A New and Efficient Algorithm to Binarize Document Images Removing Back-to-Front Interference , 2008, J. Univers. Comput. Sci..

[27]  Katrin Franke,et al.  Ink-deposition model: the relation of writing and ink deposition processes , 2004, Ninth International Workshop on Frontiers in Handwriting Recognition.

[28]  Abdelaziz Abid ‘Memory of the World’: Preserving Our Documentary Heritage , 1997 .

[29]  Ching Y. Suen,et al.  Stroke-model-based character extraction from gray-level document images , 2001, IEEE Trans. Image Process..

[30]  S. Miklavcic,et al.  Revised Kubelka-Munk theory. II. Unified framework for homogeneous and inhomogeneous optical media. , 2004, Journal of The Optical Society of America A-optics Image Science and Vision.

[31]  Gaurav Sharma,et al.  Show-through cancellation in scans of duplex printed documents , 2001, IEEE Trans. Image Process..

[32]  Sun Mei-jun,et al.  Physical modeling of "Xuan" paper in the simulation of Chinese ink-wash drawing , 2005, International Conference on Computer Graphics, Imaging and Visualization (CGIV'05).

[33]  Erkki Oja,et al.  The FastICA Algorithm Revisited: Convergence Analysis , 2006, IEEE Transactions on Neural Networks.

[34]  I. Daubechiesa,et al.  Variational image restoration by means of wavelets : Simultaneous decomposition , deblurring , and denoising , 2005 .

[35]  Venu Govindaraju,et al.  Separating text and background in degraded document images - a comparison of global thresholding techniques for multi-stage thresholding , 2002, Proceedings Eighth International Workshop on Frontiers in Handwriting Recognition.