Sparsity-based edge noise removal from bilevel graphical document images

This paper presents a new method to remove edge noise from graphical document images using geometrical regularities of the graphics contours that exist in the images. Denoising is understood as a recovery problem and is accomplished by employing a sparse representation framework in the form of a basis pursuit denoising algorithm. Directional information of the graphics contours is encoded by atoms in an overcomplete dictionary which is designed to match the input data. The optimal precision parameter used in this framework is shown to have a linear relationship with the level of the noise that exists in the image. Experimental results show the superiority of the proposed method over existing ones in terms of image recovery and contour raggedness.

[1]  M. Elad,et al.  $rm K$-SVD: An Algorithm for Designing Overcomplete Dictionaries for Sparse Representation , 2006, IEEE Transactions on Signal Processing.

[2]  L. O'Gorman Image and document processing techniques for the RightPages electronic library system , 1992, Proceedings., 11th IAPR International Conference on Pattern Recognition. Vol.II. Conference B: Pattern Recognition Methodology and Systems.

[3]  Edward H. Adelson,et al.  The Design and Use of Steerable Filters , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[4]  S. Mallat,et al.  Adaptive greedy approximations , 1997 .

[5]  Michael Elad,et al.  Sparse and Redundant Representations - From Theory to Applications in Signal and Image Processing , 2010 .

[6]  Emmanuel J. Candès,et al.  The curvelet transform for image denoising , 2002, IEEE Trans. Image Process..

[7]  Salvatore Tabbone,et al.  Text extraction from graphical document images using sparse representation , 2010, DAS '10.

[8]  Marc Teboulle,et al.  A Fast Iterative Shrinkage-Thresholding Algorithm for Linear Inverse Problems , 2009, SIAM J. Imaging Sci..

[9]  Henry S. Baird,et al.  Document image defect models , 1995 .

[10]  Bhaskar D. Rao,et al.  Sparse Bayesian learning for basis selection , 2004, IEEE Transactions on Signal Processing.

[11]  Minh N. Do,et al.  The finite ridgelet transform for image representation , 2003, IEEE Trans. Image Process..

[12]  T. Chan,et al.  Edge-preserving and scale-dependent properties of total variation regularization , 2003 .

[13]  A. Bruckstein,et al.  K-SVD : An Algorithm for Designing of Overcomplete Dictionaries for Sparse Representation , 2005 .

[14]  Maarten Jansen,et al.  Noise Reduction by Wavelet Thresholding , 2001 .

[15]  David J. Field,et al.  Emergence of simple-cell receptive field properties by learning a sparse code for natural images , 1996, Nature.

[16]  Demetrio Labate,et al.  Optimally Sparse Multidimensional Representation Using Shearlets , 2007, SIAM J. Math. Anal..

[17]  L D Cromwell,et al.  Filtering noise from images with wavelet transforms , 1991, Magnetic resonance in medicine.

[18]  Michael Elad,et al.  Efficient Implementation of the K-SVD Algorithm using Batch Orthogonal Matching Pursuit , 2008 .

[19]  Jitendra Malik,et al.  Scale-Space and Edge Detection Using Anisotropic Diffusion , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[20]  Emmanuel J. Candès,et al.  New multiscale transforms, minimum total variation synthesis: applications to edge-preserving image reconstruction , 2002, Signal Process..

[21]  Eero P. Simoncelli,et al.  Multiscale Denoising of Photographic Images , 2009 .

[22]  L. Demanet,et al.  Wave atoms and sparsity of oscillatory patterns , 2007 .

[23]  Thai V. Hoang Image Representations for Pattern Recognition , 2011 .

[24]  Joachim Weickert,et al.  Coherence-Enhancing Diffusion Filtering , 1999, International Journal of Computer Vision.

[25]  E. Candès,et al.  New tight frames of curvelets and optimal representations of objects with piecewise C2 singularities , 2004 .

[26]  Petros Maragos Chapter 13 – Morphological Filtering , 2009 .

[27]  Minh N. Do,et al.  Ieee Transactions on Image Processing the Contourlet Transform: an Efficient Directional Multiresolution Image Representation , 2022 .

[28]  Sergio Escalera,et al.  Report on the Third Contest on Symbol Recognition , 2007, GREC.

[29]  Elisa H. Barney Smith Modeling image degradations for improving OCR , 2008, 2008 16th European Signal Processing Conference.

[30]  N. Otsu A threshold selection method from gray level histograms , 1979 .

[31]  Donggang Yu,et al.  An efficient algorithm for smoothing, linearization and detection of structural feature points of binary image contours , 1997, Pattern Recognit..

[32]  Elisa H. Barney Smith,et al.  Edge noise in document images , 2009, AND '09.

[33]  Gonzalo R. Arce,et al.  Nonlinear Filtering for Image Analysis and Enhancement , 2009 .

[34]  Ronald R. Coifman,et al.  Brushlets: A Tool for Directional Image Analysis and Image Compression , 1997 .

[35]  Jean-Michel Morel,et al.  A Review of Image Denoising Algorithms, with a New One , 2005, Multiscale Model. Simul..

[36]  Robert Haimes,et al.  Multiscale and Multiresolution Methods , 2002 .

[37]  Xiaoming Huo,et al.  Beamlets and Multiscale Image Analysis , 2002 .

[38]  Gerlind Plonka-Hoch,et al.  The Curvelet Transform , 2010, IEEE Signal Processing Magazine.

[39]  Michael Elad,et al.  Image Denoising with Shrinkage and Redundant Representations , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[40]  Kazuhiko Yamamoto,et al.  Structured Document Image Analysis , 1992, Springer Berlin Heidelberg.

[41]  Michael A. Saunders,et al.  Atomic Decomposition by Basis Pursuit , 1998, SIAM J. Sci. Comput..

[42]  R. Tibshirani,et al.  Least angle regression , 2004, math/0406456.

[43]  Y. C. Pati,et al.  Orthogonal matching pursuit: recursive function approximation with applications to wavelet decomposition , 1993, Proceedings of 27th Asilomar Conference on Signals, Systems and Computers.

[44]  Guillermo Sapiro,et al.  Online Learning for Matrix Factorization and Sparse Coding , 2009, J. Mach. Learn. Res..

[45]  I. Daubechies,et al.  Iteratively reweighted least squares minimization for sparse recovery , 2008, 0807.0575.

[46]  Richard W. Hamming,et al.  Error detecting and error correcting codes , 1950 .

[47]  Tai Sing Lee,et al.  Image Representation Using 2D Gabor Wavelets , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[48]  D. Donoho For most large underdetermined systems of linear equations the minimal 𝓁1‐norm solution is also the sparsest solution , 2006 .

[49]  Alan C. Bovik,et al.  The Essential Guide to Image Processing , 2009, J. Electronic Imaging.

[50]  Venkat Chandrasekaran,et al.  Representation and Compression of Multidimensional Piecewise Functions Using Surflets , 2009, IEEE Transactions on Information Theory.

[51]  Salvatore Tabbone,et al.  Author manuscript, published in "IEEE International Conference on Image Processing- ICIP'2011 (2011)" EDGE NOISE REMOVAL IN BILEVEL GRAPHICAL DOCUMENT IMAGES USING SPARSE REPRESENTATION , 2011 .

[52]  Elisa H. Barney Smith,et al.  Statistical image differences, degradation features, and character distance metrics , 2003, Document Analysis and Recognition.

[53]  L. Rudin,et al.  Nonlinear total variation based noise removal algorithms , 1992 .

[54]  I. Johnstone,et al.  Adapting to Unknown Smoothness via Wavelet Shrinkage , 1995 .

[55]  C. Boncelet Image Noise Models , 2009 .