Circular Noises Removal from Scanned Document Images

Defects inspection and correction is an important topic in the fields of scanned documents preprocessing. In this paper, a very fast and robust algorithm is proposed for locating and removing a special kind of circular noises caused by scanning documents with punched holes. Firstly, original image is reduced according to an elaborately selected ratio. Punched holes after reduction will leave some distinctive small regions. By examining such small regions, holes noises can be fast detected and located. To diminish false detections, Hough transformation is applied to the roughly located regions to further confirm the located holes. Finally, circular noise is eliminated by fitting a bi-linear blending Coons surface which interpolates along the four edges of noisy region. Experiments on a variety of scanned documents with punched holes demonstrate the feasibility and efficiency of the proposed algorithm.

[1]  S. M. Steve SUSAN - a new approach to low level image processing , 1997 .

[2]  Kuo-Chin Fan,et al.  Marginal noise removal of document images , 2002, Pattern Recognit..

[3]  N. Otsu A threshold selection method from gray level histograms , 1979 .

[4]  E. R. Davies,et al.  Machine vision - theory, algorithms, practicalities , 2004 .

[5]  Chew Lim Tan,et al.  Restoring Warped Document Images through 3D Shape Modeling , 2006, IEEE Trans. Pattern Anal. Mach. Intell..

[6]  Henry S. Baird,et al.  Document image defect models and their uses , 1993, Proceedings of 2nd International Conference on Document Analysis and Recognition (ICDAR '93).

[7]  Guillermo Sapiro,et al.  Inpainting surface holes , 2003, Proceedings 2003 International Conference on Image Processing (Cat. No.03CH37429).

[8]  Tsnsliiiiia Nala,et al.  Shape from Shading with Interreflections under Proximal Light Source - 3D shape Reconstruction of Unfolded Book Surface from a Scanner Image - , 1995 .