Low-complexity comprehensive labeling and enhancement algorithm for compound documents

We present a multiresolutional algorithm that segments a compound document and uses the results of the segmentation for document enhancement in copier applications. The document is initially segmented into halftone and nonhalftone areas. Based on this segmentation the location of the edges due to text, graphics, and images (and not due to halftone dots) are detected on halftone as well as on nonhalftone portions. We further detect constant-tone regions within nonhalftone areas for subsequent bleed-through removal applications. Edge enhancement on detected edges and descreening on detected halftones are carried out. The algorithm can detect general halftones over regions of arbitrary sizes and shapes, and it can be straightforwardly adjusted for operation at various dpi resolutions. We obtain high detection probabilities on compound multilingual documents containing halftones and fine text. The proposed enhancement stage is tolerant of segmentation errors providing robust performance for the remaining problem cases. Our main contribution is the accomplishment of these tasks with a single pass algorithm that is computationally very simple and that requires less than 1% of full page memory, with active memory requirements less than 0.02% of full page memory. The operation of the algorithm can be imagined as a very thin line (of thickness the size of a "full-stop" in 11 pt text) that rapidly scans an input page while simultaneously producing an output page.

[1]  Onur G. Guleryuz,et al.  Fast text/graphics resolution improvement using wavelet based denoising and chain-code table lookup , 2002, Proceedings. International Conference on Image Processing.

[2]  A. Ravishankar Rao,et al.  Wavelet based halftone segmentation and descreening filter design , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[3]  Henry R. Kang Digital Color Halftoning , 1999 .

[4]  Anil K. Jain Fundamentals of Digital Image Processing , 2018, Control of Color Imaging Systems.

[5]  Paul G. Roetling,et al.  Fourier spectrum of halftone images , 1975 .

[6]  A. Ravishankar Rao,et al.  Segmentation and automatic descreening of scanned documents , 1998, Electronic Imaging.

[7]  Robert Ulichney,et al.  Digital Halftoning , 1987 .

[8]  Jan P. Allebach,et al.  Computer-aided design of clustered-dot color screens based on a human visual system model , 2002 .

[9]  William E. Higgins,et al.  Extracting halftones from printed documents using texture analysis , 1996, Proceedings of 3rd IEEE International Conference on Image Processing.

[10]  Faouzi Kossentini,et al.  A fast segmentation algorithm for bi-level image compression using JBIG2 , 1999, Proceedings 1999 International Conference on Image Processing (Cat. 99CH36348).

[11]  Anil K. Jain,et al.  Document Representation and Its Application to Page Decomposition , 1998, IEEE Trans. Pattern Anal. Mach. Intell..

[12]  Thrasyvoulos N. Pappas,et al.  A robust and efficient algorithm for bilevel document block classification , 2001, Proceedings 2001 International Conference on Image Processing (Cat. No.01CH37205).

[13]  Xiangdong Liu,et al.  Analysis of moire patterns in non-uniformly sampled halftones , 2000, Image Vis. Comput..

[14]  Ahmed H. Tewfik,et al.  Color halftone document segmentation and descreening , 2001, Proceedings 2001 International Conference on Image Processing (Cat. No.01CH37205).