Compression of Compound Documents

Compound (or mixed) document images contain graphic or textual content along with pictures. They are a very common form of documents, found in magazines, brochures, web-sites etc. Because of the very distinct nature of those two image classes (text/graphics vs. pictures), their compression invariably involves multiple compression systems and a region segmentation (classification) method. We review state-of-the-art technologies on the subject while focusing our attention on the mixed raster content (MRC) multi-layer approach. We also present new results on segmentation for MRC based on optimized rate-distortion-based block thresholding.

[1]  Yoshua Bengio,et al.  High quality document image compression with "DjVu" , 1998, J. Electronic Imaging.

[2]  Daniel P. Huttenlocher,et al.  Digipaper: a versatile color document image representation , 1999, Proceedings 1999 International Conference on Image Processing (Cat. 99CH36348).

[3]  Scott T. Acton,et al.  Document page segmentation using multiscale clustering , 1999, Proceedings 1999 International Conference on Image Processing (Cat. 99CH36348).

[4]  Faouzi Kossentini,et al.  A fast segmentation algorithm for bi-level image compression using JBIG2 , 1999, Proceedings 1999 International Conference on Image Processing (Cat. 99CH36348).

[5]  Pascal Vincent,et al.  Color documents on the Web with DjVu , 1999, Proceedings 1999 International Conference on Image Processing (Cat. 99CH36348).

[6]  Edward K. Wong,et al.  Check image compression using a layered coding method , 1998, J. Electronic Imaging.

[7]  Lloyd McIntyre,et al.  New Developments in Color Facsimile and Internet Fax , 1997, Color Imaging Conference.

[8]  Stephen N. Zilles,et al.  File Format for Internet Fax , 1998, RFC.

[9]  Ricardo L. de Queiroz,et al.  Nonexpansive pyramid for image coding using a nonlinear filterbank , 1998, IEEE Trans. Image Process..

[10]  Trac D. Tran,et al.  Optimizing block-thresholding segmentation for multilayer compression of compound images , 2000, IEEE Trans. Image Process..

[11]  Amir Said,et al.  Simplified segmentation for compound image compression , 1999, Proceedings 1999 International Conference on Image Processing (Cat. 99CH36348).

[12]  Ming Xu,et al.  Mixed raster content (MRC) model for compound image compression , 1998, Electronic Imaging.

[13]  Faouzi Kossentini,et al.  Lossy compression of stochastic halftones with JBIG2 , 1999, Proceedings 1999 International Conference on Image Processing (Cat. 99CH36348).

[14]  Joan L. Mitchell,et al.  JPEG: Still Image Data Compression Standard , 1992 .

[15]  Gregory K. Wallace,et al.  The JPEG Still Image Compression Standard , 1991 .