High quality scanned book compression using pattern matching

This paper proposes a hybrid approximate pattern matching/transform-based compression engine. The idea is to use regular video interframe prediction as a pattern matching algorithm that can be applied to document coding. We show that this interpretation may generate residual data that can be efficiently compressed by a transform-based encoder. The novelty of this approach is demonstrated by using H.264/AVC, the newest video compression standard, as a high quality book compressor. The proposed method uses segments of the originally independent scanned pages of a book to create a video sequence, which is encoded through regular H.264/AVC. Results show that the proposed method outperforms AVC-I (H.264/AVC operating in pure intra mode) and JPEG2000 by up to 4 dB and 7 dB, respectively. Superior subjective quality is also achieved.
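The idea of treating interframe prediction as pattern matching can be illustrated with a toy sketch. It is not the paper's encoder: H.264/AVC performs its own motion estimation and transform coding internally. Every name here (`split_into_frames`, `sad`, `best_match`), along with the frame height, the search radius, and the 8x8 test page, is a hypothetical choice made only for illustration. The sketch cuts one page into equally sized frames, then searches a previous frame for the best match to a block of the current frame, as a block-matching predictor would.

```python
def split_into_frames(page, frame_h):
    """Cut one scanned page (a list of pixel rows) into equally sized frames."""
    usable = len(page) - len(page) % frame_h  # drop a ragged tail, if any
    return [page[y:y + frame_h] for y in range(0, usable, frame_h)]


def sad(ref, block, top, left):
    """Sum of absolute differences between block and ref at (top, left)."""
    return sum(
        abs(ref[top + y][left + x] - block[y][x])
        for y in range(len(block))
        for x in range(len(block[0]))
    )


def best_match(ref, block, top, left, radius):
    """Exhaustive search around (top, left) in ref; returns (cost, dy, dx)."""
    bh, bw = len(block), len(block[0])
    best = None
    for dy in range(-radius, radius + 1):
        for dx in range(-radius, radius + 1):
            y, x = top + dy, left + dx
            # Only consider candidates that lie fully inside the reference.
            if 0 <= y <= len(ref) - bh and 0 <= x <= len(ref[0]) - bw:
                cand = (sad(ref, block, y, x), dy, dx)
                if best is None or cand < best:
                    best = cand
    return best


# Toy page (assumed data): the same 2x2 "glyph" appears twice on the page.
page = [[0] * 8 for _ in range(8)]
glyph = [[9, 5], [5, 9]]
for y in range(2):
    for x in range(2):
        page[1 + y][1 + x] = glyph[y][x]  # first occurrence, lands in frame 0
        page[6 + y][3 + x] = glyph[y][x]  # repeated occurrence, lands in frame 1

frames = split_into_frames(page, 4)

# Predict the glyph block of frame 1 from frame 0.
block = [row[3:5] for row in frames[1][2:4]]
cost, dy, dx = best_match(frames[0], block, 2, 3, 2)
print(cost, dy, dx)
```

When the search finds an earlier occurrence of the same pattern, the matching cost is zero and the residual left for the transform stage is empty. On real scanned text the repeated characters match only approximately, so a small residual remains, which is the data the abstract describes as suitable for a transform-based encoder.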
