Scanned Document Compression Using Block-Based Hybrid Video Codec

This paper proposes a hybrid pattern matching/transform-based compression method for scanned documents. The idea is to use regular video interframe prediction as a pattern matching algorithm applied to document coding. We show that this interpretation may generate residual data that a transform-based encoder can compress efficiently. The efficiency of the approach is demonstrated using H.264/advanced video coding (AVC) as a high-quality compressor for single-page and multipage documents. The proposed method, called advanced document coding (ADC), splits the originally independent scanned pages of a document into segments, arranges those segments as the frames of a video sequence, and encodes the sequence with regular H.264/AVC. Results show that ADC outperforms AVC-I (H.264/AVC operating in pure intra mode) by up to 2.7 dB and JPEG2000 by up to 6.2 dB. Superior subjective quality is also achieved.
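The two ideas in the abstract, turning page segments into frames and treating interframe prediction as pattern matching, can be illustrated with a minimal sketch. It is not the authors' implementation: the function names `pages_to_frames` and `best_match`, the non-overlapping raster tiling, the edge padding, and the exhaustive sum-of-absolute-differences (SAD) search are all assumptions chosen for illustration. The abstract does not specify how segments are formed, and a real H.264/AVC encoder performs its own motion search internally.

```python
import numpy as np


def pages_to_frames(pages, frame_h, frame_w):
    """Cut grayscale pages into frame_h x frame_w tiles, in raster order.

    Assumption: non-overlapping tiles, with pages edge-padded up to a
    multiple of the frame size. The resulting stack plays the role of
    the video sequence handed to a video encoder.
    """
    frames = []
    for page in pages:
        h, w = page.shape
        pad_h = (-h) % frame_h  # rows needed to reach a multiple of frame_h
        pad_w = (-w) % frame_w  # columns needed to reach a multiple of frame_w
        padded = np.pad(page, ((0, pad_h), (0, pad_w)), mode="edge")
        for top in range(0, padded.shape[0], frame_h):
            for left in range(0, padded.shape[1], frame_w):
                frames.append(padded[top:top + frame_h, left:left + frame_w])
    return np.stack(frames)


def best_match(block, reference):
    """Exhaustive SAD search for `block` inside `reference`.

    Returns (row, col, sad) of the first position with the lowest SAD.
    This mimics, in a very simplified form, how interframe prediction
    finds a previously coded pattern (for example, a repeated character).
    """
    bh, bw = block.shape
    target = block.astype(np.int64)
    best = (0, 0, np.inf)
    for r in range(reference.shape[0] - bh + 1):
        for c in range(reference.shape[1] - bw + 1):
            cand = reference[r:r + bh, c:c + bw].astype(np.int64)
            sad = int(np.abs(cand - target).sum())
            if sad < best[2]:
                best = (r, c, sad)
    return best


# Two hypothetical 100 x 70 "pages" become eight 64 x 64 frames.
pages = [np.zeros((100, 70), dtype=np.uint8), np.ones((100, 70), dtype=np.uint8)]
frames = pages_to_frames(pages, 64, 64)

# A block copied from a reference frame is found again with zero residual.
rng = np.random.default_rng(0)
reference = rng.integers(0, 256, size=(32, 32), dtype=np.uint8)
match = best_match(reference[5:13, 7:15], reference)
```

When a block of the current frame closely matches a region of an earlier frame, the residual left after prediction is small, which is the situation the abstract describes as favorable for the transform-based stage.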
