HEVC-based scanned document compression

This paper proposes a hybrid pattern matching/transform-based compression engine for scanned compound documents. The novelty of the approach lies in using a modified version of the HEVC (High Efficiency Video Coding) Test Model as a compound document compressor, referred to here as HEDC (High Efficiency Document Coder). The proposed method arranges segments of a document into a video sequence, which is then encoded by HEDC. The idea is to exploit interframe prediction as a pattern matching algorithm for coding units pre-classified as text, and intraframe prediction for coding units pre-classified as image. Results show that HEDC outperforms AVC-I and HEVC-I (H.264/AVC and HEVC operating in pure intra mode), H.264/AVC and JPEG2000 by up to 3.3, 2.5, 1.7 and 5 dB, respectively. Furthermore, for most documents the proposed method yields practically the same rate-distortion performance as regular HEVC, but is approximately 5% to 20% faster, because the pre-classification algorithm spares the encoder from performing all possible inter/intra prediction tests for each prediction unit.
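
The two ideas in the abstract, turning document segments into frames and restricting the prediction search by pre-classification, can be illustrated with a minimal Python sketch. It is not the HEDC implementation: the function names (`page_to_frames`, `allowed_modes`), the fixed-height horizontal strips, the white padding value and the fallback for unlabeled units are all assumptions made for illustration, and the text/image classifier itself is taken as given.

```python
from typing import List, Tuple

Frame = List[List[int]]  # rows of grayscale pixel values


def page_to_frames(page: Frame, frame_height: int, pad_value: int = 255) -> List[Frame]:
    """Cut a page into equal-height strips that act as video frames.

    Assumption: segments are horizontal strips; the last strip is padded
    with `pad_value` (white) so that every frame has the same size.
    """
    if frame_height <= 0:
        raise ValueError("frame_height must be positive")
    width = len(page[0]) if page else 0
    frames: List[Frame] = []
    for top in range(0, len(page), frame_height):
        rows = [list(row) for row in page[top:top + frame_height]]
        while len(rows) < frame_height:
            rows.append([pad_value] * width)
        frames.append(rows)
    return frames


def allowed_modes(label: str) -> Tuple[str, ...]:
    """Return the prediction modes an encoder would test for a coding unit.

    Text units are limited to inter prediction (pattern matching against
    previous frames) and image units to intra prediction. Testing both
    modes for any other label is an assumption of this sketch.
    """
    if label == "text":
        return ("inter",)
    if label == "image":
        return ("intra",)
    return ("inter", "intra")


# Usage: a toy 5-row page split into 2-row frames.
page = [[row] * 4 for row in range(5)]
frames = page_to_frames(page, frame_height=2)
print(len(frames), allowed_modes("text"), allowed_modes("image"))
```

Restricting each unit to a single mode family is what removes part of the mode-decision search, which is the source of the speed-up the abstract reports.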
