Video Compression Using Nested Quadtree Structures, Leaf Merging, and Improved Techniques for Motion Representation and Entropy Coding

Abstract-A video coding architecture is described that is based on nested and pre-configurable quadtree structures for flexible and signal-adaptive picture partitioning. The primary goal of this partitioning concept is to provide a high degree of adaptability for both temporal and spatial prediction as well as for the purpose of space-frequency representation of prediction residuals. At the same time, a leaf merging mechanism is included in order to prevent excessive partitioning of a picture into prediction blocks and to reduce the amount of bits for signaling the prediction signal. For fractional-sample motion-compensated prediction, a fixed-point implementation of the maximal-order minimum-support algorithm is presented that uses a combination of infinite impulse response and FIR filtering. Entropy coding utilizes the concept of probability interval partitioning entropy codes that offers new ways for parallelization and enhanced throughput. The presented video coding scheme was submitted to a joint call for proposals of ITU-T Visual Coding Experts Group and ISO/IEC Moving Picture Experts Group and was ranked among the five best performing proposals, both in terms of subjective and objective quality.

[1]  Gary J. Sullivan,et al.  Rate-constrained coder control and comparison of video coding standards , 2003, IEEE Trans. Circuits Syst. Video Technol..

[2]  He Special Section on the Joint Call for Proposals on High Efficiency Video Coding ( HEVC ) Standardization , 2011 .

[3]  D. Marpe,et al.  Video coding with H.264/AVC: tools, performance, and complexity , 2004, IEEE Circuits and Systems Magazine.

[4]  Gary J. Sullivan,et al.  Recent developments in standardization of high efficiency video coding (HEVC) , 2010, Optical Engineering + Applications.

[5]  M. Victor Wickerhauser,et al.  Adapted wavelet analysis from theory to software , 1994 .

[6]  Thierry Blu,et al.  MOMS: maximal-order interpolation of minimal support , 2001, IEEE Trans. Image Process..

[7]  Philip A. Chou,et al.  Optimal pruning with applications to tree-structured source coding and modeling , 1989, IEEE Trans. Inf. Theory.

[8]  Heiko Schwarz,et al.  Improved context modeling for coding quantized transform coefficients in video compression , 2010, 28th Picture Coding Symposium.

[9]  Heiko Schwarz,et al.  Entropy coding in video compression using probability interval partitioning , 2010, 28th Picture Coding Symposium.

[10]  Michael J. Gormish,et al.  Very high speed entropy coding , 1994, Proceedings of 1st International Conference on Image Processing.

[11]  Gary J. Sullivan,et al.  Efficient quadtree coding of images and video , 1994, IEEE Trans. Image Process..

[12]  Heiko Schwarz,et al.  Context-based adaptive binary arithmetic coding in the H.264/AVC video compression standard , 2003, IEEE Trans. Circuits Syst. Video Technol..

[13]  Ajay Luthra,et al.  Overview of the H.264/AVC video coding standard , 2003, IEEE Trans. Circuits Syst. Video Technol..

[14]  Heiko Schwarz,et al.  Separable Wiener filter based adaptive in-loop filter for video coding , 2010, 28th Picture Coding Symposium.

[15]  Thomas Wedi Adaptive interpolation filters and high-resolution displacements for video coding , 2006, IEEE Transactions on Circuits and Systems for Video Technology.

[16]  G. Bjontegaard,et al.  Calculation of Average PSNR Differences between RD-curves , 2001 .

[17]  Gary J. Sullivan,et al.  On dead-zone plus uniform threshold scalar quantization , 2005, Visual Communications and Image Processing.

[18]  Sanjit K. Mitra,et al.  Rate-distortion optimized mode selection for very low bit rate video coding and the emerging H.263 standard , 1996, IEEE Trans. Circuits Syst. Video Technol..

[19]  Shigenori Kino,et al.  Bi-level image coding with MELCODE-comparison of block type code and arithmetic type code , 1989, IEEE Global Telecommunications Conference, 1989, and Exhibition. 'Communications Technology for the 1990s and Beyond.

[20]  Itu-T and Iso Iec Jtc Advanced video coding for generic audiovisual services , 2010 .

[21]  David S. Taubman,et al.  On the benefits of leaf merging in quad-tree motion models , 2005, IEEE International Conference on Image Processing 2005.

[22]  Minh N. Do,et al.  Rate-distortion optimized tree-structured compression algorithms for piecewise polynomial images , 2005, IEEE Transactions on Image Processing.

[23]  Heiko Schwarz,et al.  Fractional-sample motion compensation using generalized interpolation , 2010, 28th Picture Coding Symposium.

[24]  D. Marpe,et al.  The H.264/MPEG4 advanced video coding standard and its applications , 2006, IEEE Communications Magazine.

[25]  Gary J. Sullivan,et al.  Rate-distortion optimization for video compression , 1998, IEEE Signal Process. Mag..

[26]  Henrique S. Malvar Fast adaptive encoder for bi-level images , 2001, Proceedings DCC 2001. Data Compression Conference.