A hybrid video codec based on extended block sizes, recursive integer transforms, improved interpolation, and flexible motion representation

This paper describes video coding technology proposal submitted by Qualcomm Inc. in response to a joint call for proposal (CfP) issued by ITU-T SG16 Q.6 (VCEG) and ISO/IEC JTC1/SC29/WG11 (MPEG) in January 2010. Proposed video codec follows a hybrid coding approach based on temporal prediction, followed by transform, quantization, and entropy coding of the residual. Some of its key features are extended block sizes (up to 64x64), recursive integer transforms, single pass switched interpolation filters with offsets (single pass SIFO), mode dependent directional transform (MDDT) for intra-coding, luma and chroma high precision filtering, geometry motion partitioning, adaptive motion vector resolution. It also incorporates internal bit-depth increase (IBDI), and modified quadtree based adaptive loop filtering (QALF). Simulation results are presented for a variety of bit rates, resolutions and coding configurations to demonstrate the high compression efficiency achieved by the proposed video codec at moderate level of encoding and decoding complexity. For random access hierarchical B configuration (HierB), the proposed video codec achieves an average BD-rate reduction of 30.88c/o compared to the H.264/AVC alpha anchor. For low delay hierarchical P (HierP) configuration, the proposed video codec achieves an average BD-rate reduction of 32.96c/o and 48.57c/o, compared to the H.264/AVC beta and gamma anchors, respectively.

[1]  K. Rao,et al.  Discrete Cosine and Sine Transforms: General Properties, Fast Algorithms and Integer Approximations , 2006 .

[2]  G.S. Moschytz,et al.  Algorithm-architecture mapping for custom DSP chips , 1988, 1988., IEEE International Symposium on Circuits and Systems.

[3]  Itu-T and Iso Iec Jtc Advanced video coding for generic audiovisual services , 2010 .

[4]  Xin Li,et al.  Geometry-Adaptive Block Partitioning for Video Coding , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[5]  Yuriy Reznik,et al.  Design of fast transforms for high-resolution image and video coding , 2009, Optical Engineering + Applications.

[6]  Marta Karczewicz,et al.  R-D based quantization in H.264 , 2009, Optical Engineering + Applications.

[7]  Marta Karczewicz,et al.  Efficient large size transforms for high-performance video coding , 2010, Optical Engineering + Applications.

[8]  G. Plonka,et al.  Fast and numerically stable algorithms for discrete cosine transforms , 2005 .

[9]  G. Bjontegaard,et al.  Calculation of Average PSNR Differences between RD-curves , 2001 .

[10]  Joan L. Mitchell,et al.  Improved precision of fixed-point algorithms by means of common factors , 2008, 2008 15th IEEE International Conference on Image Processing.

[11]  Heiko Schwarz,et al.  Context-based adaptive binary arithmetic coding in the H.264/AVC video compression standard , 2003, IEEE Trans. Circuits Syst. Video Technol..