Efficient H.264/AVC Video Coding with Adaptive Transforms

Transform has been widely used to remove spatial redundancy of prediction residuals in the modern video coding standards. However, since the residual blocks exhibit diverse characteristics in a video sequence, conventional transform methods with fixed transform kernels may result in low efficiency. To tackle this problem, we propose a novel content adaptive transform framework for the H.264/AVC-based video coding. The proposed method utilizes pixel rearrangement to dynamically adjust the transform kernels to adapt to the video content. In addition, unlike the traditional adaptive transforms, the proposed method obtains the transform kernels from the reconstructed block, and hence it consumes only one logic indicator for each transform unit. Moreover, a spiral-scanning method is developed to reorder the transform coefficients for better entropy coding. Experimental results on the Key Technical Area (KTA) platform show that the proposed method can achieve an average bitrate reduction of about 7.95% and 7.0% under all-intra and low-delay configurations, respectively.

[1]  Mark R. Pickering,et al.  Improved H.264-based video coding using an adaptive transform , 2010, 2010 IEEE International Conference on Image Processing.

[2]  Ankur Saxena,et al.  Mode dependent DCT/DST for intra prediction in block-based image/video coding , 2011, 2011 18th IEEE International Conference on Image Processing.

[3]  Anil K. Jain,et al.  A Sinusoidal Family of Unitary Transforms , 1979, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[4]  G. Bjontegaard,et al.  Calculation of Average PSNR Differences between RD-curves , 2001 .

[5]  Anthony Vetro,et al.  Direction-adaptive transforms for coding prediction residuals , 2010, 2010 IEEE International Conference on Image Processing.

[6]  Kenneth Rose,et al.  Jointly Optimized Spatial Prediction and Block Transform for Video and Image Coding , 2012, IEEE Transactions on Image Processing.

[7]  Stanley C. Ahalt,et al.  A hybrid DCT-SVD image-coding algorithm , 2002, IEEE Trans. Circuits Syst. Video Technol..

[8]  K. R. Rao,et al.  The Transform and Data Compression Handbook , 2000 .

[9]  Susanto Rahardja,et al.  Mode-Dependent Transforms for Coding Directional Intra Prediction Residuals , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[10]  Henrique S. Malvar,et al.  Low-complexity transform and quantization in H.264/AVC , 2003, IEEE Trans. Circuits Syst. Video Technol..

[11]  Anthony Vetro,et al.  Robust Learning of 2-D Separable Transforms for Next-Generation Video Coding , 2011, 2011 Data Compression Conference.

[12]  King Ngi Ngan,et al.  An efficient content adaptive transform for video coding , 2013, 2013 IEEE China Summit and International Conference on Signal and Information Processing.

[13]  Ivan W. Selesnick,et al.  The discrete Fourier transforms , 2000 .

[14]  Marta Karczewicz,et al.  Improved h.264 intra coding based on bi-directional intra prediction, directional transform, and adaptive coefficient scanning , 2008, 2008 15th IEEE International Conference on Image Processing.

[15]  Gary J. Sullivan,et al.  Overview of the High Efficiency Video Coding (HEVC) Standard , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[16]  Bing Zeng,et al.  Directional Discrete Cosine Transforms—A New Framework for Image Coding , 2008, IEEE Transactions on Circuits and Systems for Video Technology.

[17]  King Ngi Ngan,et al.  A rate distortion optimized transform for motion compensation residual , 2013, 2013 Picture Coding Symposium (PCS).

[18]  C.-C. Jay Kuo,et al.  Feature-Based Intra-/InterCoding Mode Selection for H.264/AVC , 2007, IEEE Transactions on Circuits and Systems for Video Technology.

[19]  Lu Yu,et al.  Overview of AVS-video coding standards , 2009, Signal Process. Image Commun..

[20]  Wen Gao,et al.  Video Coding With Rate-Distortion Optimized Transform , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[21]  Hyun Wook Park,et al.  Adaptive Interpolation-Based Divide-and-Predict Intra Coding for H.264/AVC , 2010, IEEE Transactions on Circuits and Systems for Video Technology.

[22]  T. Smith,et al.  The C.I.E. colorimetric standards and their use , 1931 .

[23]  King Ngi Ngan,et al.  Two-Layer Directional Transform for High Performance Video Coding , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[24]  Gary J. Sullivan,et al.  Rate-constrained coder control and comparison of video coding standards , 2003, IEEE Trans. Circuits Syst. Video Technol..

[25]  Gene H. Golub,et al.  Singular value decomposition and least squares solutions , 1970, Milestones in Matrix Computation.

[26]  Bu-Sung Lee,et al.  Low-Complexity Video Coding Based on Two-Dimensional Singular Value Decomposition , 2012, IEEE Transactions on Image Processing.

[27]  Chris H. Q. Ding,et al.  2-Dimensional Singular Value Decomposition for 2D Maps and Images , 2005, SDM.

[28]  Ajay Luthra,et al.  Overview of the H.264/AVC video coding standard , 2003, IEEE Trans. Circuits Syst. Video Technol..

[29]  Felix C. A. Fernandes,et al.  Rotational transform for image and video compression , 2011, 2011 18th IEEE International Conference on Image Processing.

[30]  Bernd Girod,et al.  Direction-Adaptive Partitioned Block Transform for Color Image Coding , 2010, IEEE Transactions on Image Processing.

[31]  Jhing-Fa Wang,et al.  Intensity Gradient Technique for Efficient Intra-Prediction in H.264/AVC , 2008, IEEE Transactions on Circuits and Systems for Video Technology.

[32]  Yung-Lyul Lee,et al.  Alternative Transform Based on the Correlation of the Residual Signal , 2008, 2008 Congress on Image and Signal Processing.

[33]  Bo Yan,et al.  Lagrangian Multiplier Based Joint Three-Layer Rate Control for H.264/AVC , 2009, IEEE Signal Processing Letters.