Annealed learning based block transforms for HEVC video coding

Most of the recent video compression standards employ the Discrete Cosine Transform (DCT) for transforming the residual signal in order to remove spatial correlation and to achieve higher compression efficiency. However, by careful adaptation of transforms to the video content, a better set of integer transforms can be obtained. This paper proposes a new on-the-fly block-based transform optimization technique which involves first the classification of the residual blocks based on the cost of encoding the block, and then the generation of new optimized transforms for each class. An annealing based learning technique is further proposed in this paper in order to improve the performance of the optimization algorithm. The algorithm is tested using the latest HEVC test software where an optimized set of transforms is learned on the first frame of the HEVC test sequences and then applied to the subsequent frames in a Random Access (RA) and All Intra (AI) configuration. The results shows that this method can gain over 2% in terms of Bjontegaard Delta (BD)-rate compared to standard HEVC encoder in AI configuration and nearly 1.5% in RA.

[1]  Bing Zeng,et al.  Directional Discrete Cosine Transforms—A New Framework for Image Coding , 2008, IEEE Transactions on Circuits and Systems for Video Technology.

[2]  King Ngi Ngan,et al.  Efficient H.264/AVC Video Coding with Adaptive Transforms , 2014, IEEE Transactions on Multimedia.

[3]  Marta Karczewicz,et al.  Improved h.264 intra coding based on bi-directional intra prediction, directional transform, and adaptive coefficient scanning , 2008, 2008 15th IEEE International Conference on Image Processing.

[4]  Ivan W. Selesnick,et al.  A diagonally-oriented DCT-like 2D block transform , 2011, Optical Engineering + Applications.

[5]  King Ngi Ngan,et al.  Video content dependent directional transform for intra frame coding , 2012, 2012 Picture Coding Symposium.

[6]  Chi-Wah Kok,et al.  Fast algorithm for computing discrete cosine transform , 1997, IEEE Trans. Signal Process..

[7]  Onur G. Guleryuz,et al.  Sparse orthonormal transforms for image compression , 2008, 2008 15th IEEE International Conference on Image Processing.

[8]  Anthony Vetro,et al.  Direction-adaptive transforms for coding prediction residuals , 2010, 2010 IEEE International Conference on Image Processing.

[9]  Mark R. Pickering,et al.  Improved H.264-based video coding using an adaptive transform , 2010, 2010 IEEE International Conference on Image Processing.

[10]  Wen Gao,et al.  Video Coding With Rate-Distortion Optimized Transform , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[11]  Wen Gao,et al.  Mode Dependent Coding Tools for Video Coding , 2013, IEEE Journal of Selected Topics in Signal Processing.

[12]  Joel Solé,et al.  Joint sparsity-based optimization of a set of orthonormal 2-D separable block transforms , 2009, 2009 16th IEEE International Conference on Image Processing (ICIP).

[13]  Oscar C. Au,et al.  Rate distortion optimized transform for intra block coding for HEVC , 2011, 2011 Visual Communications and Image Processing (VCIP).

[14]  Anthony Vetro,et al.  Robust Learning of 2-D Separable Transforms for Next-Generation Video Coding , 2011, 2011 Data Compression Conference.