Hardware Implementation and Performance Evaluation of the HEVC Intra-predicted $$4\times 4$$4×4 Blocks Transforms

This paper presents a performance evaluation of the high efficiency video coding (HEVC) based exclusively on the discrete cosine transform DCT-II and compares it to the reference algorithm which uses with the DCT-II, a discrete sine transform (DST-VI) for $$4\times 4$$4×4 intra-predicted luminance blocks. Then, a modified version of DST-VI called “MDST-VI” is proposed and evaluated. Using the HEVC reference model HM-16.3 and the main profile, the test results show that the MDST-VI presents an interesting compromise between the DST-VI and the DCT-II. The former leads to the best rate-distortion results, the latter yields a quality mean decrease of 0.02 dB and a mean bit-rate increase of 0.47 %, while the MDST_VI reduces the previous figures to the tenth. In a second part, a FPGA implementation performance analysis of the three transforms indicates that the DCT-II and the MDST-VI outperform the DST-VI with a 55.27 and 41.35 % area reduction, 47.73 and 10.61 % energy saving, and 85.70 and 10.55 % throughput gain, respectively. The MDST-VI could well fulfill the requirements of present and next-generation mobile communication where high-throughput and low-powered hardware accelerators are highly desirable while ensuring the best quality and the lowest bit-rate.

[1]  Ying Chen,et al.  Standardized Extensions of High Efficiency Video Coding (HEVC) , 2013, IEEE Journal of Selected Topics in Signal Processing.

[2]  Arjuna Madanayake,et al.  Multiplierless Approximate 4-point DCT VLSI Architectures for Transform Block Coding , 2013, ArXiv.

[3]  Fábio M. Bayer,et al.  DCT-like Transform for Image Compression Requires 14 Additions Only , 2017, ArXiv.

[4]  Ankur Saxena,et al.  Fast Transforms for Intra-prediction-based Image and Video Coding , 2013, 2013 Data Compression Conference.

[5]  G. Bjontegaard,et al.  Calculation of Average PSNR Differences between RD-curves , 2001 .

[6]  Soontorn Oraintara,et al.  FPGA Implementation of Highly Modular Fast Universal Discrete Transforms , 2009, IEICE Trans. Electron..

[7]  Susanto Rahardja,et al.  Mode-dependent fast separable KLT for block-based intra coding , 2011, 2011 IEEE International Symposium of Circuits and Systems (ISCAS).

[8]  K. Rao,et al.  Discrete Cosine and Sine Transforms: General Properties, Fast Algorithms and Integer Approximations , 2006 .

[9]  Yuriy A. Reznik,et al.  Fast computing of discrete cosine and sine transforms of types VI and VII , 2011, Optical Engineering + Applications.

[10]  Vladimir Britanak,et al.  CHAPTER 1 – Discrete Cosine and Sine Transforms , 2006 .

[11]  Mathias Wien,et al.  High Efficiency Video Coding: Coding Tools and Specification , 2014 .

[12]  Kenneth Rose,et al.  Towards jointly optimal spatial prediction and adaptive transform in video/image coding , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[13]  Grzegorz Pastuszak Flexible Architecture Design for H.265/HEVC Inverse Transform , 2015, Circuits Syst. Signal Process..

[14]  Fábio M. Bayer,et al.  A DCT Approximation for Image Compression , 2011, IEEE Signal Processing Letters.

[15]  F. Bossen,et al.  Common test conditions and software reference configurations , 2010 .

[16]  Gary J. Sullivan,et al.  Overview of the High Efficiency Video Coding (HEVC) Standard , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[17]  Yuriy A. Reznik Relationship between DCT-II, DCT-VI, and DST-VII transforms , 2013, 2013 IEEE International Conference on Acoustics, Speech and Signal Processing.

[18]  M N.,et al.  Efficient Integer DCT Architectures for HEVC , 2015 .

[19]  M. N. Shanmukha Swamy,et al.  New Systolic Algorithm and Array Architecture for Prime-Length Discrete Sine Transform , 2007, IEEE Transactions on Circuits and Systems II: Express Briefs.

[20]  Yongdong Zhang,et al.  High Efficiency Video Coding: High Efficiency Video Coding , 2014 .