Low-Cost and High-Throughput Hardware Design for the HEVC 16x16 2-D DCT Transform

This article presents the hardware design of the 16x16 2-D DCT used in the new video coding standard, the HEVC – High Efficiency Video Coding. The transforms stage is one of the innovations proposed by HEVC, since a variable size transforms stage is available (from 4x4 to 32x32), allowing the use of transforms with larger dimensions than used in previous standards. The presented design explores the 2-D DCT separability property, using two instances of the one-dimension DCT. The architecture focuses on low hardware cost and high throughput, thus the HEVC 16-points DCT algorithm was simplified targeting a more efficient hardware implementation. Operations and hardware minimization strategies were used in order to achieve such simplifications: operation reordering, factoring, multiplications to shift-adds conversion, and sharing of common sub-expressions. The 1-D DCT architectures were designed in a fully combinational way in order to reduce control overhead. A transposition buffer is used to connect the two 1-D DCT architectures. The synthesis was directed to Stratix III FPGA and TSMC 65nm standard cells technologies. The complete 2-D DCT architecture is able to achieve real-time processing for high and ultra-high definition videos, such as Full HD, QFHD and UHD 8K. When compared with related works, the architectures designed in this work reached the highest throughput and the lowest hardware resources

[1]  Chuohao Yeo,et al.  Efficient Integer DCT Architectures for HEVC , 2014, IEEE Transactions on Circuits and Systems for Video Technology.

[2]  Gary J. Sullivan,et al.  Overview of the High Efficiency Video Coding (HEVC) Standard , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[3]  Dongsheng Wang,et al.  Fully pipelined DCT/IDCT/Hadamard unified transform architecture for HEVC Codec , 2013, 2013 IEEE International Symposium on Circuits and Systems (ISCAS2013).

[4]  Itu-T and Iso Iec Jtc Advanced video coding for generic audiovisual services , 2010 .

[5]  Guido Masera,et al.  VLSI implementation of 16-point DCT for H.265/HEVC using walsh hadamard transform and lifting scheme , 2011, 2011 IEEE 14th International Multitopic Conference.

[6]  M. Grellert,et al.  Low cost and high throughput multiplierless design of a 16 point 1-D DCT of the new HEVC video coding standard , 2012, 2012 VIII Southern Conference on Programmable Logic.

[7]  P. Lønning,et al.  Performance comparison of three BRAF V600E detection methods in malignant melanoma and colorectal cancer specimens , 2014, Tumor Biology.

[8]  Nuno Roma,et al.  High performance multi-standard architecture for DCT computation in H.264/AVC High Profile and HEVC codecs , 2013, 2013 Conference on Design and Architectures for Signal and Image Processing.

[9]  Min-Su Cheon,et al.  Improved Video Compression Efficiency Through Flexible Unit Representation and Corresponding Extension of Coding Tools , 2010, IEEE Transactions on Circuits and Systems for Video Technology.

[10]  Wen-Hsiung Chen,et al.  A Fast Computational Algorithm for the Discrete Cosine Transform , 1977, IEEE Trans. Commun..

[11]  Sergio Bampi,et al.  Design and FPGA prototyping of a H.264/AVC main profile decoder for HDTV , 2010, Journal of the Brazilian Computer Society.

[12]  David Flynn,et al.  HEVC Complexity and Implementation Analysis , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[13]  Madhukar Budagavi,et al.  Core Transform Design in the High Efficiency Video Coding (HEVC) Standard , 2013, IEEE Journal of Selected Topics in Signal Processing.

[14]  Iain E. G. Richardson,et al.  H.264 and MPEG-4 Video Compression: Video Coding for Next-Generation Multimedia , 2003 .

[15]  Ilker Hamzaoglu,et al.  A low energy HEVC Inverse DCT hardware , 2013, 2013 IEEE Third International Conference on Consumer Electronics ¿ Berlin (ICCE-Berlin).

[16]  A. Madanayake,et al.  A multiplication-free digital architecture for 16×16 2-D DCT/DST transform for HEVC , 2012, 2012 IEEE 27th Convention of Electrical and Electronics Engineers in Israel.

[17]  Detlev Marpe,et al.  Performance comparison of H.265/MPEG-HEVC, VP9, and H.264/MPEG-AVC encoders , 2013, 2013 Picture Coding Symposium (PCS).