Flexible Architecture Design for H.265/HEVC Inverse Transform

Highly-efficient video coding involves a great amount of computations. Hardware encoders apply different parallelization techniques to satisfy real-time requirements. This paper describes a novel design methodology for the 2D inverse transform used in the H.265/HEVC hardware decoder and encoder. To support different transform sizes, matrix multiplications are decomposed into some steps based on the division of transform blocks into fixed-size subblocks. The assumed order of processed subblocks along with separate transform cores assigned to both dimensions allows a significant reduction of the size of the transposition buffer, which in turn decreases the resource consumption of the whole architecture. The decomposition enables different hardware configurations of the architectures. Particularly, configuration parameters enable the tradeoff between resources and throughput, the interface adaptation to desired horizontal and vertical sizes, and the availability of particular transform sizes. Two versions of the architecture are developed for FPGA and ASIC technologies. Synthesis results show that they can operate at 200 and 400 MHz when implemented in FPGA Arria II and TSMC 90 nm, respectively.

[1]  Khan A. Wahid,et al.  A cost effective implementation of 8×8 transform of HEVC from H.264/AVC , 2012, 2012 25th IEEE Canadian Conference on Electrical and Computer Engineering (CCECE).

[2]  A. Madanayake,et al.  A multiplication-free digital architecture for 16×16 2-D DCT/DST transform for HEVC , 2012, 2012 IEEE 27th Convention of Electrical and Electronics Engineers in Israel.

[3]  M N.,et al.  Efficient Integer DCT Architectures for HEVC , 2015 .

[4]  M. Grellert,et al.  Low cost and high throughput multiplierless design of a 16 point 1-D DCT of the new HEVC video coding standard , 2012, 2012 VIII Southern Conference on Programmable Logic.

[5]  Anantha Chandrakasan,et al.  A 249-Mpixel/s HEVC Video-Decoder Chip for 4K Ultra-HD Applications , 2014, IEEE Journal of Solid-State Circuits.

[6]  Seongsoo Lee,et al.  2-D Large Inverse Transform (16×16, 32×32) for HEVC (High Efficiency Video Coding) , 2012 .

[7]  Madhukar Budagavi,et al.  Unified forward+inverse transform architecture for HEVC , 2012, 2012 19th IEEE International Conference on Image Processing.

[8]  Wen-Hsiung Chen,et al.  A Fast Computational Algorithm for the Discrete Cosine Transform , 1977, IEEE Trans. Commun..

[9]  Muhammad Usman Shahid,et al.  Point DCT VLSI Architecture for Emerging HEVC Standard , 2012, VLSI Design.

[10]  Tian-Sheuan Chang,et al.  A reconfigurable inverse transform architecture design for HEVC decoder , 2013, 2013 IEEE International Symposium on Circuits and Systems (ISCAS2013).

[11]  Dongsheng Wang,et al.  Fully pipelined DCT/IDCT/Hadamard unified transform architecture for HEVC Codec , 2013, 2013 IEEE International Symposium on Circuits and Systems (ISCAS2013).

[12]  Weiwei Shen,et al.  A Unified 4/8/16/32-Point Integer IDCT Architecture for Multiple Video Coding Standards , 2012, 2012 IEEE International Conference on Multimedia and Expo.

[13]  Frank Kienle,et al.  Flexible Radio Design: Trends and Challenges in Digital Baseband Implementation , 2012, VLSI Design.