A High Performance FPGA-Based Architecture for the Future Video Coding Adaptive Multiple Core Transform

Future video coding (FVC) will be the next generation video coding standard. Yet in the first stages of the standardization process, it is expected to replace high efficiency video coding beyond 2020. One of the enhancements in discussion is the adaptive multiple core transform, which uses five different types of 2-D discrete sine/cosine transforms (DCT-II, DCT-V, DCT-VIII, DST-I, and DST-VII) and up to <inline-formula> <tex-math notation="LaTeX">${64 \times 64}$ </tex-math></inline-formula> transform unit sizes. This schema, increases the computational complexity of both, encoder and decoder. In this paper, a deeply pipelined high performance architecture to implement the five transforms for <inline-formula> <tex-math notation="LaTeX">${4 \times 4}$ </tex-math></inline-formula>, <inline-formula> <tex-math notation="LaTeX">${8 \times 8}$ </tex-math></inline-formula>, <inline-formula> <tex-math notation="LaTeX">${16 \times 16}$ </tex-math></inline-formula>, and <inline-formula> <tex-math notation="LaTeX">${32 \times 32}$ </tex-math></inline-formula> sizes is proposed. The design has been described in VHDL and it has been synthesized for different Field-programmable gate array chips, being able to process up 182 fps<inline-formula> <tex-math notation="LaTeX">${@} 3840\times2160$ </tex-math></inline-formula> for <inline-formula> <tex-math notation="LaTeX">${4 \times 4}$ </tex-math></inline-formula> transform sizes. Up the best of our knowledge, this is the first implementation of an architecture for the FVC Adaptive Multiple Core Transform supporting <inline-formula> <tex-math notation="LaTeX">${4 \times 4}$ </tex-math></inline-formula>, <inline-formula> <tex-math notation="LaTeX">${8 \times 8}$ </tex-math></inline-formula>, <inline-formula> <tex-math notation="LaTeX">${16 \times 16}$ </tex-math></inline-formula>, and <inline-formula> <tex-math notation="LaTeX">${32 \times 32}$ </tex-math></inline-formula> sizes.

[1]  Eduardo Juárez Martínez,et al.  Complexity analysis of an HEVC decoder based on a digital signal processor , 2013, IEEE Transactions on Consumer Electronics.

[2]  Gary J. Sullivan,et al.  Overview of the High Efficiency Video Coding (HEVC) Standard , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[3]  Athanassios N. Skodras,et al.  A high speed FPGA implementation of the 2D DCT for Ultra High Definition video coding , 2013, 2013 18th International Conference on Digital Signal Processing (DSP).

[4]  Min Chen,et al.  Efficient architecture of variable size HEVC 2D-DCT for FPGA platforms , 2017 .

[5]  Ilker Hamzaoglu,et al.  High performance 2D transform hardware for future video coding , 2017, IEEE Transactions on Consumer Electronics.

[6]  K. Rao,et al.  Discrete Cosine and Sine Transforms: General Properties, Fast Algorithms and Integer Approximations , 2006 .

[7]  Waleed Fakhr,et al.  An efficient implementation of the 1D DCT using FPGA technology , 2004, Proceedings. 11th IEEE International Conference and Workshop on the Engineering of Computer-Based Systems, 2004..