Fast intra coding unit size decision for HEVC with GPU based keypoint detection

In this paper, a fast intra Coding Unit (CU) size decision framework based on keypoint detection on Graphic Processing Unit (GPU) is proposed. In this framework, firstly the original frames are sent to GPU and then keypoint detection is conducted with numerous threads, which is able to avoid bringing in additional computational complexity even in realtime systems. Then, based on the keypoint distribution, whether to split the CU to the next coding depth is efficiently predicted. Experiments show that the proposed algorithm can achieve over 25% time saving under all intra (AI) configuration with ignorable performance loss.

[1]  Byeungwoo Jeon,et al.  Adaptive keypoint-based CU depth decision for HEVC intra coding , 2016, 2016 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting (BMSB).

[2]  Manuel P. Malumbres,et al.  GPU-based HEVC intra-prediction module , 2016, The Journal of Supercomputing.

[3]  Seoung-Jun Oh,et al.  Variable block size motion estimation implementation on compute unified device architecture (CUDA) , 2013, 2013 IEEE International Conference on Consumer Electronics (ICCE).

[4]  Gary J. Sullivan,et al.  Overview of the High Efficiency Video Coding (HEVC) Standard , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[5]  Li Su,et al.  Multiple layer parallel motion estimation on GPU for High Efficiency Video Coding (HEVC) , 2015, 2015 IEEE International Symposium on Circuits and Systems (ISCAS).

[6]  D. Viswanathan,et al.  Features from Accelerated Segment Test ( FAST ) , 2011 .

[7]  Nuno Roma,et al.  GHEVC: An Efficient HEVC Decoder for Graphics Processing Units , 2017, IEEE Transactions on Multimedia.

[8]  Biao Min,et al.  A Fast CU Size Decision Algorithm for the HEVC Intra Encoder , 2015, IEEE Transactions on Circuits and Systems for Video Technology.

[9]  Jun Sun,et al.  Efficient SIMD optimization of HEVC encoder over X86 processors , 2012, Proceedings of The 2012 Asia Pacific Signal and Information Processing Association Annual Summit and Conference.

[10]  Wen Gao,et al.  Low complexity encoder optimization for HEVC , 2016, J. Vis. Commun. Image Represent..

[11]  John E. Stone,et al.  OpenCL: A Parallel Programming Standard for Heterogeneous Computing Systems , 2010, Computing in Science & Engineering.

[12]  Wen Gao,et al.  GPU based sample adaptive offset parameter decision and perceptual optimization for HEVC , 2016, 2016 IEEE International Symposium on Circuits and Systems (ISCAS).

[13]  Siwei Ma,et al.  Parallel intra coding for HEVC on CPU plus GPU platform , 2015, 2015 Visual Communications and Image Processing (VCIP).

[14]  G. Bjontegaard,et al.  Calculation of Average PSNR Differences between RD-curves , 2001 .