Parallel Rate Distortion Optimized Quantization for 4K Real-time GPU-based HEVC Encoder

We proposed a highly parallel rate distortion optimized quantization (RDOQ) for 4K real-time GPU-based HEVC encoder. RDOQ optimizes a quantized value in the view of a tradeoff relationship between picture quality and compression efficiency. While it brings better compression efficiency for HEVC, it is difficult to process on GPU. This is because two parts which compose RDOQ; the cost calculation and the optimization, are sequential. The proposed method parallelizes both of parts and accelerates RDOQ on GPU. For the cost calculation, the proposed method uses the history data of previous frame. Furthermore, to parallelize the optimization part, the proposed method applies bi-directional parallel scan which can be processed on GPU. Experimental results show that the proposed method improved 26.43 % of BD-rate compared with the conventional GPU-based encoder without RDOQ which enables 4K/60FPS real-time encoding. Furthermore, the proposed method is 5x faster than x265 which is the most practical CPU-based encoder under similar conditions of BD-rate.

[1]  Fumiyo Takano,et al.  Highly parallel transformation and quantization for HEVC encoder on GPUs , 2016, 2016 Visual Communications and Image Processing (VCIP).

[2]  Yicong Zhou,et al.  High-speed implementation of rate-distortion optimised quantisation for H.265/HEVC , 2015, IET Image Process..

[3]  G. Bjontegaard,et al.  Calculation of Average PSNR Differences between RD-curves , 2001 .

[4]  Homer H. Chen,et al.  Acceleration of rate-distortion optimized quantization for H.264/AVC , 2013, 2013 IEEE International Symposium on Circuits and Systems (ISCAS2013).

[5]  Yongdong Zhang,et al.  High Efficiency Video Coding: High Efficiency Video Coding , 2014 .

[6]  Wen Gao,et al.  Hybrid Laplace Distribution-Based Low Complexity Rate-Distortion Optimized Quantization , 2017, IEEE Transactions on Image Processing.

[7]  Fumiyo Takano,et al.  4K-UHD real-time HEVC encoder with GPU accelerated motion estimation , 2017, 2017 IEEE International Conference on Image Processing (ICIP).

[8]  Zhiyong Gao,et al.  Improved rate distortion optimized quantization for HEVC with adaptive thresholding , 2016, 2016 IEEE International Symposium on Broadband Multimedia Systems and Broadcasting (BMSB).

[9]  Wen Gao,et al.  Rate-GOP Based Rate Control for High Efficiency Video Coding , 2013, IEEE Journal of Selected Topics in Signal Processing.

[10]  F. Bossen,et al.  Common test conditions and software reference configurations , 2010 .