Efficient GPU-Based Inter Prediction for Video Decoder

Interpolation is a very important module in inter prediction for any video decoder, e.g. AVS2 [1] and HEVC [2], which occupies most of the time in the whole decoding process . Thus, the real-time decoder is largely limited by the speed of inter prediction. To solve this problem, we propose an efficient GPU-based interpolation framework for inter prediction. Through optimizing shared memory allocation and thread scheduling on the GPU side, GPU are utilized efficiently and inter prediction is accelerated effectively. The experimental results on AVS2 show that for all Ultra HD 4K, WQXGA and full HD video sequences tested, the inter prediction acceleration ratio is over 6 times, and the average processing time is up to 1.25ms, 0.75ms and 0.45ms, respectively, with the NVIDIA GeForce GTX 1080TI GPU.

[1]  Wen Gao,et al.  The second generation IEEE 1857 video coding standard , 2015, 2015 IEEE China Summit and International Conference on Signal and Information Processing (ChinaSIP).

[2]  Mihir Mody HEVC video encoder & decoder architecture for multi-cores , 2014, 2014 International Conference on Signal Processing and Communications (SPCOM).

[3]  Biao Wang,et al.  Efficient HEVC decoder for heterogeneous CPU with GPU systems , 2016, 2016 IEEE 18th International Workshop on Multimedia Signal Processing (MMSP).

[4]  Nuno Roma,et al.  GPU acceleration of the HEVC decoder inter prediction module , 2015, 2015 IEEE Global Conference on Signal and Information Processing (GlobalSIP).

[5]  Debargha Mukherjee,et al.  A Technical Overview of VP9—The Latest Open-Source Video Codec , 2013 .

[6]  Gary J. Sullivan,et al.  Overview of the High Efficiency Video Coding (HEVC) Standard , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[7]  Ben H. H. Juurlink,et al.  SIMD Acceleration for HEVC Decoding , 2015, IEEE Transactions on Circuits and Systems for Video Technology.