A Fast GPU-Based Motion Estimation Algorithm for HD 3D Video Coding

H.264/AVC is the commercial standard currently most in use for video and is based on single view (mono view). Recently, the video community has also standardized an H.264/AVC extension for supporting 3D video sensation which is referred to as Multiview Video Coding (MVC). Like H.264/AVC, MVC includes temporal and spatial prediction but also includes inter-view prediction as well as disparity estimation. Until now, in H.264/AVC the inter prediction techniques have been the most time-consuming tasks and, thus, in MVC with its new interview predictions the encoding time is even higher. This paper proposes a GPU-based algorithm for both temporal and interview prediction. The algorithm proposed is able to perform this complex prediction task by means of an efficient distribution of all the computations over the GPU and also tries to mitigate the sequential dependencies. The approach can achieve a remarkable time reduction of up to 98% with only a negligible loss in coding efficiency. Moreover, this paper shows that the proposed GPU-based algorithm is more energy efficient and thus, requires less energy than the sequential reference.

[1]  Hsueh-Ming Hang,et al.  H.264/AVC motion estimation implmentation on Compute Unified Device Architecture (CUDA) , 2008, 2008 IEEE International Conference on Multimedia and Expo.

[2]  Ajay Luthra,et al.  Overview of the H.264/AVC video coding standard , 2003, IEEE Trans. Circuits Syst. Video Technol..

[3]  Aljoscha Smolic,et al.  Efficient Prediction Structures for Multiview Video Coding , 2007, IEEE Transactions on Circuits and Systems for Video Technology.

[4]  HOMAS,et al.  Overview of the Stereo and Multiview Video Coding Extensions of the H . 264 / MPEG-4 AVC Standard , 2022 .

[5]  Gary J. Sullivan,et al.  Overview of the Stereo and Multiview Video Coding Extensions of the H.264/MPEG-4 AVC Standard , 2011, Proceedings of the IEEE.

[6]  Gary Sullivan,et al.  Recommended Simulation Common Conditions for H.26L Coding Efficiency Experiments on Low Resolution Progressive Scan Source Material , 2001 .

[7]  Zhi Liu,et al.  Selective Disparity Estimation and Variable Size Motion Estimation Based on Motion Homogeneity for Multi-View Coding , 2009, IEEE Transactions on Broadcasting.

[8]  J. Li,et al.  An Epipolar Geometry-Based Fast Disparity Estimation Algorithm for Multiview Image and Video Coding , 2007, IEEE Transactions on Circuits and Systems for Video Technology.

[9]  Liang-Gee Chen,et al.  Content-Aware Prediction Algorithm With Inter-View Mode Decision for Multiview Video Coding , 2008, IEEE Transactions on Multimedia.

[10]  Oscar C. Au,et al.  Video Coding On Multi-Core Graphics Processors , 2009 .

[11]  Dinesh Manocha,et al.  High-performance computing using accelerators , 2007, Parallel Comput..

[12]  Wen-mei W. Hwu,et al.  Optimization principles and application performance evaluation of a multithreaded GPU using CUDA , 2008, PPoPP.

[13]  Oscar C. Au,et al.  Video Coding on Multicore Graphics Processors , 2010, IEEE Signal Processing Magazine.

[14]  Gary J. Sullivan,et al.  Rate-constrained coder control and comparison of video coding standards , 2003, IEEE Trans. Circuits Syst. Video Technol..

[15]  G. Bjontegaard,et al.  Calculation of Average PSNR Differences between RD-curves , 2001 .

[16]  He Yejunb Efficient Prediction Structure of Multi-view Video Coding , 2012 .

[17]  Ming Li,et al.  Scalable prediction structure for multiview video coding , 2009, 2009 IEEE International Symposium on Circuits and Systems.

[18]  Oscar C. Au,et al.  Highly Parallel Rate-Distortion Optimized Intra-Mode Decision on Multicore Graphics Processors , 2009, IEEE Transactions on Circuits and Systems for Video Technology.

[19]  Wen Gao,et al.  A Fast Inter Frame Prediction Algorithm for Multi-View Video Coding , 2007, 2007 IEEE International Conference on Image Processing.