Global-local correlation-based early large-size mode decision for multiview video coding

Abstract. Multiview video coding (MVC) is a recent extension of H.264/AVC, and it consumes huge encoding time to select the optimal macroblock (MB) mode, among different size candidate modes. As compared with the small-size mode (Inter16×8, Inter8×16, Inter8×8, Intra8×8, and Intra4×4), the large-size mode (Skip/Direct, Inter16×16, and Intra16×16) occupies most of the MB mode proportion with much less computational complexity. Thus, if the large-size mode could be early decided as the optimal MB mode, the complexity of mode decision could be effectively reduced. In this work, an early large-size mode decision algorithm is proposed based on the global correlation of rate-distortion (RD) costs between neighbor views and the local correlation of RD costs among candidate modes. Average RD costs of large-size and small-size MB modes in the neighbor view are employed as a global reference for the threshold of early decision. And RD costs of estimated modes are used to calculate the local adjustment for the threshold. Experimental results demonstrate that the proposed algorithm can significantly reduce the whole encoding time while maintaining an RD performance similar to that of the original MVC encoder.

[1]  Gary J. Sullivan,et al.  Overview of the Stereo and Multiview Video Coding Extensions of the H.264/MPEG-4 AVC Standard , 2011, Proceedings of the IEEE.

[2]  Zhipin Deng,et al.  Iterative search strategy with selective bi-directional prediction for low complexity multiview video coding , 2012, J. Vis. Commun. Image Represent..

[3]  Susanto Rahardja,et al.  Fast intermode decision in H.264/AVC video coding , 2005, IEEE Transactions on Circuits and Systems for Video Technology.

[4]  Pascal Frossard,et al.  Fast encoding techniques for Multiview Video Coding , 2013, Signal Process. Image Commun..

[5]  Fan Zhou,et al.  Fast inter mode decision based on textural segmentation and correlations for multiview video coding , 2010, IEEE Transactions on Consumer Electronics.

[6]  Gary J. Sullivan,et al.  Rate-constrained coder control and comparison of video coding standards , 2003, IEEE Trans. Circuits Syst. Video Technol..

[7]  Itu-T and Iso Iec Jtc Advanced video coding for generic audiovisual services , 2010 .

[8]  Zhi Liu,et al.  Selective Disparity Estimation and Variable Size Motion Estimation Based on Motion Homogeneity for Multi-View Coding , 2009, IEEE Transactions on Broadcasting.

[9]  G. Bjontegaard,et al.  Calculation of Average PSNR Differences between RD-curves , 2001 .

[10]  Gangyi Jiang,et al.  Efficient Multi-Reference Frame Selection Algorithm for Hierarchical B Pictures in Multiview Video Coding , 2011, IEEE Transactions on Broadcasting.

[11]  Sergio Bampi,et al.  A multi-level dynamic complexity reduction scheme for multiview video coding , 2011, 2011 18th IEEE International Conference on Image Processing.

[12]  Aljoscha Smolic,et al.  Efficient Prediction Structures for Multiview Video Coding , 2007, IEEE Transactions on Circuits and Systems for Video Technology.

[13]  Sam Kwong,et al.  Fast Inter-Mode Decision Based on Rate-Distortion Cost Characteristics , 2010, PCM.

[14]  Fan Zhou,et al.  Fast disparity estimation using spatio-temporal correlation of disparity field for multiview video coding , 2010, IEEE Transactions on Consumer Electronics.

[15]  Tao Yan,et al.  Early SKIP mode decision for MVC using inter-view correlation , 2010, Signal Process. Image Commun..

[16]  Zhi Liu,et al.  Low-Complexity Mode Decision for MVC , 2011, IEEE Transactions on Circuits and Systems for Video Technology.

[17]  Kai-Kuang Ma,et al.  Fast Mode Decision for Multiview Video Coding Using Mode Correlation , 2011, IEEE Transactions on Circuits and Systems for Video Technology.

[18]  Guorui Feng,et al.  Macroblock-level adaptive search range algorithm for motion estimation in multiview video coding , 2009, J. Electronic Imaging.

[19]  Liang-Gee Chen,et al.  Content-Aware Prediction Algorithm With Inter-View Mode Decision for Multiview Video Coding , 2008, IEEE Transactions on Multimedia.

[20]  Mei Yu,et al.  Statistical Early Termination Model for Fast Mode Decision and Reference Frame Selection in Multiview Video Coding , 2012, IEEE Transactions on Broadcasting.

[21]  Jia-Ching Wang,et al.  Fast Mode Decision for H.264/AVC Based on Rate-Distortion Clustering , 2012, IEEE Transactions on Multimedia.

[22]  Kai-Kuang Ma,et al.  Mode-correlation-based early termination mode decision for multi-view video coding , 2010, 2010 IEEE International Conference on Image Processing.

[23]  Tao Yan,et al.  View-Adaptive Motion Estimation and Disparity Estimation for Low Complexity Multiview Video Coding , 2010, IEEE Transactions on Circuits and Systems for Video Technology.

[24]  Zhou Wang,et al.  Multiview Coding Mode Decision With Hybrid Optimal Stopping Model , 2013, IEEE Transactions on Image Processing.

[25]  HOMAS,et al.  Overview of the Stereo and Multiview Video Coding Extensions of the H . 264 / MPEG-4 AVC Standard , 2022 .

[26]  Sergio Bampi,et al.  An adaptive early skip mode decision scheme for multiview video coding , 2010, 28th Picture Coding Symposium.