Fast Mode Decision Algorithm Through Inter-View Rate-Distortion Prediction for Multiview Video Coding System

Multiview video coding (MVC) has attracted great attention from industry and research institutes. MVC is used to encode stereoscopic video streams for 3D playout systems such as 3D television, digital cinema, and IP network applications. MVC is an extension of H.264/AVC that improves coding performance for multiview video. However, because it must process much more data than single-view video coding, MVC takes considerably longer to encode. Speed-up algorithms are therefore essential for realizing related applications. This paper presents a fast mode decision algorithm that reduces the high computational complexity of MVC. The proposed approach reduces the number of candidate modes and makes the mode decision process more efficient. The minimum and maximum values of the rate-distortion cost (RD cost) in the previously encoded view are used to compute a threshold for each mode in the current view. Compared with joint multiview video coding, the experimental results demonstrate that the proposed algorithm saves an average of 79% of the encoding time with a negligible increase in bit rate and a negligible decrease in peak signal-to-noise ratio.
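The abstract does not give the threshold formula, so the following is only a rough sketch of how a per-mode threshold derived from the previous view's minimum and maximum RD costs might be used to stop the mode search early. The linear interpolation with weight `alpha`, the early-termination rule, the mode names, and the function names `mode_thresholds` and `fast_mode_decision` are all assumptions made for illustration and are not taken from the paper.

```python
from typing import Callable, Dict, List, Optional, Sequence, Tuple


def mode_thresholds(prev_view_costs: Dict[str, List[float]],
                    alpha: float = 0.5) -> Dict[str, float]:
    """Derive one threshold per mode from the previous view's RD costs.

    ASSUMPTION: the threshold is interpolated linearly between the minimum
    and maximum RD cost observed for that mode; the paper's actual formula
    is not stated in the abstract.
    """
    thresholds: Dict[str, float] = {}
    for mode, costs in prev_view_costs.items():
        if not costs:
            continue  # no statistics for this mode, so no threshold
        lo, hi = min(costs), max(costs)
        thresholds[mode] = lo + alpha * (hi - lo)
    return thresholds


def fast_mode_decision(modes: Sequence[str],
                       rd_cost: Callable[[str], float],
                       thresholds: Dict[str, float]
                       ) -> Tuple[Optional[str], float, List[str]]:
    """Test candidate modes in order and stop early when one is good enough.

    ASSUMPTION: the search stops as soon as a tested mode's RD cost is at or
    below that mode's threshold. Returns the best mode found, its cost, and
    the list of modes that were actually evaluated.
    """
    best_mode: Optional[str] = None
    best_cost = float("inf")
    tested: List[str] = []
    for mode in modes:
        cost = rd_cost(mode)
        tested.append(mode)
        if cost < best_cost:
            best_mode, best_cost = mode, cost
        if mode in thresholds and cost <= thresholds[mode]:
            break  # remaining candidate modes are skipped
    return best_mode, best_cost, tested


# Hypothetical RD costs collected per mode in the previously encoded view.
prev_costs = {"SKIP": [100.0, 300.0], "INTER16x16": [200.0, 600.0]}
th = mode_thresholds(prev_costs, alpha=0.5)

# Hypothetical RD costs of one macroblock in the current view.
current = {"SKIP": 450.0, "INTER16x16": 380.0, "INTER8x8": 360.0}
best, cost, tested = fast_mode_decision(
    ["SKIP", "INTER16x16", "INTER8x8"], current.__getitem__, th)
```

In this toy run the search stops after `INTER16x16`, so `INTER8x8` is never evaluated. The time saving comes from skipped evaluations like this one, at the risk of occasionally missing a slightly cheaper mode.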
