Fast inter-mode decision in multi-view video plus depth coding

Motion and disparity estimations are employed in Multi-view Video Coding (MVC) to remove redundancies present between temporal and different viewpoint frames, respectively, in both the color and the depth multi-view videos. These constitute the major computational expensive tasks of the video encoder, as iterative search for the optimal mode and its appropriate compensation vectors is employed to reduce the Rate-Distortion Optimization (RDO) cost function. This paper proposes a solution to limit the number of modes that are tested for RDO to encode the inter-view predicted views. The decision is based on the encoded information obtained from the corresponding Macroblock in the Base view, identified accurately by using the multi-view geometry together with the depth data. Results show that this geometric technique manages to reduce about 70% of the estimation's computational time and can also be used with fast geometric estimations to reduce up to 95% of the original encoding time. These gains are obtained with little degradation on the multi-view video quality for both color and depth MVC.

[1]  Kai-Kuang Ma,et al.  Correction to "a new diamond search algorithm for fast block-matching motion estimation" , 2000, IEEE Trans. Image Process..

[2]  Tao Yan,et al.  View-Adaptive Motion Estimation and Disparity Estimation for Low Complexity Multiview Video Coding , 2010, IEEE Transactions on Circuits and Systems for Video Technology.

[3]  Kai-Kuang Ma,et al.  A new diamond search algorithm for fast block-matching motion estimation , 2000, IEEE Trans. Image Process..

[4]  Toshiaki Fujii,et al.  View Generation with 3D Warping Using Depth Information for FTV , 2008, 2008 3DTV Conference: The True Vision - Capture, Transmission and Display of 3D Video.

[5]  Reuben A. Farrugia,et al.  Exploiting depth information for fast motion and disparity estimation in Multi-view Video Coding , 2011, 2011 3DTV Conference: The True Vision - Capture, Transmission and Display of 3D Video (3DTV-CON).

[6]  N. Atzpadin,et al.  Depth map creation and image-based rendering for advanced 3DTV services providing interoperability and scalability , 2007, Signal Process. Image Commun..

[7]  T. Wiegand,et al.  Efficient Compression of Multi-View Depth Data Based on MVC , 2007, 2007 3DTV Conference.

[8]  Hao Yin,et al.  Video digital watermark research based on MPEG4 , 2007, Fourth International Conference on Image and Graphics (ICIG 2007).

[9]  Wen Gao,et al.  An epipolar resticted inter-mode selection for stereoscopic video encoding , 2010, 28th Picture Coding Symposium.

[10]  Gary J. Sullivan,et al.  Rate-constrained coder control and comparison of video coding standards , 2003, IEEE Trans. Circuits Syst. Video Technol..

[11]  G. Bjontegaard,et al.  Calculation of Average PSNR Differences between RD-curves , 2001 .

[12]  Aljoscha Smolic,et al.  Efficient Compression of Multi-View Video Exploiting Inter-View Dependencies Based on H.264/MPEG4-AVC , 2006, 2006 IEEE International Conference on Multimedia and Expo.

[13]  Kai-Kuang Ma,et al.  Fast Mode Decision for H.264/AVC Based on Macroblock Motion Activity , 2009, IEEE Transactions on Circuits and Systems for Video Technology.

[14]  Reuben A. Farrugia,et al.  Exploiting depth information for efficient Multi-View Video Coding , 2011, 2011 IEEE International Conference on Multimedia and Expo.

[15]  Richard Szeliski,et al.  High-quality video view interpolation using a layered representation , 2004, SIGGRAPH 2004.

[16]  David Bull,et al.  Video Coding for Mobile Communications: Efficiency, Complexity and Resilience , 2002 .

[17]  Reuben A. Farrugia,et al.  Fast disparity estimation for Multi-view plus depth video coding , 2011, 2011 Visual Communications and Image Processing (VCIP).

[18]  David Bull,et al.  Video Coding for Mobile Communications , 2013 .

[19]  Reuben A. Farrugia,et al.  Exploiting depth information for fast multi-view video coding , 2010, 28th Picture Coding Symposium.