An adaptive texture-depth rate allocation estimation technique for low latency multi-view video plus depth transmission

This paper presents an adaptive texture-depth target bit rate allocation estimation technique for low latency multi-view video plus depth transmission using a multi-regression model. The proposed technique employs the prediction mode distribution of the macroblocks at the discontinuity regions of the depth map video to estimate the optimal texture-depth target bit rate allocation considering the total available bit rate. This technique was tested using various standard test sequences and has shown efficacy as the model is able to estimate, in real-time, the optimal texture-depth rate allocation with an absolute mean estimation error of 2.5% and a standard deviation of 2.2%. Moreover, it allows the texture-depth rate allocation to be adapted to the video sequence with good tracking performance, allowing the correct handling of scene changes.

[1]  Wen Gao,et al.  Rate-distortion analysis for H.264/AVC video coding and its application to rate control , 2005, IEEE Transactions on Circuits and Systems for Video Technology.

[2]  Mamta Juneja,et al.  Performance Evaluation of Edge Detection Techniques for Images in Spatial Domain , 2009 .

[3]  Zhi Liu,et al.  Selective Disparity Estimation and Variable Size Motion Estimation Based on Motion Homogeneity for Multi-View Coding , 2009, IEEE Transactions on Broadcasting.

[4]  Yo-Sung Ho,et al.  Overview of Multi-view Video Coding , 2007, 2007 14th International Workshop on Systems, Signals and Image Processing and 6th EURASIP Conference focused on Speech and Image Processing, Multimedia Communications and Services.

[5]  Aljoscha Smolic,et al.  Efficient Prediction Structures for Multiview Video Coding , 2007, IEEE Transactions on Circuits and Systems for Video Technology.

[6]  Aljoscha Smolic,et al.  Multi-View Video Plus Depth Representation and Coding , 2007, 2007 IEEE International Conference on Image Processing.

[7]  John F. Canny,et al.  A Computational Approach to Edge Detection , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  Aljoscha Smolic,et al.  Interactive 3-D Video Representation and Coding Technologies , 2005, Proceedings of the IEEE.

[9]  Zhaoyang Lu,et al.  Model-Based Joint Bit Allocation Between Texture Videos and Depth Maps for 3-D Video Coding , 2011, IEEE Transactions on Circuits and Systems for Video Technology.

[10]  Antonio Ortega,et al.  On Dependent Bit Allocation for Multiview Image Coding With Depth-Image-Based Rendering , 2011, IEEE Transactions on Image Processing.

[11]  Yannick Morvan,et al.  Joint depth/texture bit-allocation for multi-view video compression , 2010 .

[12]  Aljoscha Smolic,et al.  Toward a 3D video format for auto-stereoscopic displays , 2008, Optical Engineering + Applications.

[13]  Christoph Fehn,et al.  Depth-image-based rendering (DIBR), compression, and transmission for a new approach on 3D-TV , 2004, IS&T/SPIE Electronic Imaging.

[14]  Christine Guillemot,et al.  Bit-rate allocation for multi-view video plus depth , 2011, 2011 3DTV Conference: The True Vision - Capture, Transmission and Display of 3D Video (3DTV-CON).

[15]  Qingming Huang,et al.  Joint video/depth rate allocation for 3D video coding based on view synthesis distortion model , 2009, Signal Process. Image Commun..

[16]  M. Norusis IBM SPSS Statistics 19 Advanced Statistical Procedures Companion , 2011 .