Reference frame selection for loss-resilient depth map coding in multiview video conferencing

Multiview video in "texture-plus-depth" format enables decoder to synthesize freely chosen intermediate views for enhanced visual experience. Nevertheless, transmission of multiple texture and depth maps over bandwidthconstrained and loss-prone networks is challenging, especially for conferencing applications with stringent deadlines. In this paper, we examine the problem of loss-resilient coding of depth maps by exploiting two observations. First, different depth macroblocks have significantly different error sensitivities with respect to the reconstructed images. Second, unlike texture, the relative overhead of using reference pictures with large prediction distance is low for depth maps. This motivates our approach of assigning a weight to represent the varying error sensitivity of each macroblock and using such weights to guide selection of reference frames. Results show that (1) errors in depth maps in sequence with high motion yields significant drop in quality in reconstructed images, and (2) that the proposed scheme can efficiently maintain the quality of reconstructed images even at relatively high packet loss rates of 3-5%.

[1]  Gene Cheung,et al.  Reference Frame Optimization for Multiple-Path Video Streaming With Complexity Scaling , 2007, IEEE Transactions on Circuits and Systems for Video Technology.

[2]  Zhaozheng Yin,et al.  Improving depth perception with motion parallax and its application in teleconferencing , 2009, 2009 IEEE International Workshop on Multimedia Signal Processing.

[3]  Antonio Ortega,et al.  Depth map distortion analysis for view rendering and depth coding , 2009, 2009 16th IEEE International Conference on Image Processing (ICIP).

[4]  Antonio Ortega,et al.  Transform domain sparsification of depth maps using iterative quadratic programming , 2011, 2011 18th IEEE International Conference on Image Processing.

[5]  Feng Wu,et al.  Channel Distortion Modeling for Multi-View Video Transmission Over Packet-Switched Networks , 2011, IEEE Transactions on Circuits and Systems for Video Technology.

[6]  Yo-Sung Ho,et al.  H.264-Based Depth Map Sequence Coding Using Motion Information of Corresponding Texture Video , 2006, PSIVT.

[7]  G. Bjontegaard,et al.  Calculation of Average PSNR Differences between RD-curves , 2001 .

[8]  Christophe Tillier,et al.  Motion Vector Sharing and Bitrate Allocation for 3D Video-Plus-Depth Coding , 2009, EURASIP J. Adv. Signal Process..

[9]  Leonard McMillan,et al.  Post-rendering 3D warping , 1997, SI3D.

[10]  Masayuki Tanimoto,et al.  Multiview Imaging and 3DTV , 2007, IEEE Signal Processing Magazine.