Extension of High Efficiency Video Coding (HEVC) for multiview video and depth data

This paper presents an approach for 3D video coding that uses a format in which a small number of views as well as associated depth maps are coded and transmitted. At the receiver side, additional views required for displaying the 3D video on an autostereoscopic display can be generated based on the corresponding decoded signals by using depth image based rendering (DIBR) techniques. In terms of coding technology, the proposed coding scheme represents an extension of High Efficiency Video Coding (HEVC), similar to the Multiview Coding (MVC) extension of H.264/AVC. Besides the well-known disparity-compensated prediction, advanced techniques for inter-view and inter-component prediction, the representation of depth blocks, and the encoder control for depth signals have been developed and integrated. In comparison to simulcasting the different signals using HEVC, the proposed approach provides about 40% and 50% average bit rate savings for a whole test set when configured to comply with a 2- and 3-view scenario, respectively. The proposed codec was submitted as response to a Call for Proposals on 3D Video Technology issued by the ISO/IEC Moving Picture Experts Group (MPEG) and it was ranked as the overall best performing HEVC-based proposal in the related subjective tests.

[1]  Itu-T and Iso Iec Jtc Advanced video coding for generic audiovisual services , 2010 .

[2]  Edson M. Hung,et al.  Efficiency improvements for a geometric-partition-based video coder , 2009, 2009 16th IEEE International Conference on Image Processing (ICIP).

[3]  Jaejoon Lee,et al.  Depth Map Coding Based on Synthesized View Distortion Function , 2011, IEEE Journal of Selected Topics in Signal Processing.

[4]  Antonio Ortega,et al.  Depth map distortion analysis for view rendering and depth coding , 2009, 2009 16th IEEE International Conference on Image Processing (ICIP).

[5]  Aljoscha Smolic,et al.  Multi-View Video Plus Depth Representation and Coding , 2007, 2007 IEEE International Conference on Image Processing.

[6]  G. Bjontegaard,et al.  Calculation of Average PSNR Differences between RD-curves , 2001 .

[7]  Erika Müller,et al.  Sharing of motion vectors in 3D video coding , 2004, 2004 International Conference on Image Processing, 2004. ICIP '04..

[8]  Heiko Schwarz,et al.  3D video coding using the synthesized view distortion change , 2012, 2012 Picture Coding Symposium.

[9]  Detlev Marpe,et al.  3D video: Depth coding based on inter-component prediction of block partitions , 2012, 2012 Picture Coding Symposium.

[10]  Gary J. Sullivan,et al.  Compression performance of high efficiency video coding (HEVC) working draft 4 , 2012, 2012 IEEE International Symposium on Circuits and Systems.

[11]  Heiko Schwarz,et al.  Motion vector inheritance for high efficiency 3D video plus depth coding , 2012, 2012 Picture Coding Symposium.

[12]  Peter H. N. de With,et al.  Platelet-based coding of depth maps for the transmission of multiview images , 2006, Electronic Imaging.

[13]  Heiko Schwarz,et al.  Encoder control for renderable regions in high efficiency multiview video plus depth coding , 2012, 2012 Picture Coding Symposium.

[14]  Heiko Schwarz,et al.  Inter-view prediction of motion data in multiview video coding , 2012, 2012 Picture Coding Symposium.