SSIM-based joint-bit allocation for 3D video coding

The quality of a 3D video display depends on virtual view synthesis process which is affected by the bit allocation criterion. The performance of a bit allocation algorithm is dependent on various encoding parameters like quantization parameter, motion vector, mode selection, and so on. Rate-distortion optimization (RDO) is used to efficiently allocate bits with minimum distortion. In 3D video, rate-distortion (RD) property of synthesized view is used to assign bits between texture video and depth map. Existing literature on bit allocation methods use mean square error (MSE) as distortion metric which is not suitable for measuring perceptual quality. In this paper, we propose structural similarity (SSIM)-based joint bit allocation scheme to enhance visual quality of 3D video. Perceptual quality of a synthesized view depends on texture and depth map quality. Thus, SSIM-based RDO is performed on both texture and depth map where SSIM is used as distortion metric in mode decision and motion estimation. SSIM-based distortion model for synthesized view is determined experimentally. As SSIM cannot be related to quantization step, SSIM-MSE relation is used to convert distortion model in terms of MSE. The Lagrange multiplier method is used to solve the bit allocation problem. The proposed algorithm is implemented using 3DV-ATM as well as HEVC. RD curves show reduction in bitrate with an improvement in SSIM of synthesized view.

[1]  Jaejoon Lee,et al.  Fast joint bit-allocation between texture and depth maps for 3D video coding , 2013, 2013 IEEE International Conference on Consumer Electronics (ICCE).

[2]  King Ngi Ngan,et al.  Perceptual video coding: Challenges and approaches , 2010, 2010 IEEE International Conference on Multimedia and Expo.

[3]  Yilin Chang,et al.  Model based bit allocation between texture images and depth maps , 2010, 2010 International Conference on Computer and Communication Technologies in Agriculture Engineering.

[4]  Gary J. Sullivan,et al.  Overview of the High Efficiency Video Coding (HEVC) Standard , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[5]  Zongliang Gan,et al.  Structural similarity optimal MB layer rate control for H.264 , 2011, 2011 International Conference on Wireless Communications and Signal Processing (WCSP).

[6]  Qingming Huang,et al.  Joint video/depth rate allocation for 3D video coding based on view synthesis distortion model , 2009, Signal Process. Image Commun..

[7]  Chao Yang,et al.  Adaptive Bit Allocation for 3D Video Coding , 2017, Circuits Syst. Signal Process..

[8]  Mei Yu,et al.  View synthesis distortion model optimization for bit allocation in three-dimensional video coding , 2011 .

[9]  Lai-Man Po,et al.  A New Rate-Distortion Optimization Using Structural Information in H.264 I-Frame Encoder , 2005, ACIVS.

[10]  Zhaoyang Lu,et al.  Model-Based Joint Bit Allocation Between Texture Videos and Depth Maps for 3-D Video Coding , 2011, IEEE Transactions on Circuits and Systems for Video Technology.

[11]  Chuohao Yeo,et al.  On Rate Distortion Optimization Using SSIM , 2013, IEEE Trans. Circuits Syst. Video Technol..

[12]  Wen Gao,et al.  Spatio-temporal ssim index for video quality assessment , 2012, 2012 Visual Communications and Image Processing.

[13]  Qionghai Dai,et al.  Joint Bit Allocation and Rate Control for Coding Multi-View Video Plus Depth Based 3D Video , 2013, IEEE Transactions on Multimedia.

[14]  Homer H. Chen,et al.  Perceptual Rate-Distortion Optimization Using Structural Similarity Index as Quality Metric , 2010, IEEE Transactions on Circuits and Systems for Video Technology.

[15]  Eero P. Simoncelli,et al.  Image quality assessment: from error visibility to structural similarity , 2004, IEEE Transactions on Image Processing.

[16]  Dong Tian,et al.  View synthesis techniques for 3D video , 2009, Optical Engineering + Applications.

[17]  Homer H. Chen,et al.  Perceptual-based coding mode decision , 2010, Proceedings of 2010 IEEE International Symposium on Circuits and Systems.

[18]  Christoph Fehn,et al.  Depth-image-based rendering (DIBR), compression, and transmission for a new approach on 3D-TV , 2004, IS&T/SPIE Electronic Imaging.

[19]  Thomas Wiegand,et al.  3-D Video Representation Using Depth Maps , 2011, Proceedings of the IEEE.

[20]  C. Fehn A 3 DTV Approach Using Depth-Image-Based Rendering ( DIBR ) , 2003 .

[21]  Richard Szeliski,et al.  High-quality video view interpolation using a layered representation , 2004, SIGGRAPH 2004.

[22]  G. Bjontegaard,et al.  Calculation of Average PSNR Differences between RD-curves , 2001 .

[23]  Aidong Men,et al.  Efficient rate-distortion optimization for HEVC using SSIM and motion homogeneity , 2013, 2013 Picture Coding Symposium (PCS).

[24]  Heiko Schwarz,et al.  3D High-Efficiency Video Coding for Multi-View Video and Depth Data , 2013, IEEE Transactions on Image Processing.

[25]  Homer H. Chen,et al.  Improving video coding quality by perceptual rate-distortion optimization , 2010, 2010 IEEE International Conference on Multimedia and Expo.

[26]  Mei Yu,et al.  Joint video/depth bit allocation for 3D video coding based on distortion of synthesized view , 2012, IEEE international Symposium on Broadband Multimedia Systems and Broadcasting.

[27]  Prabir Kumar Biswas,et al.  Rate distortion optimization using SSIM for 3D video coding , 2016, 2016 23rd International Conference on Pattern Recognition (ICPR).

[28]  Hakan Urey,et al.  State of the Art in Stereoscopic and Autostereoscopic Displays , 2011, Proceedings of the IEEE.

[29]  Yannick Morvan,et al.  Joint depth/texture bit-allocation for multi-view video compression , 2010 .