A depth map rate control algorithm for HEVC Multi-View Video plus Depth

In this paper, we present a depth map rate control algorithm for the High Efficiency Video Coding (HEVC) Multi-View Video plus Depth (MVD) representation. The proposed algorithm is based on a bit allocation refinement technique for key frames together with a depth-map inspired R-λ model coupled with an adaptive clipping algorithm designed to exploit the depth map characteristics. This results in improved video quality of the synthesized views while maintaining depth map rate control accuracy. This scheme was tested using various standard test sequences and has shown efficacy with an average improvement of the synthesized view Peak Signal-to-Noise Ratio (PSNR) and Bjøntegaard Delta PSNR (BD-PSNR) of 1.15% and 0.45dB respectively, whilst achieving a bit rate error reduction of 0.2% when compared to the reference rate control algorithm implemented in MV-HEVC.

[1]  Wen Gao,et al.  Efficient bit allocation and CTU level rate control for High Efficiency Video Coding , 2013, 2013 Picture Coding Symposium (PCS).

[2]  Ying Chen,et al.  HEVC-Compatible Extensions for Advanced Coding of 3D and Multiview Video , 2015, 2015 Data Compression Conference.

[3]  King Ngi Ngan,et al.  An Efficient Frame-Content Based Intra Frame Rate Control for High Efficiency Video Coding , 2015, IEEE Signal Processing Letters.

[4]  Wen Gao,et al.  Adaptive frame level rate control in 3D-HEVC , 2014, 2014 IEEE Visual Communications and Image Processing Conference.

[5]  Lin Sun,et al.  An adaptive frame complexity based rate quantization model for intra-frame rate control of High Efficiency Video Coding (HEVC) , 2012, Proceedings of The 2012 Asia Pacific Signal and Information Processing Association Annual Summit and Conference.

[6]  Aljoscha Smolic,et al.  Toward a 3D video format for auto-stereoscopic displays , 2008, Optical Engineering + Applications.

[7]  Wen Gao,et al.  Laplace distribution based CTU level rate control for HEVC , 2013, 2013 Visual Communications and Image Processing (VCIP).

[8]  John F. Canny,et al.  A Computational Approach to Edge Detection , 1986, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[9]  Carl J. Debono,et al.  An adaptive Lagrange multiplier technique for multi-view video plus depth coding , 2013, 2013 Picture Coding Symposium (PCS).

[10]  Zhi Liu,et al.  Selective Disparity Estimation and Variable Size Motion Estimation Based on Motion Homogeneity for Multi-View Coding , 2009, IEEE Transactions on Broadcasting.

[11]  Christoph Fehn,et al.  Depth-image-based rendering (DIBR), compression, and transmission for a new approach on 3D-TV , 2004, IS&T/SPIE Electronic Imaging.

[12]  Aljoscha Smolic,et al.  Efficient Prediction Structures for Multiview Video Coding , 2007, IEEE Transactions on Circuits and Systems for Video Technology.

[13]  Aljoscha Smolic,et al.  Multi-View Video Plus Depth Representation and Coding , 2007, 2007 IEEE International Conference on Image Processing.