A Novel Method of Minimizing View Synthesis Distortion Based on Its Non-Monotonicity in 3D Video

In depth-based 3D video, the view synthesis distortion (VSD), is generally measured by modeling the effect of texture and depth errors separately. With such a development, it has been referred that the VSD changes monotonically with respect to to both the texture and depth distortions. In this paper, we find that the VSD does not always change monotonically with them by both theoretical analysis and experimental test, when the effect of the texture and depth errors is considered together. Specifically, first, we prove that the VSD is non-monotonic with the texture distortion. That is, the VSD increases with the increasing texture distortion at higher distortion range but conversely decreases with it at lower range. It is different from the general scenario that only considering the effect of the texture errors. We also analytically depict their relationship with low computational cost and identify the turning point at which the change of the VSD is converted. Second, we confirm that the VSD is always monotonic with the depth distortion, which is consistent with the general scenario that only considering the effect of the depth errors. The non-monotonicity property of the VSD can be utilized to improve the viewing performance of 3D video in relevant applications, since a minimal value of the VSD exists at the turning point. We conduct two applications for this purpose. First, it is used to generate the synthesis view of minimal distortion, which achieves 0.51-dB gain of PSNR on average for the tested scenarios. Second, it is used for lossy compression of texture videos in 3D video, which reduces the coding rate by 24% on average for the tested scenarios, meanwhile, keeps the VSD not increased simultaneously.

[1]  Gary J. Sullivan,et al.  Overview of the Stereo and Multiview Video Coding Extensions of the H.264/MPEG-4 AVC Standard , 2011, Proceedings of the IEEE.

[2]  Qionghai Dai,et al.  Joint Bit Allocation and Rate Control for Coding Multi-View Video Plus Depth Based 3D Video , 2013, IEEE Transactions on Multimedia.

[3]  Wen Gao,et al.  Local Stereo Matching with Improved Matching Cost and Disparity Refinement , 2014, IEEE MultiMedia.

[4]  Seung-Won Jung,et al.  Enhancement of Image and Depth Map Using Adaptive Joint Trilateral Filter , 2013, IEEE Transactions on Circuits and Systems for Video Technology.

[5]  Dong Tian,et al.  View synthesis techniques for 3D video , 2009, Optical Engineering + Applications.

[6]  Qionghai Dai,et al.  Depth Map Coding for View Synthesis Based on Distortion Analyses , 2014, IEEE Journal on Emerging and Selected Topics in Circuits and Systems.

[7]  Masayuki Tanimoto,et al.  3D-TV System with Depth-Image-Based Rendering , 2012 .

[8]  Thomas Wiegand,et al.  3-D Video Representation Using Depth Maps , 2011, Proceedings of the IEEE.

[9]  Roberto Manduchi,et al.  Bilateral filtering for gray and color images , 1998, Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271).

[10]  Heiko Schwarz,et al.  3D video coding using the synthesized view distortion change , 2012, 2012 Picture Coding Symposium.

[11]  Liang Zhang,et al.  Stereoscopic image generation based on depth images for 3D TV , 2005, IEEE Transactions on Broadcasting.

[12]  Hakan Urey,et al.  State of the Art in Stereoscopic and Autostereoscopic Displays , 2011, Proceedings of the IEEE.

[13]  Jaejoon Lee,et al.  Depth Map Coding Based on Synthesized View Distortion Function , 2011, IEEE Journal of Selected Topics in Signal Processing.

[14]  Dong Tian,et al.  On modeling the rendering error in 3D video , 2012, 2012 19th IEEE International Conference on Image Processing.

[15]  Christoph Fehn,et al.  Depth-image-based rendering (DIBR), compression, and transmission for a new approach on 3D-TV , 2004, IS&T/SPIE Electronic Imaging.

[16]  Qingming Huang,et al.  Joint video/depth rate allocation for 3D video coding based on view synthesis distortion model , 2009, Signal Process. Image Commun..

[17]  Lu Fang,et al.  An Analytical Model for Synthesis Distortion Estimation in 3D Video , 2014, IEEE Transactions on Image Processing.

[18]  Peter Kauff,et al.  Three-Dimensional Video Postproduction and Processing , 2011, Proceedings of the IEEE.

[19]  Ying Chen,et al.  Overview of the Multiview and 3D Extensions of High Efficiency Video Coding , 2016, IEEE Transactions on Circuits and Systems for Video Technology.

[20]  C.-C. Jay Kuo,et al.  Efficient Multiview Depth Coding Optimization Based on Allowable Depth Distortion in View Synthesis , 2014, IEEE Transactions on Image Processing.

[21]  Christopher Joseph Pal,et al.  Learning Conditional Random Fields for Stereo , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[22]  Lu Fang,et al.  Depth Error Induced Virtual View Synthesis Distortion Estimation for 3D Video Coding , 2015, 2015 Data Compression Conference.

[23]  C.-C. Jay Kuo,et al.  Rate-Distortion Optimized Rate Control for Depth Map-Based 3-D Video Coding , 2013, IEEE Transactions on Image Processing.

[24]  Antonio Ortega,et al.  Depth map distortion analysis for view rendering and depth coding , 2009, 2009 16th IEEE International Conference on Image Processing (ICIP).

[25]  Zhenzhong Chen,et al.  Depth No-Synthesis-Error Model for View Synthesis in 3-D Video , 2011, IEEE Transactions on Image Processing.

[26]  Nanning Zheng,et al.  Parameter-free view synthesis distortion model with application to depth video coding , 2015, 2015 IEEE International Symposium on Circuits and Systems (ISCAS).

[27]  Ju Liu,et al.  A Novel Distortion Model and Lagrangian Multiplier for Depth Maps Coding , 2014, IEEE Transactions on Circuits and Systems for Video Technology.

[28]  Dong Zhang,et al.  View Synthesis Distortion Estimation With a Graphical Model and Recursive Calculation of Probability Distribution , 2015, IEEE Transactions on Circuits and Systems for Video Technology.

[29]  Chang-Su Kim,et al.  FDQM: Fast Quality Metric for Depth Maps Without View Synthesis , 2015, IEEE Transactions on Circuits and Systems for Video Technology.

[30]  Zhaoyang Lu,et al.  Model-Based Joint Bit Allocation Between Texture Videos and Depth Maps for 3-D Video Coding , 2011, IEEE Transactions on Circuits and Systems for Video Technology.

[31]  Antonio Ortega,et al.  On Dependent Bit Allocation for Multiview Image Coding With Depth-Image-Based Rendering , 2011, IEEE Transactions on Image Processing.

[32]  Yao Wang,et al.  Color-Guided Depth Recovery From RGB-D Data Using an Adaptive Autoregressive Model , 2014, IEEE Transactions on Image Processing.

[33]  Antonio Ortega,et al.  Depth Map Coding Optimization Using Rendered View Distortion for 3D Video Coding , 2015, IEEE Transactions on Image Processing.

[34]  Qionghai Dai,et al.  Free Viewpoint Video Coding With Rate-Distortion Analysis , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[35]  Bruno Macchiavello,et al.  Loss-Resilient Coding of Texture and Depth for Free-Viewpoint Video Conferencing , 2013, IEEE Transactions on Multimedia.

[36]  Feng Wu,et al.  Estimation of Virtual View Synthesis Distortion Toward Virtual View Position , 2016, IEEE Transactions on Image Processing.

[37]  Chang-Su Kim,et al.  Bit Allocation Algorithm With Novel View Synthesis Distortion Model for Multiview Video Plus Depth Coding , 2014, IEEE Transactions on Image Processing.

[38]  张云 Regional bit allocation and rate distortion optimization for multiview depth video coding with View synthesis distortion model , 2013 .

[39]  Antonio Ortega,et al.  Depth map coding with distortion estimation of rendered view , 2010, Electronic Imaging.

[40]  Eero P. Simoncelli,et al.  Image quality assessment: from error visibility to structural similarity , 2004, IEEE Transactions on Image Processing.