No-Reference Depth Assessment Based on Edge Misalignment Errors for T + D Images

The quality of depth is crucial in all depth-based applications. Unfortunately, the error-free ground truth is often unattainable for depth. Therefore, no-reference quality assessment is very much desired. This paper presents a novel depth quality assessment scheme that is completely different from conventional approaches. In particular, this scheme focuses on depth edge misalignment errors in texture-plus-depth (T + D) images and develops a robust method to detect them. Based on the detected misalignments, a no-reference metric is calculated to evaluate the quality of depth maps. In the proposed scheme, misalignments are detected by matching texture and depth edges through three constraints: 1) spatial similarity; 2) edge orientation similarity; and 3) segment length similarity. Furthermore, the matching is performed on edge segments instead of individual pixels, which enables robust edge matching. Experimental results demonstrate that the proposed scheme can detect misalignment errors accurately. The proposed no-reference depth quality metric is highly consistent with the full-reference metric, and is also well-correlated with the quality of synthesized virtual views. Moreover, the proposed scheme can also use the detected edge misalignments to facilitate depth enhancement in various practical texture-plus-depth-based applications.

[1]  Ahmet M. Kondoz,et al.  A new reduced reference metric for color plus depth 3D video , 2014, J. Vis. Commun. Image Represent..

[2]  Andreas Klaus,et al.  Segment-Based Stereo Matching Using Belief Propagation and a Self-Adapting Dissimilarity Measure , 2006, 18th International Conference on Pattern Recognition (ICPR'06).

[3]  Patrick Le Callet,et al.  Reliability of 2D quality assessment methods for synthesized views evaluation in stereoscopic viewing conditions , 2012, 2012 3DTV-Conference: The True Vision - Capture, Transmission and Display of 3D Video (3DTV-CON).

[4]  Gozde Bozdagi Akar,et al.  An abstraction based reduced reference depth perception metric for 3D video , 2012, 2012 19th IEEE International Conference on Image Processing.

[5]  Ahmet M. Kondoz,et al.  Prediction of stereoscopic video quality using objective quality models of 2-D video , 2008 .

[6]  Ju Liu,et al.  Coding Distortion Elimination of Virtual View Synthesis for 3D Video System: Theoretical Analyses and Implementation , 2012, IEEE Transactions on Broadcasting.

[7]  Ja-Ling Wu,et al.  Quality Assessment of Stereoscopic 3D Image Compression by Binocular Integration Behaviors , 2014, IEEE Transactions on Image Processing.

[8]  Li Yu,et al.  Structural similarity-based synthesized view distortion estimation for depth map coding , 2012, IEEE Transactions on Consumer Electronics.

[9]  Gustavo Olague,et al.  The Infection Algorithm: An Artificial Epidemic Approach for Dense Stereo Correspondence , 2006, Artificial Life.

[10]  N. Otsu A threshold selection method from gray level histograms , 1979 .

[11]  Lifeng Sun,et al.  Virtual support window for adaptive-weight stereo matching , 2011, 2011 Visual Communications and Image Processing (VCIP).

[12]  Fernando Jaureguizar,et al.  Subjective assessment of the impact of transmission errors in 3DTV compared to HDTV , 2011, 2011 3DTV Conference: The True Vision - Capture, Transmission and Display of 3D Video (3DTV-CON).

[13]  Antonio Ortega,et al.  Depth map coding with distortion estimation of rendered view , 2010, Electronic Imaging.

[14]  Alan C. Bovik,et al.  Range image quality assessment by Structural Similarity , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[15]  Feng Qi,et al.  Stereoscopic video quality assessment based on stereo just-noticeable difference model , 2013, 2013 IEEE International Conference on Image Processing.

[16]  Carsten Rother,et al.  A stereo approach that handles the matting problem via image warping , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[17]  Eero P. Simoncelli,et al.  Image quality assessment: from error visibility to structural similarity , 2004, IEEE Transactions on Image Processing.

[18]  Alan C. Bovik,et al.  No-Reference Quality Assessment of Natural Stereopairs , 2013, IEEE Transactions on Image Processing.

[19]  Shipeng Li,et al.  Texture-assisted Kinect depth inpainting , 2012, 2012 IEEE International Symposium on Circuits and Systems.

[20]  Stefano Tubaro,et al.  No-reference quality metric for depth maps , 2013, 2013 IEEE International Conference on Image Processing.

[21]  Julian Eggert,et al.  A Two-Stage Correlation Method for Stereoscopic Depth Estimation , 2010, 2010 International Conference on Digital Image Computing: Techniques and Applications.

[22]  Lu Fang,et al.  An Analytical Model for Synthesis Distortion Estimation in 3D Video , 2014, IEEE Transactions on Image Processing.

[23]  Zhenzhong Chen,et al.  Depth No-Synthesis-Error Model for View Synthesis in 3-D Video , 2011, IEEE Transactions on Image Processing.

[24]  Xiangyang Ji,et al.  Quality assessment of 3D asymmetric view coding using spatial frequency dominance model , 2009, 2009 3DTV Conference: The True Vision - Capture, Transmission and Display of 3D Video.

[25]  Dong Tian,et al.  Boundary Artifact Reduction in View Synthesis of 3D Video: From Perspective of Texture-Depth Alignment , 2011, IEEE Transactions on Broadcasting.

[26]  Thomas Wiegand,et al.  3D Video and Free Viewpoint Video - Technologies, Applications and MPEG Standards , 2006, 2006 IEEE International Conference on Multimedia and Expo.

[27]  Li Yu,et al.  No-reference depth quality assessment for texture-plus-depth images , 2014, 2014 IEEE International Conference on Multimedia and Expo (ICME).

[28]  Zhou Wang,et al.  Modern Image Quality Assessment , 2006, Modern Image Quality Assessment.

[29]  Touradj Ebrahimi,et al.  Quality assessment of a stereo pair formed from decoded and synthesized views using objective metrics , 2012, 2012 3DTV-Conference: The True Vision - Capture, Transmission and Display of 3D Video (3DTV-CON).

[30]  Alan C. Bovik,et al.  3D Visual Discomfort Predictor: Analysis of Disparity and Neural Activity Statistics , 2015, IEEE Transactions on Image Processing.

[31]  Stefano Mattoccia,et al.  Linear stereo matching , 2011, 2011 International Conference on Computer Vision.

[32]  Richard Szeliski,et al.  A Taxonomy and Evaluation of Dense Two-Frame Stereo Correspondence Algorithms , 2001, International Journal of Computer Vision.

[33]  Tianli Yu,et al.  Efficient Message Representations for Belief Propagation , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[34]  Filippo Speranza,et al.  Perceived Picture Quality of Frame-Compatible 3DTV Video Formats , 2012, 2012 IEEE International Conference on Multimedia and Expo.

[35]  Qingxiong Yang,et al.  Near Real-time Stereo for Weakly-Textured Scenes , 2008, BMVC.

[36]  Zhaoyang Lu,et al.  Model-Based Joint Bit Allocation Between Texture Videos and Depth Maps for 3-D Video Coding , 2011, IEEE Transactions on Circuits and Systems for Video Technology.

[37]  Ahmet M. Kondoz,et al.  Quality Evaluation of Color Plus Depth Map-Based Stereoscopic Video , 2009, IEEE Journal of Selected Topics in Signal Processing.

[38]  Kwanghoon Sohn,et al.  Stereoscopic image quality metric based on binocular perception model , 2012, 2012 19th IEEE International Conference on Image Processing.

[39]  Michael F. Cohen,et al.  Digital photography with flash and no-flash image pairs , 2004, ACM Trans. Graph..

[40]  Weisi Lin,et al.  Perceptual Full-Reference Quality Assessment of Stereoscopic Images by Considering Binocular Visual Characteristics , 2013, IEEE Transactions on Image Processing.

[41]  Ahmet M. Kondoz,et al.  Toward an Impairment Metric for Stereoscopic Video: A Full-Reference Video Quality Metric to Assess Compressed Stereoscopic Video , 2013, IEEE Transactions on Image Processing.

[42]  Yutaka Ishibashi,et al.  QoE assessment in tele-operation with 3D video and haptic media , 2011, 2011 IEEE International Conference on Multimedia and Expo.

[43]  Andrew W. Fitzgibbon,et al.  KinectFusion: real-time 3D reconstruction and interaction using a moving depth camera , 2011, UIST.

[44]  Richard Szeliski,et al.  High-quality video view interpolation using a layered representation , 2004, SIGGRAPH 2004.

[45]  Lai-Man Po,et al.  Depth map misalignment correction and dilation for DIBR view synthesis , 2013, Signal Process. Image Commun..

[46]  Chaminda T. E. R. Hewage,et al.  Reduced-reference quality evaluation for compressed depth maps associated with colour plus depth 3D video , 2010, 2010 IEEE International Conference on Image Processing.

[47]  A. Murat Tekalp,et al.  Quality assessment of asymmetric stereo video coding , 2010, 2010 IEEE International Conference on Image Processing.

[48]  Alan C. Bovik,et al.  Oriented Correlation Models of Distorted Natural Images With Application to Natural Stereopair Quality Evaluation , 2015, IEEE Transactions on Image Processing.