Stereoscopic Video Quality Assessment Using Binocular Energy

Stereoscopic imaging is becoming increasingly popular. However, to ensure the best quality of experience, there is a need to develop more robust and accurate objective metrics for stereoscopic content quality assessment. Existing stereoscopic image and video metrics are either extensions of conventional 2-D metrics (with added depth or disparity information) or are based on relatively simple perceptual models. Consequently, they tend to lack the accuracy and robustness required for stereoscopic content quality assessment. This paper introduces full-reference stereoscopic image and video quality metrics based on a human visual system (HVS) model incorporating important physiological findings on binocular vision. The proposed approach is based on the following three contributions. First, it introduces a novel HVS model extending previous models to include the phenomena of binocular suppression and recurrent excitation. Second, an image quality metric based on the novel HVS model is proposed. Finally, an optimized temporal pooling strategy is introduced to extend the metric to the video domain. Both image and video quality metrics are obtained via a training procedure to establish a relationship between subjective scores and objective measures of the HVS model. The metrics are evaluated using publicly available stereoscopic image/video databases as well as a new stereoscopic video database. An extensive experimental evaluation demonstrates the robustness of the proposed quality metrics. This indicates a considerable improvement with respect to the state-of-the-art with average correlations with subjective scores of 0.86 for the proposed stereoscopic image metric and 0.89 and 0.91 for the proposed stereoscopic video metrics.

[1]  Narciso García,et al.  NAMA3DS1-COSPAD1: Subjective video quality assessment database on coding conditions introducing freely available high quality 3D stereoscopic sequences , 2012, 2012 Fourth International Workshop on Quality of Multimedia Experience.

[2]  D. Pollen,et al.  Interneuronal interaction between members of quadrature phase and anti-phase pairs in the cat's visual cortex , 1992, Vision Research.

[3]  G. Peyré Géométrie multi-échelles pour les images et les textures , 2005 .

[4]  Gregory C. DeAngelis,et al.  Depth is encoded in the visual cortex by a specialized receptive field structure , 1991, Nature.

[5]  Ingrid Heynderickx,et al.  Performance evaluation of 3D-TV systems , 2008, Electronic Imaging.

[6]  Eero P. Simoncelli,et al.  Image quality assessment: from error visibility to structural similarity , 2004, IEEE Transactions on Image Processing.

[7]  J. Schanda,et al.  Colorimetry : understanding the CIE system , 2007 .

[8]  J. Movshon,et al.  Spatial and temporal contrast sensitivity of neurones in areas 17 and 18 of the cat's visual cortex. , 1978, The Journal of physiology.

[9]  D. Hubel,et al.  Receptive fields, binocular interaction and functional architecture in the cat's visual cortex , 1962, The Journal of physiology.

[10]  D. Ferster,et al.  Computational Diversity in Complex Cells of Cat Primary Visual Cortex , 2007, The Journal of Neuroscience.

[11]  Mohamed-Chaker Larabi,et al.  A perceptual metric for stereoscopic image quality assessment based on the binocular energy , 2013, Multidimens. Syst. Signal Process..

[12]  Patrick Le Callet,et al.  Stereoscopic images quality assessment , 2007, 2007 15th European Signal Processing Conference.

[13]  R. Shapley,et al.  An egalitarian network model for the emergence of simple and complex cells in visual cortex , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[14]  Kwanghoon Sohn,et al.  No-Reference Quality Assessment for Stereoscopic Images Based on Binocular Quality Perception , 2014, IEEE Transactions on Circuits and Systems for Video Technology.

[15]  Richard Baraniuk,et al.  The Dual-tree Complex Wavelet Transform , 2007 .

[16]  G. F. Cooper,et al.  The spatial selectivity of the visual cells of the cat , 1969, The Journal of physiology.

[17]  Touradj Ebrahimi,et al.  A perceptual quality metric for stereoscopic crosstalk perception , 2010, 2010 IEEE International Conference on Image Processing.

[18]  Qionghai Dai,et al.  Image and Video Denoising Using Adaptive Dual-Tree Discrete Wavelet Packets , 2009, IEEE Transactions on Circuits and Systems for Video Technology.

[19]  Janko Calic,et al.  A full-reference stereoscopic image quality metric based on binocular energy and regression analysis , 2015, 2015 3DTV-Conference: The True Vision - Capture, Transmission and Display of 3D Video (3DTV-CON).

[20]  Junyong You,et al.  PERCEPTUAL QUALITY ASSESSMENT FOR STEREOSCOPIC IMAGES BASED ON 2 D IMAGE QUALITY METRICS AND DISPARITY ANALYSIS , 2010 .

[21]  Kwanghoon Sohn,et al.  Depth map quality metric for three-dimensional video , 2009, Electronic Imaging.

[22]  Roushain Akhter,et al.  No-reference stereoscopic image quality assessment , 2010, Electronic Imaging.

[23]  Do-Kyoung Kwon,et al.  Full-reference quality assessment of stereopairs accounting for rivalry , 2013, Signal Process. Image Commun..

[24]  Touradj Ebrahimi,et al.  A comprehensive database and subjective evaluation methodology for quality of experience in stereoscopic video , 2010, Electronic Imaging.

[25]  Chaminda T. E. R. Hewage,et al.  Reduced-reference quality metric for 3D depth map transmission , 2010, 2010 3DTV-Conference: The True Vision - Capture, Transmission and Display of 3D Video.

[26]  Weisi Lin,et al.  Perceptual Full-Reference Quality Assessment of Stereoscopic Images by Considering Binocular Visual Characteristics , 2013, IEEE Transactions on Image Processing.

[27]  Ahmet M. Kondoz,et al.  Toward an Impairment Metric for Stereoscopic Video: A Full-Reference Video Quality Metric to Assess Compressed Stereoscopic Video , 2013, IEEE Transactions on Image Processing.

[28]  A. Gotchev,et al.  Quality assessment of 3D video in rate allocation experiments , 2008, 2008 IEEE International Symposium on Consumer Electronics.

[29]  Steffen Wulf,et al.  The impact of temporal pooling and spatio-temporal quality interaction on the prediction accuracy of video quality metrics , 2013, 2013 Signal Processing: Algorithms, Architectures, Arrangements, and Applications (SPA).

[30]  Patrick Le Callet,et al.  Quality Assessment of Stereoscopic Images , 2008, EURASIP J. Image Video Process..

[31]  Ahmet M. Kondoz,et al.  An improved model of binocular energy calculation for full-reference stereoscopic image quality assessment , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[32]  Qionghai Dai,et al.  Image Coding Using Dual-Tree Discrete Wavelet Transform , 2008, IEEE Transactions on Image Processing.

[33]  A. Aksay,et al.  Towards compound stereo-video quality metric: a specific encoder-based framework , 2006, 2006 IEEE Southwest Symposium on Image Analysis and Interpretation.

[34]  S. Mallat,et al.  Orthogonal bandelet bases for geometric images approximation , 2008 .

[35]  Weisi Lin,et al.  Perceptual visual quality metrics: A survey , 2011, J. Vis. Commun. Image Represent..

[36]  A. Lee Swindlehurst,et al.  IEEE Journal of Selected Topics in Signal Processing Inaugural Issue: [editor-in-chief's message] , 2007, J. Sel. Topics Signal Processing.

[37]  Kwanghoon Sohn,et al.  An Objective Video Quality Metric for Compressed Stereoscopic Video , 2012, Circuits Syst. Signal Process..