Visual discomfort prediction on stereoscopic 3D images without explicit disparities

Almost all existing 3D visual discomfort prediction models are based, at least in part, on features that are extracted from computed disparity maps. These include such estimated quantities such as the maximum disparity, disparity range, disparity energy and other measures of the disparity distribution. A common first step when implementing a 3D visual discomfort model is some form of disparity calculation, whence the accuracy of prediction largely depends on the accuracy of the disparity result. Unfortunately, most algorithms that compute disparity maps are expensive, and are not guaranteed to deliver sufficiently accurate or perceptually relevant disparity data. This raises the question of whether it is possible to build a 3D discomfort prediction model without explicit disparity calculation. Towards this possibility, we have developed a new feature map, called the percentage of un-linked pixels (PUP), that is descriptive of the presence of disparity, and which can be used to accurately predict experienced 3D visual discomfort without the need for actually calculating disparity values. Instead, PUP features are extracted by predicting the percentage of un-linked pixels in corresponding retinal patches of image pairs. The un-linked pixels are determined by feature classification on orientation and luminance distributions. Calculation of PUP maps is much faster than traditional disparity computation, and the experimental results demonstrate that the predictive power attained using the PUP map is highly competitive with prior models that rely on computed disparity maps. HighlightsA first of a kind 3D discomfort model without disparity calculation is proposed.A new feature map, the percentage of un-linked pixels is developed in the model.PUP map is superior to prior models that rely on computed disparity maps.

[1]  Mtm Marc Lambooij,et al.  Visual Discomfort and Visual Fatigue of Stereoscopic Displays: A Review , 2009 .

[2]  Richard Szeliski,et al.  A Taxonomy and Evaluation of Dense Two-Frame Stereo Correspondence Algorithms , 2001, International Journal of Computer Vision.

[3]  Patricia M. Riddell,et al.  The use of cues to convergence and accommodation in naïve, uninstructed participants , 2008, Vision Research.

[4]  Ian P. Howard,et al.  Perceiving in DepthVolume 1 Basic Mechanisms , 2012 .

[5]  Geoffrey Egnal,et al.  Detecting Binocular Half-Occlusions: Empirical Comparisons of Five Approaches , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[6]  Alan C. Bovik,et al.  Visual Importance Pooling for Image Quality Assessment , 2009, IEEE Journal of Selected Topics in Signal Processing.

[7]  Masahito Torii,et al.  Dynamic measurement of accommodative responses while viewing stereoscopic images , 2008 .

[8]  Robert Patterson,et al.  Human factors of 3‐D displays , 2007 .

[9]  Martin S Banks,et al.  Limits of stereopsis explained by local cross-correlation. , 2009, Journal of vision.

[10]  Yong Man Ro,et al.  Subjective assessment of visual discomfort induced by binocular disparity and stimulus width in stereoscopic image , 2013, Electronic Imaging.

[11]  Kwanghoon Sohn,et al.  Visual Fatigue Prediction for Stereoscopic Image , 2011, IEEE Transactions on Circuits and Systems for Video Technology.

[12]  Neil A. Dodgson,et al.  Predicting stereoscopic viewing comfort using a coherence-based computational model , 2011, CAe '11.

[13]  Joydeep Ghosh,et al.  Algorithmic assessment of 3D quality of experience for images and videos , 2011, 2011 Digital Signal Processing and Signal Processing Education Meeting (DSP/SPE).

[14]  Sumio Yano,et al.  A study of visual fatigue and visual comfort for 3D HDTV/HDTV images , 2002 .

[15]  Peter A. Howarth,et al.  Visual fatigue caused by viewing stereoscopic motion images: Background, theories, and observations , 2008, Displays.

[16]  F. Okano,et al.  Repeated vergence adaptation causes the decline of visual functions in watching stereoscopic television , 2005, Journal of Display Technology.

[17]  Alan C. Bovik,et al.  A Completely Blind Video Integrity Oracle , 2016, IEEE Transactions on Image Processing.

[18]  Robert S. Allison,et al.  Coarse-fine dichotomies in human stereopsis , 2009, Vision Research.

[19]  Alan C. Bovik,et al.  3D Visual Discomfort Predictor: Analysis of Disparity and Neural Activity Statistics , 2015, IEEE Transactions on Image Processing.

[20]  Alan C. Bovik,et al.  Video Quality Pooling Adaptive to Perceptual Distortion Severity , 2013, IEEE Transactions on Image Processing.

[21]  Alan C. Bovik,et al.  Experiments in segmenting texton patterns using localized spatial filters , 1989, Pattern Recognit..

[22]  Fumio Okano,et al.  Measurement of parallax distribution and its application to the analysis of visual comfort for stereoscopic HDTV , 2003, IS&T/SPIE Electronic Imaging.

[23]  Touradj Ebrahimi,et al.  Impact of Acquisition Distortion on the Quality of Stereoscopic Images , 2010 .

[24]  Michael J. Black,et al.  Secrets of optical flow estimation and their principles , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[25]  Kwanghoon Sohn,et al.  Visual fatigue modeling and analysis for stereoscopic video , 2012 .

[26]  Ian P. Howard,et al.  Perceiving in Depth , 2012 .

[27]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[28]  I. Heynderickx,et al.  Measuring visual discomfort associated with 3D displays , 2009, Electronic Imaging.

[29]  Alexander Toet,et al.  Visual comfort of binocular and 3D displays , 2004 .

[30]  Kwanghoon Sohn,et al.  Depth map quality metric for three-dimensional video , 2009, Electronic Imaging.

[31]  Touradj Ebrahimi,et al.  A new analysis method for paired comparison and its application to 3D quality assessment , 2011, ACM Multimedia.

[32]  Do-Kyoung Kwon,et al.  Full-reference quality assessment of stereopairs accounting for rivalry , 2013, Signal Process. Image Commun..

[33]  Yoonsik Choe,et al.  Disparity estimation using a region-dividing technique and energy-based regularization , 2004 .

[34]  Alan C. Bovik,et al.  Natural scene statistics of color and range , 2011, 2011 18th IEEE International Conference on Image Processing.

[35]  Laurie M. Wilcox,et al.  Depth magnitude and binocular disparity: a closer look at patent vs. qualitative stereopsis , 2010 .

[36]  Alan C. Bovik,et al.  Making a “Completely Blind” Image Quality Analyzer , 2013, IEEE Signal Processing Letters.

[37]  James S. Wolffsohn,et al.  Target spatial frequency determines the response to conflicting defocus- and convergence-driven accommodative stimuli , 2006, Vision Research.

[38]  Makoto Okui,et al.  Parallax distribution for ease of viewing in stereoscopic HDTV , 2002, IS&T/SPIE Electronic Imaging.

[39]  Yong Man Ro,et al.  Predicting Visual Discomfort Using Object Size and Disparity Information in Stereoscopic Images , 2013, IEEE Transactions on Broadcasting.

[40]  Y. Yeh,et al.  Limits of Fusion and Depth Judgment in Stereoscopic Color Displays , 1990, Human factors.

[41]  K.-S. Choi,et al.  Control of vortex shedding from a hemisphere by local suction , 2006, J. Vis..

[42]  Matthijs C. Dorst Distinctive Image Features from Scale-Invariant Keypoints , 2011 .

[43]  Johannes Burge,et al.  The vertical horopter is not adaptable, but it may be adaptive. , 2011, Journal of vision.

[44]  Filippo Speranza,et al.  Effect of disparity and motion on visual comfort of stereoscopic images , 2006, Electronic Imaging.

[45]  G. Sperling Binocular Vision: A Physical and a Neural Theory , 1970 .

[46]  F. Okano,et al.  PARALLAX DISTRIBUTION AND VISUAL COMFORT ON STEREOSCOPIC HDTV , 2002 .

[47]  Masaki Emoto,et al.  Two factors in visual fatigue caused by stereoscopic HDTV images , 2004 .

[48]  Wilson S. Geisler,et al.  Multichannel Texture Analysis Using Localized Spatial Filters , 1990, IEEE Trans. Pattern Anal. Mach. Intell..

[49]  Joydeep Ghosh,et al.  Blind Image Quality Assessment Without Human Training Using Latent Quality Factors , 2012, IEEE Signal Processing Letters.

[50]  Matthias Wöpking,et al.  Viewing comfort with stereoscopic pictures : an experimental study on the subjective effects of disparity magnitude and depth of focus , 1995 .

[51]  Alan C. Bovik,et al.  3D Visual Discomfort Prediction: Vergence, Foveation, and the Physiological Optics of Accommodation , 2014, IEEE Journal of Selected Topics in Signal Processing.

[52]  Robert Patterson,et al.  Review Paper: Human factors of stereo displays: An update , 2009 .

[53]  Ian P. Howard,et al.  Seeing in Depth , 2008 .

[54]  Filippo Speranza,et al.  Stereoscopic 3D-TV: Visual Comfort , 2011, IEEE Transactions on Broadcasting.

[55]  EgnalGeoffrey,et al.  Detecting Binocular Half-Occlusions , 2002 .