A Computational Model for Stereoscopic Visual Saliency Prediction

Depth information plays an important role in human vision as it provides additional cues that distinguish objects from their backgrounds. This paper explores depth information for analyzing stereoscopic saliency and presents a computational model that predicts stereoscopic visual saliency based on three aspects of human vision: 1) the pop-out effect; 2) comfort zones; and 3) background effects. Through an analysis of these three phenomena, we find that most of the stereoscopic saliency region can be explained. Our model comprises three modules, each describing one aspect of saliency distribution, and a control function that can be used to adjust the three models independently. The relationship between the three models is not mutually exclusive. One, two, or three phenomena may appear in one image. Therefore, to accurately determine which phenomena the image conforms to, we have devised a selection strategy that chooses the appropriate combination of models based on the content of the image. Our approach is implemented within a framework based on the multifeature analysis. The framework considers surrounding regions, color/depth contrast, and points of interest. The selection strategy can improve the performance of the framework. A series of experiments on two recent eye-tracking datasets shows that our proposed method outperforms several state-of-the-art saliency models.

[1]  Stan Sclaroff,et al.  Exploiting Surroundedness for Saliency Detection: A Boolean Map Approach , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[2]  Ken Chen,et al.  Stereoscopic Visual Attention Model for 3D Video , 2010, MMM.

[3]  Sanghoon Lee,et al.  Transition of Visual Attention Assessment in Stereoscopic Images With Evaluation of Subjective Visual Quality and Discomfort , 2015, IEEE Transactions on Multimedia.

[4]  Rongrong Ji,et al.  RGBD Salient Object Detection: A Benchmark and Algorithms , 2014, ECCV.

[5]  Huan Du,et al.  Depth-Aware Salient Object Detection and Segmentation via Multiscale Discriminative Saliency Fusion and Bootstrap Learning , 2017, IEEE Transactions on Image Processing.

[6]  Junle Wang,et al.  Study of depth bias of observers in free viewing of still stereoscopic synthetic stimuli , 2012 .

[7]  Gangyi Jiang,et al.  A depth perception and visual comfort guided computational model for stereoscopic 3D visual saliency , 2015, Signal Process. Image Commun..

[8]  Weisi Lin,et al.  Saliency detection for stereoscopic images , 2013, 2013 Visual Communications and Image Processing (VCIP).

[9]  Shi-Min Hu,et al.  Global contrast based salient region detection , 2011, CVPR 2011.

[10]  Christine Fernandez-Maloigne,et al.  A visual attention model for stereoscopic 3D images using monocular cues , 2015, Signal Process. Image Commun..

[11]  King Ngi Ngan,et al.  Object Co-Segmentation Based on Shortest Path Algorithm and Saliency Model , 2012, IEEE Transactions on Multimedia.

[12]  Peyman Milanfar,et al.  Static and space-time visual saliency detection by self-resemblance. , 2009, Journal of vision.

[13]  T. Poggio,et al.  The analysis of stereopsis. , 1984, Annual review of neuroscience.

[14]  Heiko Hirschmüller,et al.  Evaluation of Cost Functions for Stereo Matching , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[15]  Sk Subhan,et al.  Dense and Sparse Reconstruction Error Based Saliency Descriptor , 2018 .

[16]  Jian Zhang,et al.  A Novel Saliency Model for Stereoscopic Images , 2015, 2015 International Conference on Digital Image Computing: Techniques and Applications (DICTA).

[17]  Pietro Perona,et al.  Graph-Based Visual Saliency , 2006, NIPS.

[18]  H. Pashler,et al.  Evidence for split attentional foci. , 2000, Journal of Experimental Psychology: Human Perception and Performance.

[19]  Liqing Zhang,et al.  Saliency Detection: A Spectral Residual Approach , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[20]  Zhi Liu,et al.  Salient region detection for stereoscopic images , 2014, 2014 19th International Conference on Digital Signal Processing.

[21]  Christine Fernandez-Maloigne,et al.  A statistical study of the correlation between interest points and gaze points , 2012, Electronic Imaging.

[22]  Pascal Fua,et al.  SLIC Superpixels Compared to State-of-the-Art Superpixel Methods , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[23]  Ali Borji,et al.  State-of-the-Art in Visual Attention Modeling , 2013, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[24]  K. Madhava Krishna,et al.  Depth really Matters: Improving Visual Salient Region Detection with Depth , 2013, BMVC.

[25]  Yao Zhao,et al.  Depth-Based Stereoscopic Projection Approach for 3D Saliency Detection , 2015, PCM.

[26]  Liqing Zhang,et al.  Dynamic visual attention: searching for coding length increments , 2008, NIPS.

[27]  Xueqing Li,et al.  Leveraging stereopsis for saliency analysis , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[28]  Bärbel Mertsching,et al.  Fast Depth Saliency from Stereo for Region-Based Artificial Visual Attention , 2010, ACIVS.

[29]  David M. Hoffman,et al.  Vergence-accommodation conflicts hinder visual performance and cause visual fatigue. , 2008, Journal of vision.

[30]  Jari Takatalo,et al.  What do people look at when they watch stereoscopic movies? , 2010, Electronic Imaging.

[31]  Ya-Hsin Hsueh,et al.  Characteristics of visual fatigue under the effect of 3D animation. , 2015, Technology and health care : official journal of the European Society for Engineering and Medicine.

[32]  Marcus Barkowsky,et al.  The Importance of Visual Attention in Improving the 3D-TV Viewing Experience: Overview and New Perspectives , 2011, IEEE Transactions on Broadcasting.

[33]  S. Süsstrunk,et al.  Frequency-tuned salient region detection , 2009, CVPR 2009.

[34]  Qiang Wu,et al.  Stereoscopic visual saliency prediction based on stereo contrast and stereo focus , 2017, EURASIP J. Image Video Process..

[35]  John K. Tsotsos,et al.  Saliency Based on Information Maximization , 2005, NIPS.

[36]  Christel Chamaret,et al.  Adaptive 3D rendering based on region-of-interest , 2010, Electronic Imaging.

[37]  Qionghai Dai,et al.  Learning Blind Quality Evaluator for Stereoscopic Images Using Joint Sparse Representation , 2016, IEEE Transactions on Multimedia.

[38]  Mtm Marc Lambooij,et al.  Visual Discomfort and Visual Fatigue of Stereoscopic Displays: A Review , 2009 .

[39]  Hao Sheng,et al.  Saliency analysis based on depth contrast increased , 2016, 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[40]  Harish Katti,et al.  Depth Matters: Influence of Depth Cues on Visual Saliency , 2012, ECCV.

[41]  Henrik I. Christensen,et al.  Computational visual attention systems and their cognitive foundations: A survey , 2010, TAP.

[42]  B. Tatler,et al.  Yarbus, eye movements, and vision , 2010, i-Perception.

[43]  Jian Yu,et al.  Saliency Detection by Multitask Sparsity Pursuit , 2012, IEEE Transactions on Image Processing.

[44]  Ernst Niebur,et al.  Computing 3D saliency from a 2D image , 2013, 2013 47th Annual Conference on Information Sciences and Systems (CISS).

[45]  Aljoscha Smolic,et al.  Nonlinear disparity mapping for stereoscopic 3D , 2010, ACM Trans. Graph..

[46]  Junle Wang,et al.  Computational Model of Stereoscopic 3D Visual Saliency , 2013, IEEE Transactions on Image Processing.

[47]  Christof Koch,et al.  A Model of Saliency-Based Visual Attention for Rapid Scene Analysis , 2009 .

[48]  Harold Pashler,et al.  A Boolean map theory of visual attention. , 2007, Psychological review.

[49]  Ronald A. Rensink,et al.  TO SEE OR NOT TO SEE: The Need for Attention to Perceive Changes in Scenes , 1997 .

[50]  Ahmet M. Kondoz,et al.  Sensitivity Analysis of the Human Visual System for Depth Cues in Stereoscopic 3-D Displays , 2011, IEEE Transactions on Multimedia.

[51]  John K. Tsotsos,et al.  An attentional framework for stereo vision , 2005, The 2nd Canadian Conference on Computer and Robot Vision (CRV'05).

[52]  Luc Van Gool,et al.  3D Saliency for Finding Landmark Buildings , 2016, 2016 Fourth International Conference on 3D Vision (3DV).