An Image Statistics–Based Model for Fixation Prediction

The problem of predicting where people look, or equivalently detecting salient regions, has been related to the statistics of several types of low-level image features. Among these features, contrast and edge information seem to correlate most strongly with fixation locations. The contrast distribution of natural images can be adequately characterized by a two-parameter Weibull distribution, which captures the structure of local contrast and edge frequency in a highly meaningful way. We exploit these observations and investigate whether the parameters of the Weibull distribution constitute a simple model for predicting where people fixate when viewing natural images. Using a set of images with associated eye movements, we assess the joint distribution of the Weibull parameters at fixated and non-fixated regions. We then build a simple classifier based on the log-likelihood ratio between these two joint distributions. Our results show that as few as two values per image region are enough to achieve performance comparable to the state of the art in bottom-up saliency prediction.
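The pipeline described above can be illustrated with a minimal sketch; it is not the authors' implementation. It assumes that each image region is summarized by the maximum-likelihood scale and shape of a Weibull density, f(x) = (k/s)(x/s)^(k-1) exp(-(x/s)^k), fitted to the region's contrast values, and that the joint distribution of (scale, shape) in each class is modeled as a bivariate Gaussian. The abstract does not say how the joint distributions are estimated, so the Gaussian model, the synthetic contrast samples, and all function names and parameter values below are hypothetical.

```python
import math
import random


def fit_weibull(samples, lo=0.05, hi=50.0, iters=60):
    """Maximum-likelihood fit of a two-parameter Weibull; returns (scale, shape)."""
    xs = [x for x in samples if x > 0]
    logs = [math.log(x) for x in xs]
    mean_log = sum(logs) / len(xs)
    x_max = max(xs)

    def score(k):
        # Stationarity condition of the log-likelihood with respect to shape k.
        # Weights are divided by x_max**k to avoid overflow; the ratio is unchanged.
        w = [(x / x_max) ** k for x in xs]
        return sum(wi * li for wi, li in zip(w, logs)) / sum(w) - 1.0 / k - mean_log

    for _ in range(iters):  # bisection works because score(k) increases with k
        mid = 0.5 * (lo + hi)
        if score(mid) > 0:
            hi = mid
        else:
            lo = mid
    shape = 0.5 * (lo + hi)
    scale = (sum(x ** shape for x in xs) / len(xs)) ** (1.0 / shape)
    return scale, shape


def fit_gaussian2d(points):
    """Mean and covariance of 2-D points (assumed model of the joint distribution)."""
    n = len(points)
    mx = sum(p[0] for p in points) / n
    my = sum(p[1] for p in points) / n
    sxx = sum((p[0] - mx) ** 2 for p in points) / n
    syy = sum((p[1] - my) ** 2 for p in points) / n
    sxy = sum((p[0] - mx) * (p[1] - my) for p in points) / n
    return mx, my, sxx, sxy, syy


def log_density(p, g):
    """Log of the bivariate Gaussian density g evaluated at point p."""
    mx, my, sxx, sxy, syy = g
    det = sxx * syy - sxy * sxy
    dx, dy = p[0] - mx, p[1] - my
    maha = (syy * dx * dx - 2.0 * sxy * dx * dy + sxx * dy * dy) / det
    return -0.5 * maha - 0.5 * math.log(det) - math.log(2.0 * math.pi)


def log_likelihood_ratio(p, fixated, non_fixated):
    """Positive values favor the 'fixated' class, negative the 'non-fixated' one."""
    return log_density(p, fixated) - log_density(p, non_fixated)


def synth_regions(scale, shape, n_regions=40, n_samples=200):
    # Stand-in for real data: each "region" is a list of synthetic contrast values.
    return [
        fit_weibull([random.weibullvariate(scale, shape) for _ in range(n_samples)])
        for _ in range(n_regions)
    ]


random.seed(0)
# The class parameters below are arbitrary illustration values, not measured ones.
fixated = fit_gaussian2d(synth_regions(2.0, 1.5))
non_fixated = fit_gaussian2d(synth_regions(0.5, 0.8))

probe = fit_weibull([random.weibullvariate(2.0, 1.5) for _ in range(200)])
print("Weibull (scale, shape):", probe)
print("log-likelihood ratio:", log_likelihood_ratio(probe, fixated, non_fixated))
```

With real data, the synthetic samples would be replaced by local contrast responses (for example, gradient magnitudes) collected from regions around fixated and non-fixated locations, and thresholding the log-likelihood ratio would give the two-value-per-region classifier the abstract refers to.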
