On the plausibility of the discriminant center-surround hypothesis for visual saliency.

It has been suggested that saliency mechanisms play a role in perceptual organization. This work evaluates the plausibility of a recently proposed generic principle for visual saliency: that all saliency decisions are optimal in a decision-theoretic sense. The discriminant saliency hypothesis is combined with the classical assumption that bottom-up saliency is a center-surround process to derive a (decision-theoretic) optimal saliency architecture. Under this architecture, the saliency of each image location is equated to the discriminant power of a set of features with respect to the classification problem that opposes stimuli at center and surround. The optimal saliency detector is derived for various stimulus modalities, including intensity, color, orientation, and motion, and shown to make accurate quantitative predictions of various psychophysics of human saliency for both static and motion stimuli. These include some classical nonlinearities of orientation and motion saliency and a Weber law that governs various types of saliency asymmetries. The discriminant saliency detectors are also applied to various saliency problems of interest in computer vision, including the prediction of human eye fixations on natural scenes, motion-based saliency in the presence of ego-motion, and background subtraction in highly dynamic scenes. In all cases, the discriminant saliency detectors outperform previously proposed methods from both the saliency and the general computer vision literatures.

[1]  David Mumford,et al.  Statistics of natural images and models , 1999, Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149).

[2]  I. Motoyoshi,et al.  Visual response saturation to orientation contrast in the perception of texture boundary. , 2001, Journal of the Optical Society of America. A, Optics, image science, and vision.

[3]  J. Bergen,et al.  Texture segregation and orientation gradient , 1991, Vision Research.

[4]  Laurent Itti,et al.  Automatic foveation for video compression using a neurobiological model of visual attention , 2004, IEEE Transactions on Image Processing.

[5]  Iain D. Gilchrist,et al.  Visual correlates of fixation selection: effects of scale and time , 2005, Vision Research.

[6]  Nuno Vasconcelos,et al.  Modeling, Clustering, and Segmenting Video with Mixtures of Dynamic Textures , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[7]  Christof Koch,et al.  A Model of Saliency-Based Visual Attention for Rapid Scene Analysis , 2009 .

[8]  H. Nothdurft The conspicuousness of orientation and motion contrast. , 1993, Spatial vision.

[9]  J. Beck Perceptual Grouping Produced by Changes in Orientation and Shape , 1966, Science.

[10]  C. Koch,et al.  Computational modelling of visual attention , 2001, Nature Reviews Neuroscience.

[11]  Stefano Soatto,et al.  Dynamic Textures , 2003, International Journal of Computer Vision.

[12]  H S Pennypacker,et al.  On Behavioral Analysis , 1981, The Behavior analyst.

[13]  Nuno Vasconcelos,et al.  Discriminant Saliency for Visual Recognition from Cluttered Scenes , 2004, NIPS.

[14]  H. Nothdurft Feature analysis and the role of similarity in preattentive vision , 1992, Perception & psychophysics.

[15]  G. Moraglia,et al.  Display organization and the detection of horizontal line segments , 1989, Perception & psychophysics.

[16]  Vision Research , 1961, Nature.

[17]  F. Attneave,et al.  What Variables Produce Similarity Grouping , 1970 .

[18]  A. Treisman Preattentive processing in vision , 1985, Comput. Vis. Graph. Image Process..

[19]  John K. Tsotsos,et al.  Saliency Based on Information Maximization , 2005, NIPS.

[20]  R. Ivry,et al.  Asymmetry in visual search for targets defined by differences in movement speed. , 1992, Journal of experimental psychology. Human perception and performance.

[21]  R. Rosenholtz A simple saliency model predicts a number of motion popout phenomena , 1999, Vision Research.

[22]  Nuno Vasconcelos,et al.  Decision-Theoretic Saliency: Computational Principles, Biological Plausibility, and Implications for Neurophysiology and Psychophysics , 2009, Neural Computation.

[23]  B Julesz,et al.  "Where" and "what" in vision. , 1985, Science.

[24]  Stan Sclaroff,et al.  Segmenting foreground objects from a dynamic textured background via a robust Kalman filter , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[25]  Zoran Zivkovic,et al.  Improved adaptive Gaussian mixture model for background subtraction , 2004, Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004..

[26]  D H Brainard,et al.  The Psychophysics Toolbox. , 1997, Spatial vision.

[27]  J. Sengupta The Nonparametric Approach , 1989 .

[28]  Nuno Vasconcelos,et al.  Scalable discriminant feature selection for image retrieval and recognition , 2004, CVPR 2004.

[29]  R. R. Clarke Transform coding of images , 1985 .

[30]  W. Eric L. Grimson,et al.  Adaptive background mixture models for real-time tracking , 1999, Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149).

[31]  Nuno Vasconcelos,et al.  Natural Image Statistics and Low-Complexity Feature Selection , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[32]  J. Movshon,et al.  Nature and interaction of signals from the receptive field center and surround in macaque V1 neurons. , 2002, Journal of neurophysiology.

[33]  E H Adelson,et al.  Spatiotemporal energy models for the perception of motion. , 1985, Journal of the Optical Society of America. A, Optics and image science.

[34]  C. Koch,et al.  A saliency-based search mechanism for overt and covert shifts of visual attention , 2000, Vision Research.

[35]  J. Beck Similarity grouping and peripheral discriminability under uncertainty. , 1972, The American journal of psychology.

[36]  A Treisman,et al.  Feature analysis in early vision: evidence from search asymmetries. , 1988, Psychological review.

[37]  Herbert L. Groginsky Adaptive Nonparametric Detection , 1967 .

[38]  J. Wolfe,et al.  The role of categorization in visual search for orientation. , 1992, Journal of experimental psychology. Human perception and performance.

[39]  Brian Scassellati,et al.  A Behavioral Analysis of Computational Models of Visual Attention , 2007, International Journal of Computer Vision.

[40]  H. Nothdurft Salience of Feature Contrast , 2005 .

[41]  David J. Heeger,et al.  Optical flow from spatialtemporal filters , 1987 .

[42]  S Ullman,et al.  Shifts in selective visual attention: towards the underlying neural circuitry. , 1985, Human neurobiology.

[43]  J. Beck Effect of orientation and of shape similarity on perceptual grouping , 1966 .

[44]  Stéphane Mallat,et al.  A Theory for Multiresolution Signal Decomposition: The Wavelet Representation , 1989, IEEE Trans. Pattern Anal. Mach. Intell..

[45]  D. V. van Essen,et al.  Neuronal responses to static texture patterns in area V1 of the alert macaque monkey. , 1992, Journal of neurophysiology.

[46]  Pierre Baldi,et al.  A principled approach to detecting surprising events in video , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[47]  Yaser Sheikh,et al.  Bayesian modeling of dynamic scenes for object detection , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[48]  B Julesz,et al.  Experiments in the visual perception of texture. , 1975, Scientific American.

[49]  Bernhard Schölkopf,et al.  A Nonparametric Approach to Bottom-Up Visual Saliency , 2006, NIPS.

[50]  Eero P. Simoncelli,et al.  Image compression via joint statistical characterization in the wavelet domain , 1999, IEEE Trans. Image Process..

[51]  H. C. Nothdurft,et al.  Texture segmentation and pop-out from orientation contrast , 1991, Vision Research.

[52]  H. Nothdurft Salience from feature contrast: variations with texture density , 2000, Vision Research.

[53]  Minh N. Do,et al.  Wavelet-based texture retrieval using generalized Gaussian density and Kullback-Leibler distance , 2002, IEEE Trans. Image Process..

[54]  D. Regan Orientation Discrimination for Bars Defined by Orientation Texture , 1995, Perception.

[55]  Bela Julesz,et al.  A theory of preattentive texture discrimination based on first-order statistics of textons , 2004, Biological Cybernetics.

[56]  Asha Iyer,et al.  Components of bottom-up gaze allocation in natural images , 2005, Vision Research.

[57]  Derrick J. Parkhurst,et al.  Modeling the role of salience in the allocation of overt visual attention , 2002, Vision Research.

[58]  H. C. Nothdurft,et al.  The role of local contrast in pop-out of orientation, motion and color. , 1991 .

[59]  B. Julesz A brief outline of the texton theory of human vision , 1984, Trends in Neurosciences.

[60]  A. Treisman,et al.  A feature-integration theory of attention , 1980, Cognitive Psychology.

[61]  Nuno Vasconcelos Feature selection by maximum marginal diversity: optimality and implications for visual recognition , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[62]  Christof Koch,et al.  Modeling attention to salient proto-objects , 2006, Neural Networks.

[63]  D. Foster,et al.  Asymmetries in oriented-line detection indicate two orthogonal filters in early vision , 1991, Proceedings of the Royal Society of London. Series B: Biological Sciences.

[64]  Pietro Perona,et al.  Graph-Based Visual Saliency , 2006, NIPS.

[65]  D H HUBEL,et al.  RECEPTIVE FIELDS AND FUNCTIONAL ARCHITECTURE IN TWO NONSTRIATE VISUAL AREAS (18 AND 19) OF THE CAT. , 1965, Journal of neurophysiology.

[66]  ContoursJames H. Elder The Statistics of Natural Image , 1998 .

[67]  Zhaoping Li A saliency map in primary visual cortex , 2002, Trends in Cognitive Sciences.

[68]  J. Wolfe,et al.  Guided Search 2.0 A revised model of visual search , 1994, Psychonomic bulletin & review.