A coherent computational approach to model bottom-up visual attention

Visual attention is a mechanism that filters out redundant visual information and detects the most relevant parts of our visual field. Automatic determination of the most visually relevant areas would be useful in many applications such as image and video coding, watermarking, video browsing, and quality assessment. Many research groups are currently investigating computational modeling of the visual attention system. The first published computational models were based on a few basic, well-understood properties of the human visual system (HVS). These models feature a single perceptual layer that simulates only one aspect of the visual system. More recent models integrate complex features of the HVS and simulate a hierarchical perceptual representation of the visual input. The bottom-up mechanism is the feature most commonly found in modern models. This mechanism refers to involuntary attention (i.e., salient spatial visual features that effortlessly or involuntarily attract our attention). This paper presents a coherent computational approach to modeling bottom-up visual attention. The model is based mainly on the current understanding of HVS behavior. Contrast sensitivity functions, perceptual decomposition, visual masking, and center-surround interactions are some of the features implemented in this model. The performance of the algorithm is assessed using natural images and experimental measurements from an eye-tracking system. Two well-known metrics (correlation coefficient and Kullback-Leibler divergence) are used to validate the model. A further metric is also defined. Finally, the results of this model are compared with those of a reference bottom-up model.
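As an illustration of the two named validation metrics, the sketch below computes a linear correlation coefficient and a Kullback-Leibler divergence between a predicted saliency map and a fixation density map derived from eye-tracking data. It is a minimal sketch under stated assumptions, not the procedure used in the paper: the function names, the representation of maps as nested lists of non-negative values, the direction of the divergence (reference relative to prediction), and the `eps` smoothing term are all hypothetical choices made for the example.

```python
import math


def _flatten(grid):
    """Flatten a 2D map (list of rows) into a list of floats."""
    return [float(v) for row in grid for v in row]


def correlation_coefficient(pred, ref):
    """Pearson correlation between two maps of identical shape."""
    p, r = _flatten(pred), _flatten(ref)
    n = len(p)
    mean_p, mean_r = sum(p) / n, sum(r) / n
    cov = sum((a - mean_p) * (b - mean_r) for a, b in zip(p, r))
    var_p = sum((a - mean_p) ** 2 for a in p)
    var_r = sum((b - mean_r) ** 2 for b in r)
    if var_p == 0 or var_r == 0:
        # Correlation is undefined for a constant map; returning 0.0
        # is a convention assumed here for illustration only.
        return 0.0
    return cov / math.sqrt(var_p * var_r)


def kl_divergence(ref, pred, eps=1e-12):
    """KL divergence of the reference map relative to the predicted map.

    Both maps are normalised to sum to 1 so that they can be read as
    probability distributions. The direction (ref relative to pred) and
    the eps smoothing are assumptions, not taken from the paper.
    """
    r, p = _flatten(ref), _flatten(pred)
    sum_r, sum_p = sum(r), sum(p)
    if sum_r <= 0 or sum_p <= 0:
        raise ValueError("maps must be non-negative with a positive sum")
    r = [v / sum_r for v in r]
    p = [v / sum_p for v in p]
    # Terms with zero reference probability contribute nothing.
    return sum(a * math.log((a + eps) / (b + eps)) for a, b in zip(r, p) if a > 0)
```

A correlation close to 1 and a divergence close to 0 both indicate that the predicted map resembles the measured fixation density; because the divergence is asymmetric, the choice of direction should be stated whenever values are reported.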
