A model of overt visual attention based on scale-space theory

This paper proposes a vision model for object detection based on scale-space theory, considering knowledge obtained in neurophysiology and human visual characteristics obtained in visual psychology. The proposed model is principally composed of (a) the early vision model and (b) the attention calculation model. In this paper, it is shown first that the visual characteristics can be described by a discretized scale space, considering their multichannel property, spatial nonuniformity, and orientation selectivity. The early vision model is formulated. Next, the attention calculation model and the operation algorithm are formulated on the basis of the knowledge obtained by scale-space theory. The numerical experiments reveal that the proposed model has the following properties. (i) For objects with a high intensity difference, the center of overt attention moves to the center of the object. (ii) The spatial extent of overtscovert attention can be calculated adequately. (iii) The object is captured at the central region of the retina, where the resolution is the highest. Since the proposed model is based on scale-space theory, the theory or model can be easily extended. There are other advantages from an engineering standpoint, such as simple structure, easy implementation, small computational requirements, and very few parameters to be adjusted. © 2004 Wiley Periodicals, Inc. Syst Comp Jpn, 35(10): 1–13, 2004; Published online in Wiley InterScience (). DOI 10.1002sscj.10708

[1]  R Klein,et al.  Search performance without eye movements , 1989, Perception & psychophysics.

[2]  A. Hendrickson,et al.  Human photoreceptor topography , 1990, The Journal of comparative neurology.

[3]  Eric L. Schwartz,et al.  Computational anatomy and functional architecture of striate cortex: A spatial mapping approach to perceptual coding , 1980, Vision Research.

[4]  D. Tolhurst Separate channels for the analysis of the shape and the movement of a moving visual stimulus , 1973, The Journal of physiology.

[5]  J. Rovamo,et al.  Cortical magnification factor predicts the photopic contrast sensitivity of peripheral vision , 1978, Nature.

[6]  D. Robinson,et al.  Shared neural control of attentional shifts and eye movements , 1996, Nature.

[7]  C. Eriksen,et al.  Visual attention within and around the field of focal attention: A zoom lens model , 1986, Perception & psychophysics.

[8]  C Blakemore,et al.  On the existence of neurones in the human visual system selectively sensitive to the orientation and size of retinal images , 1969, The Journal of physiology.

[9]  L. Thibos,et al.  Retinal limits to the detection and resolution of gratings. , 1987, Journal of the Optical Society of America. A, Optics and image science.

[10]  G. Rizzolatti,et al.  Reorienting attention across the horizontal and vertical meridians: Evidence in favor of a premotor theory of attention , 1987, Neuropsychologia.

[11]  P Cavanagh,et al.  Color and luminance: independent frequency shifts. , 1981, Science.

[12]  Stephen M. Pizer,et al.  Hierarchical Shape Description Via the Multiresolution Symmetric Axis Transform , 1987, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[13]  R. Remington Attention and saccadic eye movements. , 1980, Journal of experimental psychology. Human perception and performance.

[14]  Tony Lindeberg,et al.  Detecting salient blob-like image structures and their scales with a scale-space primal sketch: A method for focus-of-attention , 1993, International Journal of Computer Vision.

[15]  I. Abramov,et al.  Color appearance in the peripheral retina: effects of stimulus size. , 1991, Journal of the Optical Society of America. A, Optics and image science.

[16]  K. D. Valois Spatial frequency adaptation can enhance contrast sensitivity , 1977, Vision Research.

[17]  T. Lindeberg,et al.  Foveal scale-space and the linear increase of receptive field size as a function of eccentricity , 1994 .

[18]  C. Koch,et al.  A saliency-based search mechanism for overt and covert shifts of visual attention , 2000, Vision Research.

[19]  Edward H. Adelson,et al.  The Laplacian Pyramid as a Compact Image Code , 1983, IEEE Trans. Commun..

[20]  R. L. de Valois,et al.  Relationship between spatial-frequency and orientation tuning of striate-cortex cells. , 1985, Journal of the Optical Society of America. A, Optics and image science.

[21]  Tony Lindeberg,et al.  Scale-Space Theory in Computer Vision , 1993, Lecture Notes in Computer Science.