Cue-guided search: a computational model of selective attention

Selective visual attention in a natural environment can be seen as the interaction between the external visual stimulus and task specific knowledge of the required behavior. This interaction between the bottom-up stimulus and the top-down, task-related knowledge is crucial for what is selected in the space and time within the scene. In this paper, we propose a computational model for selective attention for a visual search task. We go beyond simple saliency-based attention models to model selective attention guided by top-down visual cues, which are dynamically integrated with the bottom-up information. In this way, selection of a location is accomplished by interaction between bottom-up and top-down information. First, the general structure of our model is briefly introduced and followed by a description of the top-down processing of task-relevant cues. This is then followed by a description of the processing of the external images to give three feature maps that are combined to give an overall bottom-up map. Second, the development of the formalism for our novel interactive spiking neural network (ISNN) is given, with the interactive activation rule that calculates the integration map. The learning rule for both bottom-up and top-down weight parameters are given, together with some further analysis of the properties of the resulting ISNN. Third, the model is applied to a face detection task to search for the location of a specific face that is cued. The results show that the trajectories of attention are dramatically changed by interaction of information and variations of cues, giving an appropriate, task-relevant search pattern. Finally, we discuss ways in which these results can be seen as compatible with existing psychological evidence.

[1]  Geoffrey E. Hinton,et al.  Learning Mixture Models of Spatial Coherence , 1993, Neural Computation.

[2]  Jim Kay,et al.  Activation Functions, Computational Goals, and Learning Rules for Local Processors with Contextual Guidance , 1997, Neural Computation.

[3]  Christof Koch,et al.  Attentional Selection for Object Recognition - A Gentle Way , 2002, Biologically Motivated Computer Vision.

[4]  N. Kanwisher,et al.  Visual attention: Insights from brain imaging , 2000, Nature Reviews Neuroscience.

[5]  Jianfeng Feng,et al.  Training integrate-and-fire neurons with the Informax principle II , 2003, IEEE Trans. Neural Networks.

[6]  Michael C. Mozer,et al.  Computational modeling of spatial attention , 1996 .

[7]  Jianfeng Feng,et al.  Computational neuroscience , 1986, Behavioral and Brain Sciences.

[8]  Laurent Itti,et al.  A Goal Oriented Attention Guidance Model , 2002, Biologically Motivated Computer Vision.

[9]  N. Kanwisher,et al.  The Generality of Parietal Involvement in Visual Attention , 1999, Neuron.

[10]  C. Balkenius Attention, habituation and conditioning: toward a computational model , 2000 .

[11]  G. Mangun,et al.  The neural mechanisms of top-down attentional control , 2000, Nature Neuroscience.

[12]  J. Wolfe Chapter 8 – Visual Attention , 2000 .

[13]  Michael W. Spratling Cortical region interactions and the functional role of apical dendrites. , 2002, Behavioral and cognitive neuroscience reviews.

[14]  Robert Hockey,et al.  Effects of repetition on the facilitatory and inhibitory components of orienting in visual space , 1987, Neuropsychologia.

[15]  Yehezkel Yeshurun,et al.  Context-free attentional operators: The generalized symmetry transform , 1995, International Journal of Computer Vision.

[16]  James J. Clark,et al.  Modal Control Of An Attentive Vision System , 1988, [1988 Proceedings] Second International Conference on Computer Vision.

[17]  Narendra Ahuja,et al.  Detecting Faces in Images: A Survey , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[18]  A. Kramer,et al.  Splitting the Beam: Distribution of Attention Over Noncontiguous Regions of the Visual Field , 1995 .

[19]  Brian Scassellati,et al.  A Context-Dependent Attention System for a Social Robot , 1999, IJCAI.

[20]  Jianfeng Feng,et al.  Training the integrate-and-fire model with the informax principle: I , 2002 .

[21]  R. Desimone,et al.  Responses of Neurons in Inferior Temporal Cortex during Memory- Guided Visual Search , 1998 .

[22]  Bernd Menser,et al.  Segmentation of human faces in color images using connected operators , 1999, Proceedings 1999 International Conference on Image Processing (Cat. 99CH36348).

[23]  Karl J. Friston,et al.  The functional anatomy of attention to visual motion. A functional MRI study. , 1998, Brain : a journal of neurology.

[24]  S. Hillyard,et al.  Event-related brain potentials in the study of visual selective attention. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[25]  H. Pashler The Psychology of Attention , 1997 .

[26]  Erik Hjelmås,et al.  Face Detection: A Survey , 2001, Comput. Vis. Image Underst..

[27]  J. Duncan Selective attention and the organization of visual information. , 1984, Journal of experimental psychology. General.

[28]  N. Kanwisher,et al.  Covert visual attention modulates face-specific activity in the human fusiform gyrus: fMRI study. , 1998, Journal of neurophysiology.

[29]  Giacomo Indiveri,et al.  A neuromorphic VLSI device for implementing 2D selective attention systems , 2001, IEEE Trans. Neural Networks.

[30]  Jianfeng Feng,et al.  Impact of Correlated Inputs on the Output of the Integrate-and-Fire Model , 2000, Neural Computation.

[31]  J. Wolfe,et al.  Guided Search 2.0 A revised model of visual search , 1994, Psychonomic bulletin & review.

[32]  D. Norman Toward a theory of memory and attention. , 1968 .

[33]  T. Poggio,et al.  Multiplying with synapses and neurons , 1992 .

[34]  Konrad P. Körding,et al.  Integrating Top-Down and Bottom-Up Sensory Processing by Somato-Dendritic Interactions , 2004, Journal of Computational Neuroscience.

[35]  John K. Tsotsos,et al.  Modeling Visual Attention via Selective Tuning , 1995, Artif. Intell..

[36]  James M. Rehg,et al.  Statistical Color Models with Application to Skin Detection , 2004, International Journal of Computer Vision.

[37]  PIERRE VAN DE LAAR,et al.  Task-Dependent Learning of Attention , 1997, Neural Networks.

[38]  Allen Allport,et al.  Visual attention , 1989 .

[39]  DeLiang Wang,et al.  Object selection based on oscillatory correlation , 1999, Neural Networks.

[40]  S. Kakade,et al.  Learning and selective attention , 2000, Nature Neuroscience.

[41]  C. Eriksen,et al.  Allocation of attention in the visual field. , 1985, Journal of experimental psychology. Human perception and performance.

[42]  Jianfeng Feng,et al.  Temporal album , 2003, IEEE Trans. Neural Networks.

[43]  J. C. Johnston,et al.  Attention and performance. , 2001, Annual review of psychology.

[44]  C. Eriksen,et al.  Visual attention within and around the field of focal attention: A zoom lens model , 1986, Perception & psychophysics.

[45]  S. Treue Neural correlates of attention in primate visual cortex , 2001, Trends in Neurosciences.

[46]  Peter Kovesi,et al.  Symmetry and Asymmetry from Local Phase , 1997 .

[47]  C. L. M. The Psychology of Attention , 1890, Nature.

[48]  S Ullman,et al.  Shifts in selective visual attention: towards the underlying neural circuitry. , 1985, Human neurobiology.

[49]  Jianfeng Feng,et al.  A dynamic neural network model on global-to-local interaction over time course , 2002, Proceedings of the 9th International Conference on Neural Information Processing, 2002. ICONIP '02..

[50]  M. Chun,et al.  Contextual Cueing: Implicit Learning and Memory of Visual Context Guides Spatial Attention , 1998, Cognitive Psychology.

[51]  M. Corbetta,et al.  Control of goal-directed and stimulus-driven attention in the brain , 2002, Nature Reviews Neuroscience.

[52]  Venu Govindaraju,et al.  Locating human faces in photographs , 1996, International Journal of Computer Vision.

[53]  M. Chun,et al.  Contextual cueing of visual attention , 2022 .

[54]  C. Koch,et al.  Computational modelling of visual attention , 2001, Nature Reviews Neuroscience.

[55]  Vivien A. Casagrande,et al.  Biophysics of Computation: Information Processing in Single Neurons , 1999 .

[56]  M. Chun,et al.  Top-Down Attentional Guidance Based on Implicit Learning of Visual Covariation , 1999 .

[57]  J. Deutsch Perception and Communication , 1958, Nature.

[58]  B. Rossion,et al.  Spatial attention triggered by eye gaze increases and speeds up early visual activity , 2001, Neuroreport.

[59]  Ernst Niebur,et al.  Controlling the Focus of Visual Selective Attention , 2002 .

[60]  Robert B. Fisher,et al.  Object-based visual attention for computer vision , 2003, Artif. Intell..

[61]  E. DeYoe,et al.  A physiological correlate of the 'spotlight' of visual attention , 1999, Nature Neuroscience.

[62]  William C. Ogden,et al.  Attended and unattended processing modes: The role of set for spatial location , 2014 .

[63]  M. Posner,et al.  Attention and the detection of signals. , 1980, Journal of experimental psychology.

[64]  Colin M. Macleod,et al.  Interdimensional interference in the Stroop effect: uncovering the cognitive and neural anatomy of attention , 2000, Trends in Cognitive Sciences.

[65]  C. Koch,et al.  A saliency-based search mechanism for overt and covert shifts of visual attention , 2000, Vision Research.

[66]  D. Navon Forest before trees: The precedence of global features in visual perception , 1977, Cognitive Psychology.

[67]  Saad Ahmed Sirohey,et al.  Human Face Segmentation and Identification , 1998 .

[68]  K Sathian,et al.  Visual search: bottom-up or top-down? , 2000, Frontiers in bioscience : a journal and virtual library.

[69]  R. Stein Some models of neuronal variability. , 1967, Biophysical journal.

[70]  Sang-Hui Park,et al.  Face Detection Using Template Matching and Ellipse Fitting , 2000 .

[71]  R. Desimone,et al.  Neural mechanisms of selective visual attention. , 1995, Annual review of neuroscience.