A visual attention network for a humanoid robot

For a humanoid robot to interact easily with a person, the robot should have human-like sensory capabilities and attentional mechanisms. Particularly useful is an active vision head controlled by a visual attention system that selects viewpoints in the environment as a function of the robot's task. This paper describes a model of human visual attention called FeatureGate, which is a locally connected, pyramidal, artificial neural network that operates on 2D feature maps of the environment. Given a set of feature maps, and the description of a specific target, FeatureGate finds the location whose features most closely match those of the target. The paper describes the network, its implementation, a series of tests that characterize its performance with respect to a person's performance on a similar task, and its use in the control of an active vision system.

[1]  C. Eriksen,et al.  Temporal and spatial characteristics of selective encoding from visual displays , 1972 .

[2]  Martin Jägersand,et al.  Saliency Maps and Attention Selection in Scale and Spatial Coordinates: An Information Theoretic Approach , 1995, ICCV.

[3]  K. Cave,et al.  Spatial Attention in Visual Search for Features and Feature Conjunctions , 1995 .

[4]  R. Shiffrin,et al.  Controlled and automatic human information processing: I , 1977 .

[5]  K. Cave The FeatureGate model of visual selection , 1999, Psychological research.

[6]  I. Rybak,et al.  A model of attention-guided visual perception and recognition , 1998, Vision Research.

[7]  Dana H. Ballard,et al.  Animate Vision , 1991, Artif. Intell..

[8]  K. Cave,et al.  Flexibility in Spatial Attention Before and After Practice , 1997 .

[9]  M. Posner,et al.  Attention and the detection of signals. , 1980, Journal of experimental psychology.

[10]  A. Treisman,et al.  Automaticity and preattentive processing. , 1992, The American journal of psychology.

[11]  Mark H. Johnson,et al.  Components of Visual Orienting in Early Infancy: Contingency Learning, Anticipatory Looking, and Disengaging , 1991, Journal of Cognitive Neuroscience.

[12]  Susan L. Franzel,et al.  Guided search: an alternative to the feature integration model for visual search. , 1989, Journal of experimental psychology. Human perception and performance.

[13]  Jeremy M Wolfe,et al.  Modeling the role of parallel processing in visual search , 1990, Cognitive Psychology.

[14]  A. Treisman,et al.  A feature-integration theory of attention , 1980, Cognitive Psychology.

[15]  J. Duncan,et al.  Visual search and stimulus similarity. , 1989, Psychological review.

[16]  Aparna L Ratan,et al.  The Role of Fixation and Visual Attention in Object Recognition , 1995 .