Cognitive Vision Needs Attention to Link Sensing with Recognition

“Cognitive computer vision is concerned with integration and control of vision systems using explicit but not necessarily symbolic models of context, situation and goaldirected behaviour” (Vernon 2003 [473]). This paper discusses one small but critical slice of a cognitive computer vision system, that of visual attention. The presentation begins with a brief discussion on a definition for attention followed by an enumeration of the different ways in which attention should play a role in computer vision and cognitive vision systems in particular. The Selective Tuning Model is then overviewed with an emphasis on its components that are most relevant for cognitive vision, namely the winner-take-all processing, the use of distributed saliency and feature binding as a link to recognition.

[1]  Hermann von Helmholtz,et al.  Treatise on Physiological Optics , 1962 .

[2]  Christoph von der Malsburg,et al.  The What and Why of Binding: Review The Modeler's Perspective , 1999 .

[3]  D. J. Felleman,et al.  Distributed hierarchical processing in the primate cerebral cortex. , 1991, Cerebral cortex.

[4]  H B Barlow,et al.  Single units and sensation: a neuron doctrine for perceptual psychology? , 1972, Perception.

[5]  John K. Tsotsos,et al.  An Attentional Prototype for Early Vision , 1992, ECCV.

[6]  John K. Tsotsos Analyzing vision at the complexity level , 1990, Behavioral and Brain Sciences.

[7]  John K. Tsotsos,et al.  Directing attention to onset and offset of image events for eye-head movement control , 1994, Proceedings of 12th International Conference on Pattern Recognition.

[8]  John K. Tsotsos,et al.  Modeling Visual Attention via Selective Tuning , 1995, Artif. Intell..

[9]  A. Treisman,et al.  Illusory conjunctions in the perception of objects , 1982, Cognitive Psychology.

[10]  A. Treisman,et al.  A feature-integration theory of attention , 1980, Cognitive Psychology.

[11]  A. Roskies The Binding Problem , 1999, Neuron.

[12]  Ernst D. Dickmanns,et al.  A general dynamic vision architecture for UGV and UAV , 1992, Applied Intelligence.

[13]  John K. Tsotsos The Complexity of Perceptual Search Tasks , 1989, IJCAI.

[14]  John K. Tsotsos On the relative complexity of active vs. passive visual search , 2004, International Journal of Computer Vision.

[15]  Shumeet Baluja,et al.  Dynamic Relevance: Vision-Based Focus of Attention Using Artificial Neural Networks. (Technical Note) , 1997, Artif. Intell..