Efficient Visual Search: A Connectionist Solution 1

Searching for objects in scenes is a natural task for people and has been extensively studied by psychologists. In this paper we examine this task from a connectionist perspective. Computational complexity arguments suggest that parallel feed-forward networks cannot perform this task ef ficiently. One difficulty is that, in order to distinguish the tar get from distractors, a combination of features must be associated with a single object. Often called the binding problem, this requirement presents a serious hurdle for connectionist models of visual processing when multiple objects are present. Psychophysical experiments suggest that people use covert visual attention to get around this problem. In this paper we describe a psychologically plausible system which uses a focus of attention mechanism to locate target objects. A strategy that combines top-down and bottom-up information is used to minimize search time. The behavior of the resulting system matches the reaction time behavior of people in several interesting tasks.

[1]  Ralph Linsker,et al.  How to Generate Ordered Maps by Maximizing the Mutual Information between Input and Output Signals , 1989, Neural Computation.

[2]  D L Sparks,et al.  Translation of sensory signals into commands for control of saccadic eye movements: role of primate superior colliculus. , 1986, Physiological reviews.

[3]  A. Treisman Features and Objects: The Fourteenth Bartlett Memorial Lecture , 1988, The Quarterly journal of experimental psychology. A, Human experimental psychology.

[4]  Irvin Rock,et al.  Results of a new method for investigating inattention in visual-perception , 1990 .

[5]  A. Treisman,et al.  Conjunction search revisited. , 1990, Journal of experimental psychology. Human perception and performance.

[6]  Michael C. Mozer,et al.  Perception of multiple objects - a connectionist approach , 1991, Neural network modeling and connectionism.

[7]  Jon Driver,et al.  Visual search for a conjunction of movement and form is parallel , 1988, Nature.

[8]  Susan L. Franzel,et al.  Guided search: an alternative to the feature integration model for visual search. , 1989, Journal of experimental psychology. Human perception and performance.

[9]  S. Yantis,et al.  Abrupt visual onsets and selective attention: voluntary versus automatic allocation. , 1990, Journal of experimental psychology. Human perception and performance.

[10]  David Chapman,et al.  Vision, instruction, and action , 1990 .

[11]  Oscar Firschein,et al.  Readings in computer vision: issues, problems, principles, and paradigms , 1987 .

[12]  G W Humphreys,et al.  Visual search for targets defined by combinations of color, shape, and size: An examination of the task constraints on feature and conjunction searches , 1987, Perception & psychophysics.

[13]  Ken Nakayama,et al.  Serial and parallel processing of visual feature conjunctions , 1986, Nature.

[14]  H. Egeth,et al.  Searching for conjunctively defined targets. , 1984, Journal of experimental psychology. Human perception and performance.

[15]  James L. McClelland,et al.  Open Questions About Computation in Cerebral Cortex , 1987 .