Visual Search for Object Features

In this work we present the computational algorithm that combines perceptual and cognitive information during the visual search for object features. The algorithm is initially driven purely by the bottom-up information but during the recognition process it becomes more constrained by the top-down information. Furthermore, we propose a concrete model for integrating information from successive saccades and demonstrate the necessity of using two coordinate systems for measuring feature locations. During the search process, across saccades, the network uses an object-based coordinate system, while during a fixation the network uses the retinal coordinate system that is tied to the location of the fixation point. The only information that the network stores during saccadic exploration is the identity of the features on which it has fixated and their locations with respect to the object-centered system.

[1]  David E. Irwin,et al.  Integrating visual information from successive fixations. , 1982, Science.

[2]  K. Rayner Eye Movements and Visual Cognition , 1992 .

[3]  Leon N. Cooper,et al.  Interactive Parts Model: An Application to Recognition of On-line Cursive Script , 2000, NIPS.

[4]  S Ullman,et al.  Shifts in selective visual attention: towards the underlying neural circuitry. , 1985, Human neurobiology.

[5]  Eric B. Baum Computational Learning & Cognition: Proceedings of the Third NEC Research Symposium , 1993 .

[6]  S. Iversen,et al.  Substance P analog, DiMe-C7: evidence for stability in rat brain and prolonged central actions. , 1982 .

[7]  J. O'Regan,et al.  Solving the "real" mysteries of visual perception: the world as an outside memory. , 1992, Canadian journal of psychology.

[8]  D. Simons,et al.  CHAPTER 13 – Change Blindness , 2005 .

[9]  David Schuster,et al.  Biologically inspired recognition system for car detection from real-time video streams , 2004 .

[10]  A. L. I︠A︡rbus Eye Movements and Vision , 1967 .

[11]  Nancy Millette,et al.  How People Look at Pictures , 1935 .

[12]  J. Henderson,et al.  Accurate visual memory for previously attended objects in natural scenes , 2002 .

[13]  Alexander Pollatsek,et al.  What Is Integrated Across Fixations , 1992 .

[14]  J. Henderson,et al.  High-level scene perception. , 1999, Annual review of psychology.

[15]  P. Neskovic,et al.  NEURAL NETWORK­BASED CONTEXT DRIVEN RECOGNITION OF ON­LINE CURSIVE SCRIP , 2004 .

[16]  Jagath C. Rajapakse,et al.  Neural Information Processing: Research and Development , 2004 .

[17]  A. L. Yarbus,et al.  Eye Movements and Vision , 1967, Springer US.