CHAPTER 91 – Probabilistic Models of Attention Based on Iconic Representations and Predictive Coding

We describe two models of attention that use probabilistic principles to compute task-relevant variables. In the first model, objects and visual scenes are represented iconically using spatial filters at multiple scales. A maximum likelihood–based approach is used to compute the location of a target in a given scene. The eye movements generated by such a strategy are shown to be similar to human eye movement patterns elicited during visual search in naturalistic scenes. The second model is based on the statistical concept of predictive coding. It assumes that top-down feedback from higher cortical areas conveys predictions of expected activity at lower levels while the errors in prediction are conveyed through feed-forward connections. The model explains how multiple objects in a scene can be recognized sequentially without an explicit spotlight of attention. An extension of the model provides an interpretation of object-based versus spatial attention in terms of interactions between “what and “where” networks in the visual pathway.

[1]  N. Kanwisher,et al.  Visual attention: Insights from brain imaging , 2000, Nature Reviews Neuroscience.

[2]  Rajesh P. N. Rao,et al.  Modeling Saccadic Targeting in Visual Search , 1995, NIPS.

[3]  Rajesh P. N. Rao,et al.  Dynamic Model of Visual Recognition Predicts Neural Response Properties in the Visual Cortex , 1997, Neural Computation.

[4]  D. Navon Forest before trees: The precedence of global features in visual perception , 1977, Cognitive Psychology.

[5]  Rajesh P. N. Rao,et al.  PSYCHOLOGICAL SCIENCE Research Article EYE MOVEMENTS REVEAL THE SPATIOTEMPORAL DYNAMICS OE VISUAL SEARCH , 2022 .

[6]  Rajesh P. N. Rao,et al.  Development of localized oriented receptive fields by learning a translation-invariant code for natural images. , 1998, Network.

[7]  C. Koch,et al.  A saliency-based search mechanism for overt and covert shifts of visual attention , 2000, Vision Research.

[8]  A. Oliva,et al.  From Blobs to Boundary Edges: Evidence for Time- and Spatial-Scale-Dependent Scene Recognition , 1994 .

[9]  W. Newsome,et al.  Visual attention: Spotlights, highlights and visual awareness , 1996, Current Biology.

[10]  Jitendra Malik,et al.  Computational framework for determining stereo correspondence from a set of linear spatial filters , 1992, Image Vis. Comput..

[11]  Jitendra Malik,et al.  A Computational Framework for Determining Stereo Correspondence from a Set of Linear Spatial Filters , 1991, ECCV.

[12]  Rajesh P. N. Rao,et al.  Eye movements in iconic visual search , 2002, Vision Research.

[13]  J T McIlwain,et al.  Distributed spatial coding in the superior colliculus: A review , 1991, Visual Neuroscience.

[14]  D. J. Felleman,et al.  Distributed hierarchical processing in the primate cerebral cortex. , 1991, Cerebral cortex.

[15]  Rajesh P. N. Rao,et al.  An Active Vision Architecture Based on Iconic Representations , 1995, Artif. Intell..

[16]  Christof Koch,et al.  Control of Selective Visual Attention: Modeling the Where Pathway , 1995, NIPS.

[17]  S Ullman,et al.  Shifts in selective visual attention: towards the underlying neural circuitry. , 1985, Human neurobiology.

[18]  Rajesh P. N. Rao Correlates of Attention in a Model of Dynamic Visual Recognition , 1997, NIPS.

[19]  R. Desimone,et al.  Neural mechanisms of selective visual attention. , 1995, Annual review of neuroscience.

[20]  John K. Tsotsos,et al.  Modeling Visual Attention via Selective Tuning , 1995, Artif. Intell..

[21]  Edward H. Adelson,et al.  The Design and Use of Steerable Filters , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[22]  Joachim M. Buhmann,et al.  Distortion Invariant Object Recognition in the Dynamic Link Architecture , 1993, IEEE Trans. Computers.

[23]  A. Borst Seeing smells: imaging olfactory learning in bees , 1999, Nature Neuroscience.

[24]  Geoffrey E. Hinton,et al.  Learning and relearning in Boltzmann machines , 1986 .

[25]  Steven J. Nowlan,et al.  Maximum Likelihood Competitive Learning , 1989, NIPS.

[26]  D. Whitteridge,et al.  Learning and Relearning , 1959, Science's STKE.

[27]  Geoffrey E. Hinton A Parallel Computation that Assigns Canonical Object-Based Frames of Reference , 1981, IJCAI.

[28]  Rajesh P. N. Rao,et al.  An optimal estimation approach to visual perception and learning , 1999, Vision Research.

[29]  Philipp Slusallek,et al.  Introduction to real-time ray tracing , 2005, SIGGRAPH Courses.

[30]  D. V. van Essen,et al.  A neurobiological model of visual attention and invariant pattern recognition based on dynamic routing of information , 1993, The Journal of neuroscience : the official journal of the Society for Neuroscience.