A Model of the Superior Colliculus Predicts Fixation Locations during Scene Viewing and Visual Search

Modern computational models of attention predict fixations using saliency maps and target maps, which prioritize locations for fixation based on feature contrast and target goals, respectively. But whereas many such models are biologically plausible, none have looked to the oculomotor system for design constraints or parameter specification. Conversely, although most models of saccade programming are tightly coupled to underlying neurophysiology, none have been tested using real-world stimuli and tasks. We combined the strengths of these two approaches in MASC, a model of attention in the superior colliculus (SC) that captures known neurophysiological constraints on saccade programming. We show that MASC predicted the fixation locations of humans freely viewing naturalistic scenes and performing exemplar and categorical search tasks, a breadth achieved by no other existing model. Moreover, it did this as well or better than its more specialized state-of-the-art competitors. MASC's predictive success stems from its inclusion of high-level but core principles of SC organization: an over-representation of foveal information, size-invariant population codes, cascaded population averaging over distorted visual and motor maps, and competition between motor point images for saccade programming, all of which cause further modulation of priority (attention) after projection of saliency and target maps to the SC. Only by incorporating these organizing brain principles into our models can we fully understand the transformation of complex visual information into the saccade programs underlying movements of overt attention. With MASC, a theoretical footing now exists to generate and test computationally explicit predictions of behavioral and neural responses in visually complex real-world contexts. SIGNIFICANCE STATEMENT The superior colliculus (SC) performs a visual-to-motor transformation vital to overt attention, but existing SC models cannot predict saccades to visually complex real-world stimuli. We introduce a brain-inspired SC model that outperforms state-of-the-art image-based competitors in predicting the sequences of fixations made by humans performing a range of everyday tasks (scene viewing and exemplar and categorical search), making clear the value of looking to the brain for model design. This work is significant in that it will drive new research by making computationally explicit predictions of SC neural population activity in response to naturalistic stimuli and tasks. It will also serve as a blueprint for the construction of other brain-inspired models, helping to usher in the next generation of truly intelligent autonomous systems.

[1]  R. Wurtz,et al.  Composition and topographic organization of signals sent from the frontal eye field to the superior colliculus. , 2000, Journal of neurophysiology.

[2]  J T McIlwain,et al.  Visual receptive fields and their images in superior colliculus of the cat. , 1975, Journal of neurophysiology.

[3]  R. Wurtz,et al.  Saccade-related activity in monkey superior colliculus. I. Characteristics of burst and buildup cells. , 1995, Journal of neurophysiology.

[4]  Víctor Leborán,et al.  On the relationship between optical variability, visual saliency, and eye fixations: a computational approach. , 2012, Journal of vision.

[5]  F. Bremmer,et al.  Visual receptive field modulation in the lateral intraparietal area during attentive fixation and free gaze. , 2002, Cerebral cortex.

[6]  Gabriela Csurka,et al.  Visual categorization with bags of keypoints , 2002, eccv 2004.

[7]  C. Bruce,et al.  Primate frontal eye fields. I. Single neurons discharging before saccades. , 1985, Journal of neurophysiology.

[8]  Michael S. Landy,et al.  Computational models of visual attention , 2011, Vision Research.

[9]  D. V. van Essen,et al.  Spatial Attention Effects in Macaque Area V4 , 1997, The Journal of Neuroscience.

[10]  G. Zelinsky A theory of eye movements during target acquisition. , 2008, Psychological review.

[11]  Robert G Alexander,et al.  Visual similarity effects in categorical search. , 2011, Journal of vision.

[12]  R. Klein,et al.  Searching for inhibition of return in visual search: A review , 2010, Vision Research.

[13]  C. Koch,et al.  Computational modelling of visual attention , 2001, Nature Reviews Neuroscience.

[14]  A. S. Ramoa,et al.  Intrinsic circuitry of the superior colliculus: pharmacophysiological identification of horizontally oriented inhibitory interneurons. , 1998, Journal of neurophysiology.

[15]  Wilson S. Geisler,et al.  Real-time simulation of arbitrary visual fields , 2002, ETRA.

[16]  N. J. Gandhi,et al.  Two-dimensional saccade-related population activity in superior colliculus in monkey. , 1998, Journal of neurophysiology.

[17]  J. Mcilwain Lateral spread of neural excitation during microstimulation in intermediate gray layer of cat's superior colliculus. , 1982, Journal of neurophysiology.

[18]  J. Bisley,et al.  Been there, seen that: a neural mechanism for performing efficient visual search. , 2009, Journal of neurophysiology.

[19]  G. Rhodes,et al.  Sex-specific norms code face identity. , 2011, Journal of vision.

[20]  James W Bisley,et al.  The what, where, and why of priority maps and their interactions with visual working memory , 2015, Annals of the New York Academy of Sciences.

[21]  G. G. Gregoriou,et al.  Functional imaging of the primate superior colliculus during saccades to visual targets , 2001, Nature Neuroscience.

[22]  R. Klein,et al.  A Model of Saccade Initiation Based on the Competitive Integration of Exogenous and Endogenous Signals in the Superior Colliculus , 2001, Journal of Cognitive Neuroscience.

[23]  T. Foulsham,et al.  It depends on how you look at it: Scanpath comparison in multiple dimensions with MultiMatch, a vector-based approach , 2012, Behavior Research Methods.

[24]  Pietro Perona,et al.  Graph-Based Visual Saliency , 2006, NIPS.

[25]  F. Ottes,et al.  Visuomotor fields of the superior colliculus: A quantitative model , 1986, Vision Research.

[26]  D. Munoz,et al.  Lateral inhibitory interactions in the intermediate layers of the monkey superior colliculus. , 1998, Journal of neurophysiology.

[27]  John H. R. Maunsell,et al.  The visual field representation in striate cortex of the macaque monkey: Asymmetries, anisotropies, and individual variability , 1984, Vision Research.

[28]  Raymond Klein,et al.  Inhibitory tagging system facilitates visual search , 1988, Nature.

[29]  J. Wolfe,et al.  Guided Search 2.0 A revised model of visual search , 1994, Psychonomic bulletin & review.

[30]  Julie M. Harris,et al.  Optimal integration of shading and binocular disparity for depth perception. , 2012, Journal of vision.

[31]  M. Goldberg,et al.  Neuronal Activity in the Lateral Intraparietal Area and Spatial Attention , 2003, Science.

[32]  Jeremiah Y. Cohen,et al.  The neural basis of saccade target selection , 1995 .

[33]  Edward H. Adelson,et al.  The Laplacian Pyramid as a Compact Image Code , 1983, IEEE Trans. Commun..

[34]  A. V. van Opstal,et al.  Dynamic ensemble coding of saccades in the monkey superior colliculus. , 2006, Journal of neurophysiology.

[35]  G. Zelinsky,et al.  Short article: Search guidance is proportional to the categorical specificity of a target cue , 2009, Quarterly journal of experimental psychology.

[36]  Yifan Peng,et al.  Modelling eye movements in a categorical search task , 2013, Philosophical Transactions of the Royal Society B: Biological Sciences.

[37]  Wei Zhang,et al.  The Role of Top-down and Bottom-up Processes in Guiding Eye Movements during Visual Search , 2005, NIPS.

[38]  R. Desimone,et al.  Neural mechanisms of selective visual attention. , 1995, Annual review of neuroscience.

[39]  M. A. Basso,et al.  Response Normalization in the Superficial Layers of the Superior Colliculus as a Possible Mechanism for Saccadic Averaging , 2014, The Journal of Neuroscience.

[40]  Frédo Durand,et al.  Learning to predict where humans look , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[41]  Eric L. Schwartz,et al.  Computational anatomy and functional architecture of striate cortex: A spatial mapping approach to perceptual coding , 1980, Vision Research.

[42]  Timothy F. Brady,et al.  Conceptual Distinctiveness Supports Detailed Visual Long-term Memory for Real-world Objects the Fidelity of Long-term Memory for Visual Information , 2022 .

[43]  F. Löffler,et al.  Guided cobalamin biosynthesis supports Dehalococcoides mccartyi reductive dechlorination activity , 2013, Philosophical Transactions of the Royal Society B: Biological Sciences.

[44]  Laurent Itti,et al.  Beyond bottom-up: Incorporating task-dependent influences into a computational model of spatial attention , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[45]  Michele A Basso,et al.  A hard-wired priority map in the superior colliculus shaped by asymmetric inhibitory circuitry. , 2015, Journal of neurophysiology.

[46]  Robert M. McPeek,et al.  Deficits in saccade target selection after inactivation of superior colliculus , 2004, Nature Neuroscience.

[47]  Robert A. Marino,et al.  Distinct local circuit properties of the superficial and intermediate layers of the rodent superior colliculus , 2014, The European journal of neuroscience.

[48]  M. Posner,et al.  Components of visual orienting , 1984 .

[49]  D. Sparks,et al.  The deep layers of the superior colliculus. , 1989, Reviews of oculomotor research.

[50]  A. Berthoz,et al.  From brainstem to cortex: Computational models of saccade generation circuitry , 2005, Progress in Neurobiology.

[51]  Marisa Carrasco,et al.  Attentional enhancement of spatial resolution: linking behavioural and neurophysiological evidence , 2013, Nature Reviews Neuroscience.

[52]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[53]  Michael J. Swain,et al.  Color indexing , 1991, International Journal of Computer Vision.

[54]  Jillian H. Fecteau,et al.  Salience, relevance, and firing: a priority map for target selection , 2006, Trends in Cognitive Sciences.

[55]  Xin Chen,et al.  Real-world visual search is dominated by top-down guidance , 2006, Vision Research.

[56]  Ali Borji,et al.  Analysis of Scores, Datasets, and Models in Visual Saliency Prediction , 2013, 2013 IEEE International Conference on Computer Vision.

[57]  G. Zelinsky,et al.  Modeling guidance and recognition in categorical search: bridging human and computer object detection. , 2012, Journal of vision.

[58]  Christof Koch,et al.  A Model of Saliency-Based Visual Attention for Rapid Scene Analysis , 2009 .

[59]  Gregory J Zelinsky,et al.  TAM: Explaining off-object fixations and central fixation tendencies as effects of population averaging during search , 2012, Visual cognition.

[60]  Robert A. Marino,et al.  Spatial relationships of visuomotor transformations in the superior colliculus map. , 2008, Journal of neurophysiology.

[61]  Christopher D. Carello,et al.  Manipulating Intent Evidence for a Causal Role of the Superior Colliculus in Target Selection , 2004, Neuron.

[62]  A. J. Van Opstal,et al.  Comparison of saccades evoked by visual stimulation and collicular electrical stimulation in the alert monkey , 2004, Experimental Brain Research.

[63]  R. Desimone Visual attention mediated by biased competition in extrastriate visual cortex. , 1998, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.

[64]  D. Sparks,et al.  Population coding of saccadic eye movements by neurons in the superior colliculus , 1988, Nature.

[65]  L. Itti,et al.  Search Goal Tunes Visual Features Optimally , 2007, Neuron.

[66]  Jan Theeuwes,et al.  ScanMatch: A novel method for comparing fixation sequences , 2010, Behavior research methods.

[67]  M. Goldberg,et al.  Attention, intention, and priority in the parietal lobe. , 2010, Annual review of neuroscience.

[68]  J. V. Gisbergen,et al.  Collicular ensemble coding of saccades based on vector summation , 1987, Neuroscience.

[69]  R. Krauzlis,et al.  Superior colliculus and visual spatial attention. , 2013, Annual review of neuroscience.

[70]  Tandra Ghose,et al.  Generalization between canonical and non-canonical views in object recognition. , 2013, Journal of vision.

[71]  Christopher Hunt,et al.  Notes on the OpenSURF Library , 2009 .

[72]  James T. Mcllwain Point images in the visual system: new interest in an old idea , 1986, Trends in Neurosciences.