Multiple Object Response Normalization in Monkey Inferotemporal Cortex

The highest stages of the visual ventral pathway are commonly assumed to provide robust representation of object identity by disregarding confounding factors such as object position, size, illumination, and the presence of other objects (clutter). However, whereas neuronal responses in monkey inferotemporal cortex (IT) can show robust tolerance to position and size changes, previous work shows that responses to preferred objects are usually reduced by the presence of nonpreferred objects. More broadly, we do not yet understand multiple object representation in IT. In this study, we systematically examined IT responses to pairs and triplets of objects in three passively viewing monkeys across a broad range of object effectiveness. We found that, at least under these limited clutter conditions, a large fraction of the response of each IT neuron to multiple objects is reliably predicted as the average of its responses to the constituent objects in isolation. That is, multiple object responses depend primarily on the relative effectiveness of the constituent objects, regardless of object identity. This average effect becomes virtually perfect when populations of IT neurons are pooled. Furthermore, the average effect cannot simply be explained by attentional shifts but behaves as a primarily feedforward response property. Together, our observations are most consistent with mechanistic models in which IT neuronal outputs are normalized by summed synaptic drive into IT or spiking activity within IT and suggest that normalization mechanisms previously revealed at earlier visual areas are operating throughout the ventral visual stream.
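The average rule described above can be illustrated with a minimal sketch. It compares three candidate predictions for a neuron's response to several objects (average, sum, and maximum of the isolated responses) and shows how a simple divisive normalization reduces to the average. The function names, the `pool_weights` and `sigma` parameters, and the firing rates are hypothetical and chosen only for illustration; none of them come from the study.

```python
from statistics import mean

def predict_average(single_responses):
    """Average rule: predicted response is the mean of the isolated responses."""
    return mean(single_responses)

def predict_sum(single_responses):
    """Linear summation: predicted response is the sum of the isolated responses."""
    return sum(single_responses)

def predict_max(single_responses):
    """MAX rule: predicted response equals the most effective object's response."""
    return max(single_responses)

def normalized_response(single_responses, pool_weights=None, sigma=0.0):
    """Illustrative divisive normalization (an assumption, not the paper's model).

    Each object's isolated response is weighted by its contribution to a
    normalization pool, and the weighted sum is divided by the total pool
    activity plus a constant sigma. With equal weights and sigma = 0 this
    is exactly the average rule.
    """
    if pool_weights is None:
        pool_weights = [1.0] * len(single_responses)
    numerator = sum(r * w for r, w in zip(single_responses, pool_weights))
    denominator = sigma + sum(pool_weights)
    return numerator / denominator

# Hypothetical firing rates (spikes/s) to a preferred and a nonpreferred object.
pair = [40.0, 10.0]
print("average:   ", predict_average(pair))
print("sum:       ", predict_sum(pair))
print("max:       ", predict_max(pair))
print("normalized:", normalized_response(pair))
```

Under the average rule, adding a nonpreferred object pulls the response below the preferred object's isolated response, which matches the reduction the abstract describes; the sum and MAX rules do not predict such a reduction.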
