Object Ensemble Processing in Human Anterior-Medial Ventral Visual Cortex

Our visual system can extract summary statistics from large collections of similar objects without forming detailed representations of the individual objects in the ensemble. Such object ensemble representation is adaptive and allows us to overcome the capacity limitation associated with representing specific objects. Surprisingly, little is known about the neural mechanisms supporting such object ensemble representation. Here we showed human observers identical photographs of the same object ensemble, different photographs depicting the same ensemble, or different photographs depicting different ensembles. We observed fMRI adaptation in anterior-medial ventral visual cortex whenever object ensemble statistics repeated, even when local image features differed across photographs. Interestingly, such object ensemble processing is closely related to texture and scene processing in the brain. In contrast, the lateral occipital area, a region involved in object–shape processing, showed adaptation only when identical photographs were repeated. These results provide the first step toward understanding the neural underpinnings of real-world object ensemble representation.

[1]  M. Chun,et al.  Dissociating Task Performance from fMRI Repetition Attenuation in Ventral Visual Cortex , 2007, The Journal of Neuroscience.

[2]  Dwight J. Kravitz,et al.  Real-World Scene Representations in High-Level Visual Cortex: It's the Spaces More Than the Places , 2011, The Journal of Neuroscience.

[3]  Russell A. Epstein Parahippocampal and retrosplenial contributions to human spatial navigation , 2008, Trends in Cognitive Sciences.

[4]  R. Malach,et al.  Object-related activity revealed by functional magnetic resonance imaging in human occipital cortex. , 1995, Proceedings of the National Academy of Sciences of the United States of America.

[5]  D H Brainard,et al.  The Psychophysics Toolbox. , 1997, Spatial vision.

[6]  D. Ariely Seeing Sets: Representation by Statistical Properties , 2001, Psychological science.

[7]  Robert Sekuler,et al.  Coherent global motion percepts from stochastic local motions , 1984, Vision Research.

[8]  Russell A. Epstein,et al.  Differential parahippocampal and retrosplenial involvement in three types of visual scene recognition. , 2006, Cerebral cortex.

[9]  J. Lund,et al.  Compulsory averaging of crowded orientation signals in human vision , 2001, Nature Neuroscience.

[10]  M. Torrens Co-Planar Stereotaxic Atlas of the Human Brain—3-Dimensional Proportional System: An Approach to Cerebral Imaging, J. Talairach, P. Tournoux. Georg Thieme Verlag, New York (1988), 122 pp., 130 figs. DM 268 , 1990 .

[11]  A. Treisman,et al.  Statistical processing: computing the average size in perceptual groups , 2005, Vision Research.

[12]  Sharon L. Thompson-Schill,et al.  Learning Places from Views: Variation in Scene Processing as a Function of Experience and Navigational Ability , 2005, Journal of Cognitive Neuroscience.

[13]  A. Treisman,et al.  Attentional spread in the statistical processing of visual displays , 2005, Perception & psychophysics.

[14]  Z Kourtzi,et al.  Representation of Perceived Object Shape by the Human Lateral Occipital Complex , 2001, Science.

[15]  Jonathan S. Cant,et al.  fMR-adaptation reveals separate processing regions for the perception of form and texture in the human ventral stream , 2008, Experimental Brain Research.

[16]  G K Humphrey,et al.  The Role of Surface Information in Object Recognition: Studies of a Visual Form Agnosic and Normal Subjects , 1994, Perception.

[17]  Naokazu Goda,et al.  Transformation from image-based to perceptual representation of materials along the human ventral visual pathway , 2011, NeuroImage.

[18]  M. Chun,et al.  Selecting and perceiving multiple visual objects , 2009, Trends in Cognitive Sciences.

[19]  Aude Oliva,et al.  Spatial ensemble statistics are efficient codes that can be represented with reduced attention , 2009, Proceedings of the National Academy of Sciences.

[20]  Jonathan S. Cant,et al.  Scratching Beneath the Surface: New Insights into the Functional Properties of the Lateral Occipital Area and Parahippocampal Place Area , 2011, The Journal of Neuroscience.

[21]  Zoe Kourtzi,et al.  Spatiotemporal characteristics of form analysis in the human visual cortex revealed by rapid event-related fMRI adaptation , 2005, NeuroImage.

[22]  S. Edelman,et al.  Differential Processing of Objects under Various Viewing Conditions in the Human Lateral Occipital Complex , 1999, Neuron.

[23]  Nancy Kanwisher,et al.  Divide and conquer: A defense of functional localizers , 2006, NeuroImage.

[24]  A. Treisman,et al.  Representation of statistical properties , 2003, Vision Research.

[25]  G. Orban,et al.  Attention to 3-D Shape, 3-D Motion, and Texture in 3-D Structure from Motion Displays , 2004, Journal of Cognitive Neuroscience.

[26]  Ravi S. Menon,et al.  Intrinsic signal changes accompanying sensory stimulation: functional brain mapping with magnetic resonance imaging. , 1992, Proceedings of the National Academy of Sciences of the United States of America.

[27]  K. Grill-Spector,et al.  Repetition and the brain: neural models of stimulus-specific effects , 2006, Trends in Cognitive Sciences.

[28]  Eero P. Simoncelli,et al.  A Parametric Texture Model Based on Joint Statistics of Complex Wavelet Coefficients , 2000, International Journal of Computer Vision.

[29]  Chun-Chia Kung,et al.  Is Region-of-Interest Overlap Comparison a Reliable Measure of Category Specificity? , 2007, Journal of Cognitive Neuroscience.

[30]  H. Wilson,et al.  Size-invariant but viewpoint-dependent representation of faces , 2006, Vision Research.

[31]  Kalanit Grill-Spector,et al.  Representation of shapes, edges, and surfaces across multiple cues in the human visual cortex. , 2008, Journal of neurophysiology.

[32]  Andrew P. Duchon,et al.  The human visual system averages speed information , 1992, Vision Research.

[33]  M. Goodale,et al.  Sight Unseen: An Exploration of Conscious and Unconscious Vision , 2004 .

[34]  D G Pelli,et al.  The VideoToolbox software for visual psychophysics: transforming numbers into movies. , 1997, Spatial vision.

[35]  Robert W. Kentridge,et al.  Separate channels for processing form, texture, and color: evidence from FMRI adaptation and visual object agnosia. , 2010, Cerebral cortex.

[36]  S. Edelman,et al.  Human Brain Mapping 6:316–328(1998) � A Sequence of Object-Processing Stages Revealed by fMRI in the Human Occipital Lobe , 2022 .

[37]  Timothy J. Andrews,et al.  Distinct representations for facial identity and changeable aspects of faces in the human temporal lobe , 2004, NeuroImage.

[38]  Suk Won Han,et al.  The neural correlates of visual working memory encoding: A time-resolved fMRI study , 2011, Neuropsychologia.

[39]  Yaoda Xu,et al.  The Neural Fate of Task-Irrelevant Features in Object-Based Processing , 2010, The Journal of Neuroscience.

[40]  Dhiraj Joshi,et al.  Object Categorization: Computer and Human Vision Perspectives , 2008 .

[41]  Paul E. Downing,et al.  Viewpoint-Specific Scene Representations in Human Parahippocampal Cortex , 2003, Neuron.

[42]  A. Oliva,et al.  The Representation of Simple Ensemble Visual Features Outside the Focus of Attention , 2008, Psychological science.

[43]  A. Oliva,et al.  Diagnostic Colors Mediate Scene Recognition , 2000, Cognitive Psychology.

[44]  Robert W. Kentridge,et al.  Separate processing of texture and form in the ventral stream: evidence from FMRI and visual agnosia. , 2010, Cerebral cortex.

[45]  Kalanit Grill-Spector,et al.  Object Categorization: What Has fMRI Taught Us About Object Recognition? , 2009 .

[46]  G. Alvarez Representing multiple objects as an ensemble enhances visual cognition , 2011, Trends in Cognitive Sciences.

[47]  Karl J. Friston,et al.  Statistical parametric maps in functional imaging: A general linear approach , 1994 .

[48]  Jonathan S. Cant,et al.  Cerebral Cortex Advance Access published April 28, 2006 Attention to Form or Surface Properties Modulates Different Regions of Human , 2022 .

[49]  K. Grill-Spector,et al.  The dynamics of object-selective activation correlate with recognition performance in humans , 2000, Nature Neuroscience.

[50]  M. Chun,et al.  Dissociable neural mechanisms supporting visual short-term memory for objects , 2006, Nature.

[51]  Timothy J. Andrews,et al.  fMR-adaptation reveals a distributed representation of inanimate objects and places in human visual cortex , 2005, NeuroImage.

[52]  N. Kanwisher,et al.  The Fusiform Face Area: A Module in Human Extrastriate Cortex Specialized for Face Perception , 1997, The Journal of Neuroscience.

[53]  Daniel D. Dilks,et al.  Mirror-Image Sensitivity and Invariance in Object and Scene Processing Pathways , 2011, The Journal of Neuroscience.

[54]  H. Bülthoff,et al.  Perceptual Organization of Local Elements into Global Shapes in the Human Visual Cortex , 2003, Current Biology.

[55]  Antonio Torralba,et al.  Modeling the Shape of the Scene: A Holistic Representation of the Spatial Envelope , 2001, International Journal of Computer Vision.

[56]  Nancy Kanwisher,et al.  A cortical representation of the local visual environment , 1998, Nature.