Neural Selectivity for Real-World Object Size In Natural Images

Real-world size is a functionally important high-level visual property of objects that supports interactions with our physical environment. Critically, real-world-size is robust over changes in visual appearance as projected onto our retinae such that large and small objects are correctly perceived to have different real-world sizes. To better understand the neural basis of this phenomenon, we examined whether the neural coding of real-world size holds for objects embedded in complex natural scene images, as well as whether real-world size effects are present for both inanimate and animate objects, whether low- and mid-level visual features can account for size selectivity, and whether neural size tuning is best described by a linear, logarithmic, or exponential neural coding function. To address these questions, we used a large-scale dataset of fMRI responses to natural images combined with per-voxel regression and contrasts. Importantly, the resultant pattern of size selectivity for objects embedded in natural scenes was aligned with prior results using isolated objects. Extending this finding, we also found that size coding exists for both animate and inanimate objects, that low-level visual features cannot account for neural size preferences, and size tuning functions have different shapes for large versus small preferring voxels. Together, these results indicate that real-world size is an ecologically significant dimension in the larger space of behaviorally-relevant cortical representations that support interactions with the world around us.

[1]  Jacob S. Prince,et al.  Selectivity for food in human ventral visual cortex , 2023, bioRxiv.

[2]  John A. Pyles,et al.  Improving the accuracy of single-trial fMRI response estimates using GLMsingle , 2022, eLife.

[3]  M. Tarr,et al.  Low-level tuning biases in higher visual cortex reflect the semantic informativeness of visual features , 2022, bioRxiv.

[4]  Jia Liu,et al.  Real-world size of objects serves as an axis of object space , 2021, Communications Biology.

[5]  Emily J. Allen,et al.  A massive 7T fMRI dataset to bridge cognitive neuroscience and artificial intelligence , 2021, Nature Neuroscience.

[6]  Vineet Gandhi,et al.  Tidying Deep Saliency Prediction Architectures , 2020, 2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS).

[7]  Russell A. Epstein,et al.  Scene Perception in the Human Brain. , 2019, Annual review of vision science.

[8]  Abhinav Gupta,et al.  BOLD5000, a public fMRI dataset while viewing 5000 visual images , 2018, Scientific Data.

[9]  Jonathan Winawer,et al.  The Human Connectome Project 7 Tesla retinotopy dataset: Description and population receptive field analysis , 2018, Journal of vision.

[10]  Talia Konkle,et al.  Mid-level visual features underlie the high-level categorical organization of the ventral stream , 2018, Proceedings of the National Academy of Sciences.

[11]  Ghislain St-Yves,et al.  The feature-weighted receptive field: an interpretable encoding model for complex feature spaces , 2017, NeuroImage.

[12]  Bria Long,et al.  A familiar-size Stroop effect in the absence of basic-level recognition , 2017, Cognition.

[13]  Russell A. Epstein,et al.  Coding of Object Size and Object Category in Human Visual Cortex , 2016, Cerebral cortex.

[14]  Michael A. Cohen,et al.  Mid-level perceptual features distinguish objects of different real-world sizes. , 2016, Journal of experimental psychology. General.

[15]  Jack L. Gallant,et al.  Pycortex: an interactive surface visualizer for fMRI , 2015, Front. Neuroinform..

[16]  Qi Zhao,et al.  SALICON: Saliency in Context , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[17]  R. Tootell,et al.  Thinking Outside the Box: Rectilinear Shapes Selectively Activate Scene-Selective Cortex , 2014, The Journal of Neuroscience.

[18]  Laura A. Kelley,et al.  Animal visual illusion and confusion: the importance of a perceptual perspective , 2014 .

[19]  A. Caramazza,et al.  Tripartite Organization of the Ventral Stream by Animacy and Object Size , 2013, The Journal of Neuroscience.

[20]  Joseph J. Lim,et al.  Sketch Tokens: A Learned Mid-level Representation for Contour and Object Detection , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition.

[21]  Roger B. H. Tootell,et al.  A Cardinal Orientation Bias in Scene-Selective Visual Cortex , 2012, The Journal of Neuroscience.

[22]  A. Oliva,et al.  A Real-World Size Organization of Object Responses in Occipitotemporal Cortex , 2012, Neuron.

[23]  A. Oliva,et al.  Canonical Visual Size for Real-world Objects , 2010 .

[24]  William A Phillips,et al.  The Ebbinghaus illusion deceives adults but not young children. , 2010, Developmental science.

[25]  Jody C Culham,et al.  Is That within Reach? fMRI Reveals That the Human Superior Parieto-Occipital Cortex Encodes Objects Reachable by the Hand , 2009, The Journal of Neuroscience.

[26]  D. Kersten,et al.  The representation of perceived angular size in human primary visual cortex , 2006, Nature Neuroscience.

[27]  M. Bar Visual objects in context , 2004, Nature Reviews Neuroscience.

[28]  Antonio Torralba,et al.  Modeling the Shape of the Scene: A Holistic Representation of the Spatial Envelope , 2001, International Journal of Computer Vision.

[29]  Bruce D. McCandliss,et al.  The visual word form area: expertise for reading in the fusiform gyrus , 2003, Trends in Cognitive Sciences.

[30]  R. Passingham,et al.  Objects automatically potentiate action: an fMRI study of implicit processing , 2003, The European journal of neuroscience.

[31]  R. Ellis,et al.  The potentiation of grasp types during visual object categorization , 2001 .

[32]  S Lehéricy,et al.  The visual word form area: spatial and temporal characterization of an initial stage of reading in normal subjects and posterior split-brain patients. , 2000, Brain : a journal of neurology.

[33]  Nancy Kanwisher,et al.  A cortical representation of the local visual environment , 1998, Nature.

[34]  N. Kanwisher,et al.  The Fusiform Face Area: A Module in Human Extrastriate Cortex Specialized for Face Perception , 1997, The Journal of Neuroscience.

[35]  J. Sergent,et al.  Functional neuroanatomy of face and object processing. A positron emission tomography study. , 1992, Brain : a journal of neurology.

[36]  A. Slater,et al.  Size constancy at birth: newborn infants' responses to retinal and real size. , 1990, Journal of experimental child psychology.

[37]  T. Shallice,et al.  Category specific semantic impairments. , 1998, Brain : a journal of neurology.