A dual role for shape skeletons in human vision: perceptual organization and object recognition

Shape perception is crucial for object recognition. However, it remains unknown exactly how shape information is represented, and, consequently, used by the visual system. Here, we hypothesized that the visual system represents “shape skeletons” to both (1) perceptually organize contours and component parts into a shape percept, and (2) compare shapes to recognize objects. Using functional magnetic resonance imaging (fMRI) and representational similarity analysis (RSA), we found that a model of skeletal similarity explained significant unique variance in the response profiles of V3 and LO, regions known to be involved in perceptual organization and object recognition, respectively. Moreover, the skeletal model remained predictive in these regions even when controlling for other models of visual similarity that approximate low- to high-level visual features (i.e., Gabor-jet, GIST, HMAX, and AlexNet), and across different surface forms, a manipulation that altered object contours while preserving the underlying skeleton. Together, these findings shed light on the functional roles of shape skeletons in human vision, as well as the computational properties of V3 and LO.

[1]  Nikolaus Kriegeskorte,et al.  Frontiers in Systems Neuroscience Systems Neuroscience , 2022 .

[2]  Philip N. Klein,et al.  Recognition of shapes by editing their shock graphs , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3]  R. O’Brien,et al.  A Caution Regarding Rules of Thumb for Variance Inflation Factors , 2007 .

[4]  Hongjing Lu,et al.  Deep convolutional networks do not classify based on global object shape , 2018, PLoS Comput. Biol..

[5]  Liang Wang,et al.  Probabilistic Maps of Visual Topography in Human Cortex. , 2015, Cerebral cortex.

[6]  N. Kanwisher,et al.  The lateral occipital complex and its role in object recognition , 2001, Vision Research.

[7]  Rüdiger von der Heydt,et al.  Spike Synchrony Reveals Emergence of Proto-Objects in Visual Cortex , 2015, The Journal of Neuroscience.

[8]  Sven J. Dickinson,et al.  Local contour symmetry facilitates scene categorization , 2019, Cognition.

[9]  Peter U. Tse,et al.  Extrastriate cortical activity reflects segmentation of motion into independent sources , 2010, Neuropsychologia.

[10]  Wei Ji Ma,et al.  Bayesian microsaccade detection , 2017, Journal of vision.

[11]  K. Grill-Spector,et al.  The dynamics of object-selective activation correlate with recognition performance in humans , 2000, Nature Neuroscience.

[12]  Sabine Kastner,et al.  Defining the Units of Competition: Influences of Perceptual Organization on Competitive Interactions in Human Visual Cortex , 2010, Journal of Cognitive Neuroscience.

[13]  Bela Julesz,et al.  Medial-point description of shape: a representation for action coding and its psychophysical correlates , 1998, Vision Research.

[14]  Robert L. Whitwell,et al.  The lateral‐occipital and the inferior‐frontal cortex play different roles during the naming of visually presented objects , 2009, Human brain mapping.

[15]  S. Kastner,et al.  The Functional Neuroanatomy of Object Agnosia: A Case Study , 2010, Neuron.

[16]  David Cox,et al.  Recurrent computations for visual pattern completion , 2017, Proceedings of the National Academy of Sciences.

[17]  Matthias Bethge,et al.  ImageNet-trained CNNs are biased towards texture; increasing shape bias improves accuracy and robustness , 2018, ICLR.

[18]  S Kastner,et al.  Topographic organization of areas V3 and V4 and its relation to supra-areal organization of the primate visual system , 2015, Visual Neuroscience.

[19]  Geoffrey E. Hinton,et al.  ImageNet classification with deep convolutional neural networks , 2012, Commun. ACM.

[20]  Scott L. Brincat,et al.  Dynamic Shape Synthesis in Posterior Inferotemporal Cortex , 2006, Neuron.

[21]  Russell A. Epstein,et al.  Computational mechanisms underlying cortical responses to the affordance properties of visual scenes , 2017, bioRxiv.

[22]  Daniel L. K. Yamins,et al.  Deep Neural Networks Rival the Representation of Primate IT Cortex for Core Visual Object Recognition , 2014, PLoS Comput. Biol..

[23]  Irving Biederman,et al.  An applet for the Gabor similarity scaling of the differences between complex stimuli , 2016, Attention, perception & psychophysics.

[24]  Shimon Ullman,et al.  Atoms of recognition in human and computer vision , 2016, Proceedings of the National Academy of Sciences.

[25]  R. von der Heydt,et al.  A neural model of figure-ground organization. , 2007, Journal of neurophysiology.

[26]  Sven J. Dickinson,et al.  Scene Categorization From Contours: Medial Axis Based Salience Measures , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[27]  Alex R. Wade,et al.  Emergence of symmetry selectivity in the visual areas of the human brain: fMRI responses to symmetry presented in both frontoparallel and slanted planes , 2018, Human brain mapping.

[28]  Charles E Connor,et al.  Underlying principles of visual shape selectivity in posterior inferotemporal cortex , 2004, Nature Neuroscience.

[29]  P. Downing,et al.  Selectivity for the human body in the fusiform gyrus. , 2005, Journal of neurophysiology.

[30]  F. Qiu,et al.  Neural representation of transparent overlay , 2007, Nature Neuroscience.

[31]  Talia Konkle,et al.  Mid-level visual features underlie the high-level categorical organization of the ventral stream , 2018, Proceedings of the National Academy of Sciences.

[32]  I. Biederman Recognition-by-components: a theory of human image understanding. , 1987, Psychological review.

[33]  J. Feldman,et al.  Superordinate shape classification using natural shape statistics , 2011, Cognition.

[34]  Stella F. Lourenco,et al.  Skeletal representations of shape in human vision: Evidence for a pruned medial axis model , 2019, Journal of vision.

[35]  Marlene Behrmann,et al.  Correction: The large-scale organization of shape processing in the ventral and dorsal pathways. , 2017 .

[36]  B. Scholl,et al.  “Please Tap the Shape, Anywhere You Like” , 2014, Psychological science.

[37]  Wim Vanduffel,et al.  Symmetry activates extrastriate visual cortex in human and nonhuman primates. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[38]  George A Alvarez,et al.  Mid-level perceptual features contain early cues to animacy. , 2017, Journal of vision.

[39]  G. Orban,et al.  The kinetic occipital region in human visual cortex. , 1997, Cerebral cortex.

[40]  Johan Wagemans,et al.  Configural Gestalts remain nothing more than the sum of their parts in visual agnosia , 2013, i-Perception.

[41]  James J. DiCarlo,et al.  Evidence that recurrent circuits are critical to the ventral stream’s execution of core object recognition behavior , 2018, Nature Neuroscience.

[42]  Jonas Kubilius,et al.  Brain-Score: Which Artificial Neural Network for Object Recognition is most Brain-Like? , 2018, bioRxiv.

[43]  G. Aguirre,et al.  Different spatial scales of shape similarity representation in lateral and ventral LOC. , 2009, Cerebral cortex.

[44]  James J. DiCarlo,et al.  How Does the Brain Solve Visual Object Recognition? , 2012, Neuron.

[45]  Richard C Saunders,et al.  Receptive field focus of visual area V4 neurons determines responses to illusory surfaces , 2013, Proceedings of the National Academy of Sciences.

[46]  Yuka Sasaki,et al.  Processing local signals into global patterns , 2007, Current Opinion in Neurobiology.

[47]  Ha Hong,et al.  Performance-optimized hierarchical models predict neural responses in higher visual cortex , 2014, Proceedings of the National Academy of Sciences.

[48]  Irving Biederman,et al.  Sensitivity to nonaccidental properties across various shape dimensions , 2012, Vision Research.

[49]  J. Psotka Perceptual processes that may create stick figures and balance. , 1978, Journal of experimental psychology. Human perception and performance.

[50]  Jack L. Gallant,et al.  Fourier power, subjective distance, and object categories all provide plausible models of BOLD responses in scene-selective visual areas , 2015, Front. Comput. Neurosci..

[51]  Ernst Niebur,et al.  Medial axis generation in a model of perceptual organization , 2012, 2012 46th Annual Conference on Information Sciences and Systems (CISS).

[52]  I. Biederman,et al.  Subordinate-level object classification reexamined , 1999, Psychological research.

[53]  Daniel D. Dilks,et al.  Mirror-Image Sensitivity and Invariance in Object and Scene Processing Pathways , 2011, The Journal of Neuroscience.

[54]  Thomas Serre,et al.  A feedforward architecture accounts for rapid categorization , 2007, Proceedings of the National Academy of Sciences.

[55]  N. Logothetis,et al.  Visual Areas in Macaque Cortex Measured Using Functional Magnetic Resonance Imaging , 2002, The Journal of Neuroscience.

[56]  S. Edelman,et al.  Cue-Invariant Activation in Object-Related Areas of the Human Occipital Lobe , 1998, Neuron.

[57]  Adam S. Lowet,et al.  Seeing structure: Shape skeletons modulate perceived similarity , 2018, Attention, perception & psychophysics.

[58]  Benjamin B. Kimia,et al.  On the role of medial geometry in human vision , 2003, Journal of Physiology-Paris.

[59]  Jörn Diedrichsen,et al.  Reliability of dissimilarity measures for multi-voxel pattern analysis , 2016, NeuroImage.

[60]  A. Parker Binocular depth perception and the cerebral cortex , 2007, Nature Reviews Neuroscience.

[61]  Andy C. H. Lee,et al.  Temporal lobe contribution to perceptual function: A tale of three patient groups , 2016, Neuropsychologia.

[62]  M. Behrmann,et al.  Independent representation of parts and the relations between them: evidence from integrative agnosia. , 2006, Journal of experimental psychology. Human perception and performance.

[63]  Paul Schrater,et al.  Shape perception reduces activity in human primary visual cortex , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[64]  Manish Singh,et al.  Bayesian estimation of the shape skeleton , 2006, Proceedings of the National Academy of Sciences.

[65]  Heinrich H Bülthoff,et al.  Image-based object recognition in man, monkey and machine , 1998, Cognition.

[66]  H. P. Op de Beeck,et al.  Dissociations and Associations between Shape and Category Representations in the Two Visual Pathways , 2015, The Journal of Neuroscience.

[67]  Johan Wagemans,et al.  The representation of symmetry in multi-voxel response patterns and functional connectivity throughout the ventral visual stream , 2019, NeuroImage.

[68]  David C. Plaut,et al.  Protracted Developmental Trajectory of Shape Processing along the Two Visual Pathways , 2019, Journal of Cognitive Neuroscience.

[69]  Soojin Park,et al.  Neural representation of object orientation: A dissociation between MVPA and Repetition Suppression , 2016, NeuroImage.

[70]  Vladislav Ayzenberg,et al.  Skeletal descriptions of shape provide unique perceptual information for object recognition , 2019, bioRxiv.

[71]  HARRY BLUM,et al.  Shape description using weighted symmetric axis features , 1978, Pattern Recognit..

[72]  J. Hegdé,et al.  A comparative study of shape representation in macaque visual areas v2 and v4. , 2007, Cerebral cortex.

[73]  Andrew E Welchman,et al.  The Human Brain in Depth: How We See in 3D. , 2016, Annual review of vision science.

[74]  D. J. Felleman,et al.  Receptive field properties of neurons in area V3 of macaque monkey extrastriate cortex. , 1987, Journal of neurophysiology.

[75]  Vladislav Ayzenberg,et al.  Skeletal descriptions of shape provide unique perceptual information for object recognition , 2019 .

[76]  H. Neumann,et al.  The Role of Attention in Figure-Ground Segregation in Areas V1 and V4 of the Visual Cortex , 2012, Neuron.

[77]  Irving Biederman,et al.  Cortical representation of medial axis structure. , 2013, Cerebral cortex.

[78]  Leila Montaser-Kouhsari,et al.  Orientation-selective adaptation to illusory contours in human visual cortex. , 2010, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[79]  Antonio Torralba,et al.  Building the gist of a scene: the role of global image features in recognition. , 2006, Progress in brain research.

[80]  R. von der Heydt,et al.  Coding of Border Ownership in Monkey Visual Cortex , 2000, The Journal of Neuroscience.

[81]  N. Kanwisher,et al.  The Human Body , 2001 .

[82]  Colin W. G. Clifford,et al.  The influence of global form on local orientation anisotropies in human visual cortex , 2010, NeuroImage.

[83]  Po-Jang Hsieh,et al.  fMRI reveals that non‐local processing in ventral retinotopic cortex underlies perceptual grouping by temporal synchrony , 2008, Human brain mapping.

[84]  I. Biederman,et al.  Shape Tuning in Macaque Inferior Temporal Cortex , 2003, The Journal of Neuroscience.

[85]  Benjamin B. Kimia,et al.  Skeleton Search: Category-Specific Object Recognition and Segmentation Using a Skeletal Shape Model , 2011, International Journal of Computer Vision.

[86]  Chaz Firestone,et al.  Can you simultaneously represent a figure as both an object and an open contour? Hybrid shape representations revealed by the "tap-the-shape" task. , 2015, Journal of vision.

[87]  Saharon Shelah On Independent Representation , 2001 .

[88]  Eric T. Carlson,et al.  Medial Axis Shape Coding in Macaque Inferotemporal Cortex , 2012, Neuron.

[89]  Francesco Lacquaniti,et al.  Ocular tracking of occluded ballistic trajectories: Effects of visual context and of target law of motion. , 2019, Journal of vision.

[90]  Marcelo Gomes Mattar,et al.  de Bruijn cycles for neural decoding , 2011, NeuroImage.

[91]  J. Feldman,et al.  Shape discrimination along morph-spaces , 2019, Vision Research.

[92]  Mark W. Woolrich,et al.  Advances in functional and structural MR image analysis and implementation as FSL , 2004, NeuroImage.

[93]  Morgan D. Barense,et al.  The human medial temporal lobe processes online representations of complex objects , 2007, Neuropsychologia.

[94]  Lina I. Davitt,et al.  Shape Information Mediating Basic- and Subordinate-Level Object Recognition Revealed by Analyses of Eye Movements , 2013, Journal of experimental psychology. Human perception and performance.