Skeletal representations of shape in human vision: Evidence for a pruned medial axis model

A representation of shape that is low dimensional and stable across minor disruptions is critical for object recognition. Computer vision research suggests that such a representation can be supported by the medial axis—a computational model for extracting a shape's internal skeleton. However, few studies have shown evidence of medial axis processing in humans, and even fewer have examined how the medial axis is extracted in the presence of disruptive contours. Here, we tested whether human skeletal representations of shape reflect the medial axis transform (MAT), a computation sensitive to all available contours, or a pruned medial axis, which ignores contours that may be considered “noise.” Across three experiments, participants (N = 2062) were shown complete, perturbed, or illusory two-dimensional shapes on a tablet computer and were asked to tap the shapes anywhere once. When directly compared with another viable model of shape perception (based on principal axes), participants' collective responses were better fit by the medial axis, and a direct test of boundary avoidance suggested that this result was not likely because of a task-specific cognitive strategy (Experiment 1). Moreover, participants' responses reflected a pruned computation in shapes with small or large internal or external perturbations (Experiment 2) and under conditions of illusory contours (Experiment 3). These findings extend previous work by suggesting that humans extract a relatively stable medial axis of shapes. A relatively stable skeletal representation, reflected by a pruned model, may be well equipped to support real-world shape perception and object recognition.

[1]  James H Elder,et al.  Shape from Contour: Computation and Representation. , 2018, Annual review of vision science.

[2]  Benjamin B. Kimia,et al.  Skeleton Search: Category-Specific Object Recognition and Segmentation Using a Skeletal Shape Model , 2011, International Journal of Computer Vision.

[3]  Philip N. Klein,et al.  Recognition of shapes by editing their shock graphs , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[4]  Alexandru Telea,et al.  An Augmented Fast Marching Method for Computing Skeletons and Centerlines , 2002, VisSym.

[5]  Adam S. Lowet,et al.  Seeing structure: Shape skeletons modulate perceived similarity , 2018, Attention, perception & psychophysics.

[6]  G K Humphrey,et al.  An Examination of the Effects of Axis Foreshortening, Monocular Depth Cues, and Visual Field on Object Identification , 1993, The Quarterly journal of experimental psychology. A, Human experimental psychology.

[7]  G. Kanizsa Subjective contours. , 1976, Scientific American.

[8]  Xing Zhang,et al.  A skeleton pruning algorithm based on information fusion , 2013, Pattern Recognit. Lett..

[9]  Eric T. Carlson,et al.  Medial Axis Shape Coding in Macaque Inferotemporal Cortex , 2012, Neuron.

[10]  E. Warrington,et al.  Visual Object Recognition in Patients with Right-Hemisphere Lesions: Axes or Features? , 1986, Perception.

[11]  D. Marr,et al.  Representation and recognition of the spatial organization of three-dimensional shapes , 1978, Proceedings of the Royal Society of London. Series B. Biological Sciences.

[12]  Manish Singh,et al.  An Integrated Bayesian Approach to Shape Representation and Perceptual Organization , 2013, Shape Perception in Human and Computer Vision.

[13]  Donald D. Hoffman,et al.  Parsing silhouettes: The short-cut rule , 1999, Perception & psychophysics.

[14]  Filipp Schmidt,et al.  Visual perception of shape altered by inferred causal history , 2016, Scientific Reports.

[15]  Benjamin B. Kimia,et al.  On the role of medial geometry in human vision , 2003, Journal of Physiology-Paris.

[16]  Jacob Feldman,et al.  The influence of shape and skeletal axis structure on texture perception. , 2009, Journal of vision.

[17]  Ernst Niebur,et al.  Medial axis generation in a model of perceptual organization , 2012, 2012 46th Annual Conference on Information Sciences and Systems (CISS).

[18]  A. Taylor,et al.  The contribution of the right parietal lobe to object recognition. , 1973, Cortex; a journal devoted to the study of the nervous system and behavior.

[19]  D. Mumford,et al.  The role of the primary visual cortex in higher level vision , 1998, Vision Research.

[20]  B. Julesz,et al.  Perceptual sensitivity maps within globally defined visual shapes , 1994, Nature.

[21]  Michael Leyton,et al.  Inferring Causal History from Shape , 1989, Cogn. Sci..

[22]  Bernd Hamann,et al.  Mathematical Foundations of Scientific Visualization, Computer Graphics, and Massive Data Exploration , 2009, Mathematics and Visualization.

[23]  Alfred M. Bruckstein,et al.  Pruning Medial Axes , 1998, Comput. Vis. Image Underst..

[24]  Ty W. Boyer,et al.  Do Eye Movements During Shape Discrimination Reveal an Underlying Geometric Structure , 2017 .

[25]  Christopher R. Genovese,et al.  Cosmic web reconstruction through density ridges: method and algorithm , 2015, 1501.05303.

[26]  Zhaoping Li,et al.  Can VI Mechanisms Account for Figure-Ground and Medial Axis Effects? , 1999, NIPS.

[27]  Tai Sing Lee,et al.  Computations in the early visual cortex , 2003, Journal of Physiology-Paris.

[28]  J. Wagemans,et al.  Segmentation of object outlines into parts: A large-scale integrative study , 2006, Cognition.

[29]  Bela Julesz,et al.  Medial-point description of shape: a representation for action coding and its psychophysical correlates , 1998, Vision Research.

[30]  B. Kimia,et al.  Perceptual Organization as Object Recognition Divided by Two , 2001 .

[31]  Albert S. Bregman,et al.  Asking the “What For” Question in Auditory Perception , 2017 .

[32]  N. Kanwisher,et al.  Attention as inference: selection is probabilistic; responses are all-or-none samples. , 2009, Journal of experimental psychology. General.

[33]  M. Leyton Symmetry, Causality, Mind , 1999 .

[34]  Stella F. Lourenco,et al.  Using Spatial Categories to Reason about Location , 2007 .

[35]  Ali Shokoufandeh,et al.  Shock Graphs and Shape Matching , 1998, International Journal of Computer Vision.

[36]  Markus Seidl,et al.  A study on skeletonization of complex petroglyph shapes , 2016, Multimedia Tools and Applications.

[37]  D. Heeger,et al.  Neuronal basis of contrast discrimination , 1999, Vision Research.

[38]  Debbie M. Kelly,et al.  Reorienting in Virtual 3D Environments: Do Adult Humans Use Principal Axes, Medial Axes or Local Geometry? , 2013, PloS one.

[39]  H. Blum Biological shape and visual science (part I) , 1973 .

[40]  D. Hubel,et al.  Receptive fields of single neurones in the cat's striate cortex , 1959, The Journal of physiology.

[41]  Benjamin B. Kimia,et al.  On the Local Form and Transitions of Symmetry Sets, Medial Axes, and Shocks , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[42]  Tyng-Luh Liu,et al.  Approximate tree matching and shape similarity , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[43]  Stephane Durocher,et al.  Comparing geometric models for orientation: Medial vs. principal axes , 2011, Communicative & integrative biology.

[44]  C R Gallistel,et al.  Shape parameters explain data from spatial transformations: comment on Pearce et al. (2004) and Tommasi & Polli (2004). , 2005, Journal of experimental psychology. Animal behavior processes.

[45]  R. Dhandapani,et al.  Role of scale in partitioning shape , 2002, Proceedings. International Conference on Image Processing.

[46]  Sven J. Dickinson,et al.  Optimal inference for hierarchical skeleton abstraction , 2004, Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004..

[47]  M. Kenward,et al.  An Introduction to the Bootstrap , 2007 .

[48]  Z Kourtzi,et al.  Representation of Perceived Object Shape by the Human Lateral Occipital Complex , 2001, Science.

[49]  Jean-Daniel Boissonnat,et al.  Stability and Computation of Medial Axes - a State-of-the-Art Report , 2009, Mathematical Foundations of Scientific Visualization, Computer Graphics, and Massive Data Exploration.

[50]  HARRY BLUM,et al.  Shape description using weighted symmetric axis features , 1978, Pattern Recognit..

[51]  Johan Wagemans,et al.  Identification of Everyday Objects on the Basis of Silhouette and Outline Versions , 2007, Perception.

[52]  E. J. Green,et al.  A Layered View of Shape Perception , 2015, The British Journal for the Philosophy of Science.

[53]  Benjamin B. Kimia,et al.  Shapes, shocks, and deformations I: The components of two-dimensional shape and the reaction-diffusion space , 1995, International Journal of Computer Vision.

[54]  Debashis Kushary,et al.  Bootstrap Methods and Their Application , 2000, Technometrics.

[55]  J. Psotka Perceptual processes that may create stick figures and balance. , 1978, Journal of experimental psychology. Human perception and performance.

[56]  Olaf Kübler,et al.  Hierarchic Voronoi skeletons , 1995, Pattern Recognit..

[57]  J. Andrade-Cetto Object Recognition , 2003 .

[58]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[59]  I. Biederman Recognition-by-components: a theory of human image understanding. , 1987, Psychological review.

[60]  S. Palmer,et al.  A century of Gestalt psychology in visual perception: I. Perceptual grouping and figure-ground organization. , 2012, Psychological bulletin.

[61]  Donald D. Hoffman,et al.  Parts of recognition , 1984, Cognition.

[62]  Jochen Lang,et al.  Skeleton pruning by contour approximation and the integer medial axis transform , 2012, Comput. Graph..

[63]  Irving Biederman,et al.  Cortical representation of medial axis structure. , 2013, Cerebral cortex.

[64]  Stephen M. Pizer,et al.  Hierarchical Shape Description Via the Multiresolution Symmetric Axis Transform , 1987, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[65]  Eileen Kowler,et al.  Localization of shapes: eye movements and perception compared , 2003, Vision Research.

[66]  Manish Singh,et al.  Bayesian estimation of the shape skeleton , 2006, Proceedings of the National Academy of Sciences.

[67]  Michael McCloskey,et al.  How Are Object Shape Axes Defined? Evidence From Mirror-Image Confusions , 2019, Journal of experimental psychology. Human perception and performance.

[68]  Roland W. Fleming,et al.  Bent out of shape: The visual inference of non-rigid shape transformations applied to objects , 2016, Vision Research.

[69]  B. Scholl,et al.  “Please Tap the Shape, Anywhere You Like” , 2014, Psychological science.

[70]  Eileen Kowler,et al.  Shapes, surfaces and saccades , 1999, Vision Research.