Body-form and body-pose recognition with a hierarchical model of the ventral stream

When learning to recognize a novel body shape, e.g., a panda bear, we are not misled by changes in its pose. A "jumping panda bear" is readily recognized even though we have no prior visual experience with the conjunction of these concepts. Likewise, a novel pose can be estimated invariantly to the actor’s body shape. These body and pose recognition tasks require invariance to non-generic transformations [10, 16] that previous models of the ventral stream do not have. We show that adding biologically plausible, class-specific mechanisms that associate previously viewed actors across a range of poses enables a hierarchical model of object recognition to account for this human capability. These associations could be acquired in an unsupervised manner from past experience.
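The idea of pooling over stored associations can be illustrated with a minimal sketch. This is not the model from the paper: the feature vectors, the function names (`shape_signature`, `pose_signature`, `build_templates`), and the use of a cyclic shift as a stand-in for a pose change are all assumptions made for illustration. A cyclic shift is a generic transformation, so the invariance below is exact; for real, non-generic pose changes one would expect it to hold only approximately.

```python
def shift(vec, k):
    """Cyclically shift a list by k positions (hypothetical stand-in for a pose change)."""
    k %= len(vec)
    return vec[-k:] + vec[:-k] if k else list(vec)


def dot(u, v):
    """Inner product of two equal-length lists."""
    return sum(a * b for a, b in zip(u, v))


def build_templates(actors, n_poses):
    """templates[a][p] is the stored view of previously seen actor a in pose p."""
    return [[shift(actor, p) for p in range(n_poses)] for actor in actors]


def shape_signature(image, templates):
    """One value per known actor: the maximum response over that actor's stored poses.

    Pooling over poses is what makes this signature insensitive to pose.
    """
    return [max(dot(image, view) for view in poses) for poses in templates]


def pose_signature(image, templates):
    """One value per pose: the maximum response over all known actors in that pose.

    Pooling over actors is meant to reduce sensitivity to body shape.
    """
    n_poses = len(templates[0])
    return [max(dot(image, actor[p]) for actor in templates) for p in range(n_poses)]


# Toy data (made up): three previously viewed actors, six poses each.
known_actors = [[3, 1, 0, 0, 2, 0], [0, 2, 2, 1, 0, 0], [1, 0, 3, 0, 0, 2]]
templates = build_templates(known_actors, n_poses=6)

# A novel body shape, never stored as a template, seen in two different poses.
novel = [2, 0, 1, 3, 0, 0]
sig_a = shape_signature(shift(novel, 1), templates)
sig_b = shape_signature(shift(novel, 4), templates)
pose_sig = pose_signature(shift(novel, 1), templates)

print(sig_a == sig_b)
```

In this toy setting the two shape signatures coincide because the stored poses of each known actor cover the full set of shifts, so changing the pose of the novel input only permutes the responses being pooled. The pose signature is included for symmetry with the text; the sketch makes no claim that it is exactly invariant to body shape.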

[1] Cosimo Urgesi, et al. Magnetic Stimulation of Extrastriate Body Area Impairs Visual Processing of Nonfacial Body Parts, 2004, Current Biology.

[2] Boris Suchan, et al. Reduction of gray matter density in the extrastriate body area in women with anorexia nervosa, 2010, Behavioural Brain Research.

[3] P. Downing, et al. Selectivity for the human body in the fusiform gyrus, 2005, Journal of Neurophysiology.

[4] S. Thorpe, et al. Speed of processing in the human visual system, 1996, Nature.

[5] Nancy Kanwisher, et al. A cortical representation of the local visual environment, 1998, Nature.

[6] Cosimo Urgesi, et al. Visual body perception in anorexia nervosa, 2012, The International Journal of Eating Disorders.

[7] D. Grönemeyer, et al. Changes in neuronal correlates of body image processing by means of cognitive-behavioural body image therapy for eating disorders: a randomized controlled fMRI study, 2010, Psychological Medicine.

[8] N. Kanwisher, et al. The Fusiform Face Area: A Module in Human Extrastriate Cortex Specialized for Face Perception, 1997, The Journal of Neuroscience.

[9] Bruno Rossion, et al. Understanding the functional neuroanatomy of acquired prosopagnosia, 2007, NeuroImage.

[10] N. Kanwisher, et al. The Human Body, 2001.

[11] A. Leff, et al. Structural anatomy of pure and hemianopic alexia, 2006, Journal of Neurology, Neurosurgery & Psychiatry.

[12] Dietrich Grönemeyer, et al. Reduced connectivity between the left fusiform body area and the extrastriate body area in anorexia nervosa is associated with body image distortion, 2013, Behavioural Brain Research.

[13] S. Lehéricy, et al. The visual word form area: spatial and temporal characterization of an initial stage of reading in normal subjects and posterior split-brain patients, 2000, Brain: A Journal of Neurology.

[14] J. Devlin, et al. Triple Dissociation of Faces, Bodies, and Objects in Extrastriate Cortex, 2009, Current Biology.

[15] C. Urgesi, et al. The Neural Basis of Body Form and Body Action Agnosia, 2008, Neuron.

[16] P. Downing, et al. The role of occipitotemporal body-selective regions in person perception, 2011, Cognitive Neuroscience.

[17] G. Yovel, et al. TMS Evidence for the Involvement of the Right Occipital Face Area in Early Face Processing, 2007, Current Biology.

[18] Thomas Serre, et al. A feedforward architecture accounts for rapid categorization, 2007, Proceedings of the National Academy of Sciences.

[19] G. Winocur, et al. What Is Special about Face Recognition? Nineteen Experiments on a Person with Visual Object Agnosia and Dyslexia but Normal Face Recognition, 1997, Journal of Cognitive Neuroscience.

[20] Joel Z. Leibo, et al. Why The Brain Separates Face Recognition From Object Recognition, 2011, NIPS.

[21] Bruce D. McCandliss, et al. The visual word form area: expertise for reading in the fusiform gyrus, 2003, Trends in Cognitive Sciences.

[22] N. Kanwisher, et al. The fusiform face area: a cortical region specialized for the perception of faces, 2006, Philosophical Transactions of the Royal Society B: Biological Sciences.

[23] Lorenzo Rosasco, et al. The computational magic of the ventral stream: sketch of a theory (and why some deep architectures work), 2012.