Category Training Induces Cross-modal Object Representations in the Adult Human Brain

The formation of cross-modal object representations was investigated using a novel paradigm that was previously successful in establishing unimodal visual category learning in monkeys and humans. The stimulus set consisted of six categories of bird shapes and sounds that were morphed to create different exemplars of each category. Subjects learned new cross-modal bird categories using a one-back task. Over time, the subjects became faster and more accurate in categorizing the birds. After 3 days of training, subjects were scanned while passively viewing and listening to trained and novel bird types. Stimulus blocks consisted of bird sounds only, bird pictures only, matching pictures and sounds (cross-modal congruent), and mismatching pictures and sounds (cross-modal incongruent). fMRI data showed unimodal and cross-modal training effects in the right fusiform gyrus. In addition, the left STS showed cross-modal training effects in the absence of unimodal training effects. Importantly, for both the right fusiform gyrus and the left STS, the newly formed cross-modal representation was specific for the trained categories. Learning did not generalize to incongruent combinations of learned sounds and shapes; their response did not differ from the response to novel cross-modal bird types. Moreover, responses were larger for congruent than for incongruent cross-modal bird types in the right fusiform gyrus and STS, providing further evidence that categorization training induced the formation of meaningful cross-modal object representations.

[1]  I. Gauthier,et al.  Expertise for cars and birds recruits brain areas involved in face recognition , 2000, Nature Neuroscience.

[2]  Robert J Zatorre,et al.  Anatomical Correlates of Learning Novel Speech Sounds , 2002, Neuron.

[3]  A. Amedi,et al.  Functional imaging of human crossmodal identification and object recognition , 2005, Experimental Brain Research.

[4]  Raphael M Barishansky,et al.  Birds of a feather flock together , 1998, Nature.

[5]  R. Campbell,et al.  Evidence from functional magnetic resonance imaging of crossmodal binding in the human heteromodal cortex , 2000, Current Biology.

[6]  Lars Muckli,et al.  Cortical Plasticity of Audio–Visual Object Representations , 2008, Cerebral cortex.

[7]  M. Naumer,et al.  Semantics and the multisensory brain: How meaning modulates processes of audio-visual integration , 2008, Brain Research.

[8]  M. Honda,et al.  Behavioral / Systems / Cognitive Functionally Segregated Neural Substrates for Arbitrary Audiovisual Paired-Association Learning , 2005 .

[9]  J. Kaiser,et al.  Object Familiarity and Semantic Congruency Modulate Responses in Cortical Audiovisual Integration Areas , 2007, The Journal of Neuroscience.

[10]  Andreas Kleinschmidt,et al.  Interaction of Face and Voice Areas during Speaker Recognition , 2005, Journal of Cognitive Neuroscience.

[11]  M. van Turennout,et al.  UvA-DARE ( Digital Academic Repository ) Birds of a feather flock together : Experience-driven formation of visual object categories in human ventral temporal cortex , 2008 .

[12]  Gian Luca Romani,et al.  Audio-visual crossmodal interactions in environmental perception: an fMRI investigation , 2004, Cognitive Processing.

[13]  C. Price,et al.  The role of the posterior superior temporal sulcus in audiovisual processing. , 2008, Cerebral cortex.

[14]  Thomas E. Nichols,et al.  Thresholding of Statistical Maps in Functional Neuroimaging Using the False Discovery Rate , 2002, NeuroImage.

[15]  Michael X. Cohen,et al.  Neural Mechanisms of Expert Skills in Visual Working Memory , 2006, The Journal of Neuroscience.

[16]  E. G. Jones Cerebral Cortex , 1987, Cerebral Cortex.

[17]  G. Rhodes,et al.  Is the Fusiform Face Area Specialized for Faces, Individuation, or Expert Individuation? , 2004, Journal of Cognitive Neuroscience.

[18]  Alex Martin,et al.  A neural system for learning about object function. , 2006, Cerebral cortex.

[19]  B. Argall,et al.  Integration of Auditory and Visual Information about Objects in Superior Temporal Sulcus , 2004, Neuron.

[20]  Rutvik H. Desai,et al.  Specialization along the Left Superior Temporal Sulcus for Auditory Categorization , 2010, Cerebral cortex.

[21]  Alex Martin,et al.  Semantic memory and the brain: structure and processes , 2001, Current Opinion in Neurobiology.

[22]  A. Giraud,et al.  Implicit Multisensory Associations Influence Voice Recognition , 2006, PLoS biology.

[23]  J. Talairach,et al.  Co-Planar Stereotaxic Atlas of the Human Brain: 3-Dimensional Proportional System: An Approach to Cerebral Imaging , 1988 .

[24]  B. Argall,et al.  Unraveling multisensory integration: patchy organization within human STS multisensory cortex , 2004, Nature Neuroscience.

[25]  R. Goebel,et al.  Integration of Letters and Speech Sounds in the Human Brain , 2004, Neuron.

[26]  Rainer Goebel,et al.  Top–down task effects overrule automatic multisensory responses to letter–sound pairs in auditory association cortex , 2006, NeuroImage.

[27]  Rainer Goebel,et al.  The effect of temporal asynchrony on the multisensory integration of letters and speech sounds. , 2006, Cerebral cortex.

[28]  Karl J. Friston,et al.  Psychophysiological and Modulatory Interactions in Neuroimaging , 1997, NeuroImage.

[29]  K. Grill-Spector,et al.  Repetition and the brain: neural models of stimulus-specific effects , 2006, Trends in Cognitive Sciences.

[30]  J. Downar,et al.  A cortical network sensitive to stimulus salience in a neutral behavioral context across multiple sensory modalities. , 2002, Journal of neurophysiology.

[31]  Jeffrey R. Binder,et al.  Left Posterior Temporal Regions are Sensitive to Auditory Categorization , 2008, Journal of Cognitive Neuroscience.

[32]  Yaoda Xu Revisiting the role of the fusiform face area in visual expertise. , 2005, Cerebral cortex.

[33]  M. Riesenhuber,et al.  Categorization Training Results in Shape- and Category-Selective Human Neural Plasticity , 2007, Neuron.

[34]  Jeffery A. Jones,et al.  Multisensory Integration Sites Identified by Perception of Spatial Wavelet Filtered Visual Speech Gesture Information , 2004, Journal of Cognitive Neuroscience.

[35]  M. Tarr,et al.  Activation of the middle fusiform 'face area' increases with expertise in recognizing novel objects , 1999, Nature Neuroscience.

[36]  N. Kanwisher,et al.  Discrimination Training Alters Object Representations in Human Extrastriate Cortex , 2006, The Journal of Neuroscience.

[37]  L. Tyler,et al.  Binding crossmodal object features in perirhinal cortex. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[38]  R. Poldrack,et al.  Recovering Meaning Left Prefrontal Cortex Guides Controlled Semantic Retrieval , 2001, Neuron.

[39]  P. Bertelson,et al.  Multisensory integration, perception and ecological validity , 2003, Trends in Cognitive Sciences.

[40]  Rainer Goebel,et al.  An Efficient Algorithm for Topologically Correct Segmentation of the Cortical Sheet in Anatomical MR Volumes , 2001, NeuroImage.

[41]  Peter Indefrey,et al.  Formation of Category Representations in Superior Temporal Sulcus , 2010, Journal of Cognitive Neuroscience.

[42]  David J. Freedman,et al.  A Comparison of Primate Prefrontal and Inferior Temporal Cortices during Visual Categorization , 2003, The Journal of Neuroscience.

[43]  T. Chaminade,et al.  Stone tools, language and the brain in human evolution , 2012, Philosophical Transactions of the Royal Society B: Biological Sciences.

[44]  David J. Freedman,et al.  Categorical representation of visual stimuli in the primate prefrontal cortex. , 2001, Science.

[45]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[46]  Johan Wagemans,et al.  Subordinate Categorization Enhances the Neural Selectivity in Human Object-selective Cortex for Fine Shape Differences , 2009, Journal of Cognitive Neuroscience.

[47]  D. Heeger,et al.  Linear Systems Analysis of Functional Magnetic Resonance Imaging in Human V1 , 1996, The Journal of Neuroscience.