The human voice areas: Spatial organization and inter-individual variability in temporal and extra-temporal cortices

fMRI studies increasingly examine functions and properties of non-primary areas of human auditory cortex. However there is currently no standardized localization procedure to reliably identify specific areas across individuals such as the standard ‘localizers’ available in the visual domain. Here we present an fMRI ‘voice localizer’ scan allowing rapid and reliable localization of the voice-sensitive ‘temporal voice areas’ (TVA) of human auditory cortex. We describe results obtained using this standardized localizer scan in a large cohort of normal adult subjects. Most participants (94%) showed bilateral patches of significantly greater response to vocal than non-vocal sounds along the superior temporal sulcus/gyrus (STS/STG). Individual activation patterns, although reproducible, showed high inter-individual variability in precise anatomical location. Cluster analysis of individual peaks from the large cohort highlighted three bilateral clusters of voice-sensitivity, or “voice patches” along posterior (TVAp), mid (TVAm) and anterior (TVAa) STS/STG, respectively. A series of extra-temporal areas including bilateral inferior prefrontal cortex and amygdalae showed small, but reliable voice-sensitivity as part of a large-scale cerebral voice network. Stimuli for the voice localizer scan and probabilistic maps in MNI space are available for download.

[1]  D. Grandjean,et al.  Processing of emotional vocalizations in bilateral inferior frontal cortex , 2013, Neuroscience & Biobehavioral Reviews.

[2]  Julia Kastner,et al.  Introduction to Robust Estimation and Hypothesis Testing , 2005 .

[3]  Pascal Belin,et al.  Learning-induced changes in the cerebral processing of voice identity. , 2011, Cerebral cortex.

[4]  Kenneth Carling,et al.  Resistant outlier rules and the non-Gaussian case , 1998 .

[5]  Wolfgang Grodd,et al.  Cerebral representation of non-verbal emotional perception: fMRI reveals audiovisual integration area between voice- and face-sensitive regions in the superior temporal sulcus , 2009, Neuropsychologia.

[6]  N. Kraus,et al.  Auditory brainstem's sensitivity to human voices. , 2015, International journal of psychophysiology : official journal of the International Organization of Psychophysiology.

[7]  Karl J. Friston,et al.  Analysis of fMRI Time-Series Revisited—Again , 1995, NeuroImage.

[8]  W. Grodd,et al.  The voices of seduction: cross-gender effects in processing of erotic prosody. , 2007, Social cognitive and affective neuroscience.

[9]  V. van de Ven,et al.  The brain's voices: comparing nonclinical auditory hallucinations and imagery. , 2011, Cerebral cortex.

[10]  Cyril R. Pernet,et al.  Misconceptions in the use of the General Linear Model applied to functional MRI: a tutorial for junior neuro-imagers , 2014, Front. Neurosci..

[11]  Michael Erb,et al.  Audiovisual integration of emotional signals in voice and face: An event-related fMRI study , 2007, NeuroImage.

[12]  W. K. Simmons,et al.  Circular analysis in systems neuroscience: the dangers of double dipping , 2009, Nature Neuroscience.

[13]  Pascal Belin,et al.  Gender differences in the temporal voice areas , 2014, Front. Neurosci..

[14]  Á. Miklósi,et al.  Voice-Sensitive Regions in the Dog and Human Brain Are Revealed by Comparative fMRI , 2014, Current Biology.

[15]  Karl J. Friston,et al.  Slice-timing effects and their correction in functional MRI , 2011, NeuroImage.

[16]  D. Altman,et al.  Statistics Notes: Measurement error and correlation coefficients , 1996, BMJ.

[17]  N. Kanwisher,et al.  The Fusiform Face Area: A Module in Human Extrastriate Cortex Specialized for Face Perception , 1997, The Journal of Neuroscience.

[18]  John D E Gabrieli,et al.  Human Voice Recognition Depends on Language Ability , 2011, Science.

[19]  J. Hillenbrand,et al.  Acoustic characteristics of American English vowels. , 1994, The Journal of the Acoustical Society of America.

[20]  Vincent Schmithorst,et al.  A combined bootstrap/histogram analysis approach for computing a lateralization index from neuroimaging data , 2006, NeuroImage.

[21]  Marko Wilke,et al.  LI-tool: A new toolbox to assess lateralization in functional MR-data , 2007, Journal of Neuroscience Methods.

[22]  D. Wildgruber,et al.  Emotional voice areas: anatomic location, functional properties, and structural connections revealed by combined fMRI/DTI. , 2012, Cerebral cortex.

[23]  J. Fleiss,et al.  Intraclass correlations: uses in assessing rater reliability. , 1979, Psychological bulletin.

[24]  M. Leboyer,et al.  Abnormal Voice Processing in Autism : a fMRI study , 2004 .

[25]  R. Zatorre,et al.  Voice-selective areas in human auditory cortex , 2000, Nature.

[26]  K. Scherer,et al.  The voices of wrath: brain responses to angry prosody in meaningless speech , 2005, Nature Neuroscience.

[27]  Pascal Belin,et al.  Stimulus Complexity and Categorical Effects in Human Auditory Cortex: An Activation Likelihood Estimation Meta-Analysis , 2011, Front. Psychology.

[28]  Didier Grandjean,et al.  Amygdala subregions differentially respond and rapidly adapt to threatening voices , 2013, Cortex.

[29]  D. Perrett,et al.  A differential neural response in the human amygdala to fearful and happy facial expressions , 1996, Nature.

[30]  Anders M. Dale,et al.  Cortical Surface-Based Analysis I. Segmentation and Surface Reconstruction , 1999, NeuroImage.

[31]  Simon B. Eickhoff,et al.  A new SPM toolbox for combining probabilistic cytoarchitectonic maps and functional imaging data , 2005, NeuroImage.

[32]  Sophie K. Scott,et al.  Cortical asymmetries in speech perception: what's wrong, what's right and what's left? , 2012, Trends in Cognitive Sciences.

[33]  Dimitri Van De Ville,et al.  Decoding of Emotional Information in Voice-Sensitive Cortices , 2009, Current Biology.

[34]  A. Dale,et al.  Cortical Surface-Based Analysis II: Inflation, Flattening, and a Surface-Based Coordinate System , 1999, NeuroImage.

[35]  Rainer Goebel,et al.  Measuring structural–functional correspondence: Spatial variability of specialised brain regions after macro-anatomical alignment , 2012, NeuroImage.

[36]  Karl J. Friston,et al.  Topological FDR for neuroimaging , 2010, NeuroImage.

[37]  Sophie K. Scott,et al.  Human brain mechanisms for the early analysis of voices , 2006, NeuroImage.

[38]  Anne-Lise Giraud,et al.  Distinct functional substrates along the right superior temporal sulcus for the processing of voices , 2004, NeuroImage.

[39]  Wolfgang Grodd,et al.  Cerebral processing of emotional prosody—influence of acoustic parameters and arousal , 2008, NeuroImage.

[40]  John Ashburner,et al.  A fast diffeomorphic image registration algorithm , 2007, NeuroImage.

[41]  Pascal Belin,et al.  Cerebral Processing of Voice Gender Studied Using a Continuous Carryover fMRI Design , 2012, Cerebral cortex.

[42]  Onur Güntürkün,et al.  Neuroscience and Biobehavioral Reviews the Ontogenesis of Language Lateralization and Its Relation to Handedness , 2022 .

[43]  Marc Joliot,et al.  Gaussian Mixture Modeling of Hemispheric Lateralization for Language in a Large Sample of Healthy Individuals Balanced for Handedness , 2014, PloS one.

[44]  Sophie K. Scott,et al.  Impaired auditory recognition of fear and anger following bilateral amygdala lesions , 1997, Nature.

[45]  R. Zatorre,et al.  Human temporal-lobe response to vocal sounds. , 2002, Brain research. Cognitive brain research.

[46]  Dirk Wildgruber,et al.  Functional responses and structural connections of cortical areas for processing faces and voices in the superior temporal sulcus , 2013, NeuroImage.

[47]  Jonathan D. Power,et al.  Statistical improvements in functional magnetic resonance imaging analyses produced by censoring high‐motion data points , 2014, Human brain mapping.

[48]  Rainer Goebel,et al.  Development from childhood to adulthood increases morphological and functional inter-individual variability in the right superior temporal cortex , 2013, NeuroImage.

[49]  David C. Van Essen,et al.  Application of Information Technology: An Integrated Software Suite for Surface-based Analyses of Cerebral Cortex , 2001, J. Am. Medical Informatics Assoc..

[50]  Rainer Goebel,et al.  "Who" Is Saying "What"? Brain-Based Decoding of Human Voice and Speech , 2008, Science.

[51]  Andrew D. Engell,et al.  The role of the amygdala in implicit evaluation of emotionally neutral faces. , 2008, Social cognitive and affective neuroscience.

[52]  R. Goebel,et al.  Mirror-Symmetric Tonotopic Maps in Human Primary Auditory Cortex , 2003, Neuron.

[53]  Pascal Belin,et al.  Abnormal cortical voice processing in autism , 2004, Nature Neuroscience.

[54]  Jean-François Démonet,et al.  Specific, selective or preferential: Comments on category specificity in neuroimaging , 2007, NeuroImage.

[55]  Pascal Belin,et al.  Crossmodal Adaptation in Right Posterior Superior Temporal Sulcus during Face–Voice Emotional Integration , 2014, The Journal of Neuroscience.

[56]  J. Grafman,et al.  The Human Amygdala: An Evolved System for Relevance Detection , 2003, Reviews in the neurosciences.

[57]  Mark E. Bastin,et al.  Adaptive thresholding for reliable topological inference in single subject fMRI analysis , 2012, Front. Hum. Neurosci..

[58]  Pascal Belin,et al.  Norm-Based Coding of Voice Identity in Human Auditory Cortex , 2013, Current Biology.

[59]  Bruno L. Giordano,et al.  Automatic domain-general processing of sound source identity in the left posterior middle frontal gyrus , 2014, Cortex.

[60]  Elia Formisano,et al.  Processing of Natural Sounds in Human Auditory Cortex: Tonotopy, Spectral Tuning, and Relation to Voice Sensitivity , 2012, The Journal of Neuroscience.

[61]  Pascal Belin,et al.  Amygdala responses to nonlinguistic emotional vocalizations , 2007, NeuroImage.

[62]  Bharath Chandrasekaran,et al.  Neural Processing of What and Who Information in Speech , 2011, Journal of Cognitive Neuroscience.