You had me at “Hello”: Rapid extraction of dialect information from spoken words

Research on the neuronal underpinnings of speaker identity recognition has identified voice-selective areas in the human brain, with evolutionary homologues in non-human primates, which have comparable areas for processing species-specific calls. Most studies have focused on estimating the extent and location of these areas. In contrast, relatively few experiments have used electro- or neuromagnetic methods to investigate the time course of speaker identity processing and, in particular, of dialect processing and identification. We show here that dialect extraction occurs speaker-independently, pre-attentively, and categorically. We used Standard American English and African-American English exemplars of 'Hello' in a magnetoencephalographic (MEG) Mismatch Negativity (MMN) experiment. The MMN, an automatic change-detection response of the brain, reflected dialect differences that were not entirely reducible to acoustic differences between the pronunciations of 'Hello'. Source analyses of the M100, an auditory evoked response to the vowels, suggested additional processing in voice-selective areas whenever a dialect change was detected. These findings are relevant not only for the cognitive neuroscience of language but also for the social sciences concerned with dialect and race perception.
