When speaker identity is unavoidable: Neural processing of speaker identity cues in natural speech

Speech sound acoustic properties vary largely across speakers and accents. When perceiving speech, adult listeners normally disregard non-linguistic variation caused by speaker or accent differences, in order to comprehend the linguistic message, e.g. to correctly identify a speech sound or a word. Here we tested whether the process of normalizing speaker and accent differences, facilitating the recognition of linguistic information, is found at the level of neural processing, and whether it is modulated by the listeners' native language. In a multi-deviant oddball paradigm, native and nonnative speakers of Dutch were exposed to naturally-produced Dutch vowels varying in speaker, sex, accent, and phoneme identity. Unexpectedly, the analysis of mismatch negativity (MMN) amplitudes elicited by each type of change shows a large degree of early perceptual sensitivity to non-linguistic cues. This finding on perception of naturally-produced stimuli contrasts with previous studies examining the perception of synthetic stimuli wherein adult listeners automatically disregard acoustic cues to speaker identity. The present finding bears relevance to speech normalization theories, suggesting that at an unattended level of processing, listeners are indeed sensitive to changes in fundamental frequency in natural speech tokens.

[1]  Z. Szmeja Recenzja podręcznika pt. „CT Teaching Manual. A Systematic Approach to CT Reading” ☆ ☆☆ , 2008 .

[2]  B. MacWhinney,et al.  IMPLICIT AND EXPLICIT MEASURES OF SENSITIVITY TO VIOLATIONS IN SECOND LANGUAGE GRAMMAR: An Event-Related Potential Investigation , 2005, Studies in Second Language Acquisition.

[3]  Arnaud Delorme,et al.  EEGLAB: an open source toolbox for analysis of single-trial EEG dynamics including independent component analysis , 2004, Journal of Neuroscience Methods.

[4]  Sophie Dufour,et al.  Behavioral and electrophysiological evidence for the impact of regional variation on phoneme perception , 2009, Cognition.

[5]  P. Deltenre,et al.  Mismatch Negativity (MMN) evoked by sound duration contrasts: An unexpected major effect of deviance direction on amplitudes , 2009, Clinical Neurophysiology.

[6]  Mathias Scharinger,et al.  You had me at “Hello”: Rapid extraction of dialect information from spoken words , 2011, NeuroImage.

[7]  E. Schröger,et al.  Pre-attentive categorization of vowel formant structure in complex tones. , 2004, Brain research. Cognitive brain research.

[8]  J. Hillenbrand,et al.  Acoustic characteristics of American English vowels. , 1994, The Journal of the Acoustical Society of America.

[9]  Erich Schröger,et al.  Pre-attentive perception of vowel phonemes from variable speech stimuli. , 2004, Psychophysiology.

[10]  R. Näätänen,et al.  The mismatch negativity (MMN): towards the optimal paradigm , 2004, Clinical Neurophysiology.

[11]  T. Carrell,et al.  Central Auditory System Plasticity Associated with Speech Discrimination Training , 1995, Journal of Cognitive Neuroscience.

[12]  L. Oakes Infancy Guidelines for Publishing Eye-Tracking Data. , 2010, Infancy : the official journal of the International Society on Infant Studies.

[13]  Roel Smits,et al.  A comparison of vowel normalization procedures for language variation research. , 2004, The Journal of the Acoustical Society of America.

[14]  Steven J. Luck,et al.  ERPLAB: an open-source toolbox for the analysis of event-related potentials , 2014, Front. Hum. Neurosci..

[15]  R. Ilmoniemi,et al.  Language-specific phoneme representations revealed by electric and magnetic brain responses , 1997, Nature.

[16]  P. Jusczyk,et al.  The role of talker-specific information in word segmentation by infants. , 2000, Journal of experimental psychology. Human perception and performance.

[17]  Elizabeth K. Johnson,et al.  Developmental Changes in Infants' Ability to Cope with Dialect Variation in Word Recognition. , 2010, Infancy : the official journal of the International Society on Infant Studies.

[18]  Paola Escudero,et al.  MODELING VOWEL NORMALIZATION AND SOUND PERCEPTION AS SEQUENTIAL PROCESSES , 2007 .

[19]  James M McQueen,et al.  Effects of native language on perceptual sensitivity to phonetic cues , 2012, Neuroreport.

[20]  I. Winkler,et al.  ‘Primitive intelligence’ in the auditory cortex , 2001, Trends in Neurosciences.

[21]  Cory D. Bonn,et al.  Indexical and linguistic processing by 12-month-olds: Discrimination of speaker, accent and vowel differences , 2017, PloS one.

[22]  P. Escudero,et al.  Speaker and Accent Variation Are Handled Differently: Evidence in Native and Non-Native Listeners , 2016, PloS one.