Change and novelty detection in speech and non-speech sound streams

The aim of the present study was to determine differences in cortical processing of consonant-vowel syllables and acoustically matched non-speech sounds, as well as novel human and nonhuman sounds. Event-related potentials (ERPs) were recorded to vowel, vowel duration, consonant, syllable intensity, and frequency changes as well as corresponding changes in their non-speech counterparts with the multi-feature mismatch negativity (MMN) paradigm. Enhanced responses to linguistically relevant deviants were expected. Indeed, the vowel and frequency deviants elicited significantly larger MMNs in the speech than non-speech condition. Minimum-norm source localization algorithm was applied to determine hemispheric asymmetry in the responses. Language relevant deviants (vowel, duration and - to a lesser degree - frequency) showed higher activation in the left than right hemisphere to stimuli in the speech condition. Novel sounds elicited novelty P3 waves, the amplitude of which for nonhuman sounds was larger in the speech than non-speech condition. The current MMN results imply enhanced processing of linguistically relevant information at the pre-attentive stage and in this way support the domain-specific model of speech perception.

[1]  A. Harkrider,et al.  Neurophysiological indices of speech and nonspeech stimulus processing. , 2005, Journal of speech, language, and hearing research : JSLHR.

[2]  R. Näätänen,et al.  Hemispheric processing of duration changes in speech and non-speech sounds , 2004, Neuroreport.

[3]  P. Tallal,et al.  Neurobiology of speech perception. , 1997, Annual review of neuroscience.

[4]  Risto Näätänen,et al.  Linguistic relevance of duration within the native language determines the accuracy of speech-sound duration processing. , 2003, Brain research. Cognitive brain research.

[5]  J. Pierrehumbert The perception of fundamental frequency declination. , 1979, The Journal of the Acoustical Society of America.

[6]  S. Palva,et al.  Discrimination of Speech and of Complex Nonspeech Sounds of Different Temporal Structure in the Left and Right Cerebral Hemispheres , 2000, NeuroImage.

[7]  V. Knott,et al.  Auditory hallucinations and the mismatch negativity: processing speech and non-speech sounds in schizophrenia. , 2008, International journal of psychophysiology : official journal of the International Organization of Psychophysiology.

[8]  I. Winkler,et al.  Do N1/MMN, P3a, and RON form a strongly coupled chain reflecting the three stages of auditory distraction? , 2008, Biological Psychology.

[9]  Erich Schröger,et al.  Familiarity Affects the Processing of Task-irrelevant Auditory Deviance , 2005, Journal of Cognitive Neuroscience.

[10]  Robert J Zatorre,et al.  Neural specializations for speech and pitch: moving beyond the dichotomies , 2008, Philosophical Transactions of the Royal Society B: Biological Sciences.

[11]  Roy D. Patterson,et al.  Locating the initial stages of speech–sound processing in human temporal cortex , 2006, NeuroImage.

[12]  S. Scott Voice processing in monkey and human brains , 2008, Trends in Cognitive Sciences.

[13]  P. Alku,et al.  A method for generating natural-sounding speech stimuli for cognitive brain research , 1999, Clinical Neurophysiology.

[14]  Erich Schröger,et al.  Pre-attentive perception of vowel phonemes from variable speech stimuli. , 2004, Psychophysiology.

[15]  Karl J. Friston,et al.  The mismatch negativity: A review of underlying mechanisms , 2009, Clinical Neurophysiology.

[16]  K. Reinikainen,et al.  Right hemisphere dominance of different mismatch negativities. , 1991, Electroencephalography and clinical neurophysiology.

[17]  Friedemann Pulvermüller,et al.  Determinants of dominance: Is language laterality explained by physical or linguistic features of speech? , 2005, NeuroImage.

[18]  G. McArthur,et al.  Speech and non-speech processing in people with specific language impairment: A behavioural and electrophysiological study , 2005, Brain and Language.

[19]  R. Simons,et al.  On the relationship of P3a and the Novelty-P3 , 2001, Biological Psychology.

[20]  R. Näätänen,et al.  Measurement of extensive auditory discrimination profiles using the mismatch negativity (MMN) of the auditory event-related potential (ERP) , 2007, Clinical Neurophysiology.

[21]  C Pantev,et al.  Magnetic and electric brain activity evoked by the processing of tone and vowel stimuli , 1995, The Journal of neuroscience : the official journal of the Society for Neuroscience.

[22]  R. Zatorre,et al.  Voice-selective areas in human auditory cortex , 2000, Nature.

[23]  Jeffrey R. Binder,et al.  Left Posterior Temporal Regions are Sensitive to Auditory Categorization , 2008, Journal of Cognitive Neuroscience.

[24]  Dennis Butler Fry The Physics of Speech , 1979 .

[25]  P. Alku,et al.  Are different kinds of acoustic features processed differently for speech and non-speech sounds? , 2001, Brain research. Cognitive brain research.

[26]  O Bertrand,et al.  Analysis of speech sounds is left-hemisphere predominant at 100-150ms after sound onset. , 1999, Neuroreport.

[27]  J. Polich Updating P 300 : An Integrative Theory of P 3 a and P 3 b , 2009 .

[28]  Raymond D. Kent,et al.  Acoustic Analysis of Speech , 2009 .

[29]  D. Poeppel,et al.  Speech perception at the interface of neurobiology and linguistics , 2008, Philosophical Transactions of the Royal Society B: Biological Sciences.

[30]  R. Ilmoniemi,et al.  Language-specific phoneme representations revealed by electric and magnetic brain responses , 1997, Nature.

[31]  I. Winkler,et al.  Memory-based or afferent processes in mismatch negativity (MMN): a review of the evidence. , 2005, Psychophysiology.

[32]  F. Díaz,et al.  The contribution of AERPs (MMN and LDN) to studying temporal vs. linguistic processing deficits in children with reading difficulties. , 2006, International journal of psychophysiology : official journal of the International Organization of Psychophysiology.

[33]  A. Liberman,et al.  On the relation of speech to language , 2000, Trends in Cognitive Sciences.

[34]  Juhani Järvikivi,et al.  Focus in production: tonal shape, intensity and word order. , 2007, The Journal of the Acoustical Society of America.

[35]  M. Tervaniemi,et al.  Lateralization of auditory-cortex functions , 2003, Brain Research Reviews.

[36]  Acoustical comparison of the monophthong systems in Finnish, Mongolian and Udmurt , 2005, Journal of the International Phonetic Association.

[37]  Paavo Alku,et al.  Fast multi-feature paradigm for recording several mismatch negativities (MMNs) to phonetic and acoustic changes in speech sounds , 2009, Biological Psychology.

[38]  S. Bentin,et al.  Processing specificity for human voice stimuli: electrophysiological evidence , 2001, Neuroreport.

[39]  Mietta Lennes,et al.  FINNISH SOUND STRUCTURE. PHONETICS, PHONOLOGY, PHONOTACTICS AND PROSODY , 2010 .

[40]  M. Tervaniemi,et al.  The mismatch negativity in cognitive and clinical neuroscience: Theoretical and methodological considerations , 2007, Biological Psychology.

[41]  Steven Greenberg,et al.  Listening to Speech : An Auditory Perspective , 2012 .

[42]  A. Lahiri,et al.  Neurobiological Evidence for Abstract Phonological Representations in the Mental Lexicon during Speech Recognition , 2004, Journal of Cognitive Neuroscience.

[43]  P. Tallal,et al.  Neurobiological Basis of Speech: A Case for the Preeminence of Temporal Processing , 1993, Annals of the New York Academy of Sciences.

[44]  Juha Virtanen,et al.  Hemispheric lateralization in preattentive processing of speech sounds , 1998, Neuroscience Letters.

[45]  Frank Becker,et al.  Mismatch negativity elicited by tones and speech sounds: Changed topographical distribution in aphasia , 2007, Brain and Language.

[46]  Eric D Young,et al.  Neural representation of spectral and temporal information in speech , 2008, Philosophical Transactions of the Royal Society B: Biological Sciences.

[47]  R Todd Constable,et al.  Differentiation of speech and nonspeech processing within primary auditory cortex. , 2006, The Journal of the Acoustical Society of America.

[48]  D. Pisoni,et al.  The Handbook of Speech Perception , 2004 .

[49]  P. Alku,et al.  Mismatch negativity (MMN) elicited by changes in phoneme length: A cross-linguistic study , 2006, Brain Research.

[50]  Erich Schröger,et al.  Pre-attentive auditory processing of lexicality , 2004, Brain and Language.

[51]  P. Belin,et al.  Thinking the voice: neural correlates of voice perception , 2004, Trends in Cognitive Sciences.

[52]  D. Friedman,et al.  The novelty P3: an event-related brain potential (ERP) sign of the brain's evaluation of novelty , 2001, Neuroscience & Biobehavioral Reviews.

[53]  R. Ilmoniemi,et al.  Brain responses reveal the learning of foreign language phonemes. , 1999, Psychophysiology.

[54]  Roy D Patterson,et al.  Functional imaging of the auditory processing applied to speech sounds , 2008, Philosophical Transactions of the Royal Society B: Biological Sciences.

[55]  Martti Vainio,et al.  Modulation of the mismatch negativity (MMN) to vowel duration changes in native speakers of Finnish and German as a result of language experience. , 2007, International journal of psychophysiology : official journal of the International Organization of Psychophysiology.

[56]  G. Rainer,et al.  Cognitive neuroscience: Neural mechanisms for detecting and remembering novel events , 2003, Nature Reviews Neuroscience.

[57]  R. Näätänen,et al.  Mismatch Negativity: Clinical and Other Applications , 2000, Audiology and Neurotology.

[58]  V. Hacquard,et al.  The effects of inventory on vowel perception in French and Spanish: An MEG study , 2007, Brain and Language.

[59]  R. Näätänen,et al.  Selective tuning of cortical sound‐feature processing by language experience , 2006, The European journal of neuroscience.

[60]  R. Näätänen,et al.  The mismatch negativity (MMN): towards the optimal paradigm , 2004, Clinical Neurophysiology.

[61]  Sonja A. Kotz,et al.  Early emotional prosody perception based on different speaker voices , 2008, Neuroreport.

[62]  R Näätänen,et al.  Phoneme representations of the human brain as reflected by event-related potentials. , 1999, Electroencephalography and clinical neurophysiology. Supplement.

[63]  C. Sebastian,et al.  Speech versus tone processing in compensated dyslexia: discrimination and lateralization with a dichotic mismatch negativity (MMN) paradigm. , 2008, International journal of psychophysiology : official journal of the International Organization of Psychophysiology.

[64]  P. Wong,et al.  Hemispheric specialization of linguistic pitch patterns , 2002, Brain Research Bulletin.

[65]  Sophie Dufour,et al.  Behavioral and electrophysiological evidence for the impact of regional variation on phoneme perception , 2009, Cognition.

[66]  J. Polich,et al.  Neuropsychology and neuropharmacology of P3a and P3b. , 2006, International journal of psychophysiology : official journal of the International Organization of Psychophysiology.

[67]  Mandy J. Maguire,et al.  Event-Related Evoked Potentials (ERPs) in Speech Perception , 2008 .

[68]  Willy Serniclaes,et al.  Neural correlates of switching from auditory to speech perception , 2005, NeuroImage.

[69]  I. Winkler,et al.  Involuntary Attention and Distractibility as Evaluated with Event-Related Brain Potentials , 2000, Audiology and Neurotology.

[70]  F. Pulvermüller,et al.  Language outside the focus of attention: The mismatch negativity as a tool for studying higher cognitive processes , 2006, Progress in Neurobiology.

[71]  C E Tenke,et al.  Dissociation of brain ERP topographies for tonal and phonetic oddball tasks. , 1998, Psychophysiology.

[72]  R J Ilmoniemi,et al.  Noise affects speech-signal processing differently in the cerebral hemispheres. , 1999, Neuroreport.

[73]  R. Näätänen,et al.  The mismatch negativity (MMN) in basic research of central auditory processing: A review , 2007, Clinical Neurophysiology.

[74]  P. Belin,et al.  Can Spectro-Temporal Complexity Explain the Autistic Pattern of Performance on Auditory Tasks? , 2006, Journal of autism and developmental disorders.

[75]  R. Ilmoniemi,et al.  Interpreting magnetic fields of the brain: minimum norm estimates , 2006, Medical and Biological Engineering and Computing.

[76]  C. Escera,et al.  Role of Mismatch Negativity and Novelty-P3 in Involuntary Auditory Attention , 2007 .

[77]  J. Polich Updating P300: An integrative theory of P3a and P3b , 2007, Clinical Neurophysiology.

[78]  Shlomo Bentin,et al.  Neural sensitivity to human voices: ERP evidence of task and attentional influences. , 2003, Psychophysiology.