Auditory Cortex Processes Variation in Our Own Speech

As we talk, we unconsciously adjust our speech so that it sounds the way we intend. However, because speech production involves complex motor planning and execution, no two utterances of the same sound are exactly alike. Here, we show that auditory cortex is sensitive to natural utterance-to-utterance variations in self-produced speech. We recorded event-related potentials (ERPs) from ninety-nine subjects while they uttered “ah” and while they listened to those speech sounds played back. Each subject's utterances were sorted by their formant deviation from the previous utterance. Typically, the N1 ERP component is suppressed during talking compared to listening. By comparing ERPs to the least and most variable utterances, we found that N1 was less suppressed for utterances that differed greatly from their preceding neighbors. In contrast, an utterance's difference from the median formant values did not affect N1. Likewise, neither trial-to-trial pitch (f0) deviation nor pitch difference from the median affected N1. We discuss mechanisms that may underlie the change in N1 suppression resulting from trial-to-trial formant change. Deviant utterances require additional auditory cortical processing, suggesting that speaking-induced suppression mechanisms are optimally tuned for a specific production.
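The two deviation measures contrasted above (difference from the previous utterance versus difference from the median) can be sketched as follows. This is only an illustration under stated assumptions, not the authors' analysis pipeline: the abstract does not give the distance metric, the formants used, or the cutoff for the "least" and "most" variable utterances, so the Euclidean distance in (F1, F2) space, the quartile split, and all function names here are hypothetical.

```python
import math
from statistics import median

def trial_to_trial_deviation(formants):
    """Distance of each utterance from its predecessor.

    `formants` is a list of (F1, F2) pairs in Hz, one per utterance.
    Entry i of the result belongs to utterance i + 1, because the first
    utterance has no predecessor. Euclidean distance is an assumption.
    """
    return [math.dist(cur, prev) for prev, cur in zip(formants, formants[1:])]

def deviation_from_median(formants):
    """Distance of each utterance from the per-formant median values."""
    med = tuple(median(axis) for axis in zip(*formants))
    return [math.dist(f, med) for f in formants]

def split_extremes(deviations, fraction=0.25):
    """Return (least_variable, most_variable) index lists.

    The quartile default is an assumed cutoff, not one taken from the paper.
    """
    order = sorted(range(len(deviations)), key=deviations.__getitem__)
    k = max(1, int(len(order) * fraction))
    return order[:k], order[-k:]

# Made-up formant values for five utterances of "ah".
example = [(700, 1200), (703, 1204), (740, 1170), (741, 1170), (700, 1200)]
step = trial_to_trial_deviation(example)
least, most = split_extremes(step)
```

ERPs would then be averaged separately over the utterances indexed by `least` and `most` and compared between the talking and listening conditions.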
