Conflict monitoring in speech processing: An fMRI study of error detection in speech production and perception

To minimize the number of errors in speech, and thereby facilitate communication, speech is monitored before articulation. It is, however, unclear at which level of speech production monitoring takes place, and what mechanisms are used to detect and correct errors. The present study investigated whether internal verbal monitoring takes place through the speech perception system, as proposed by perception-based theories of speech monitoring, or whether mechanisms independent of perception are applied, as proposed by production-based theories. Using fMRI during a tongue twister task, we observed that error detection in internal speech during noise-masked overt speech production and error detection in speech perception both recruit the same neural network, which includes the pre-supplementary motor area (pre-SMA), dorsal anterior cingulate cortex (dACC), anterior insula (AI), and inferior frontal gyrus (IFG). Although production and perception recruit similar areas, as proposed by perception-based accounts, we did not find the activation in superior temporal areas (which are typically associated with speech perception) that these accounts hypothesize for internal speech monitoring during speech production. On the contrary, the results are highly compatible with a domain-general approach to speech monitoring, in which internal speech monitoring takes place through detection of conflict between response options, which is subsequently resolved by a domain-general executive center (e.g., the ACC).
