Inferior Frontal Cortex Contributions to the Recognition of Spoken Words and Their Constituent Speech Sounds

Speech perception and comprehension are often challenged by the need to recognize speech sounds that are degraded or ambiguous. Here, we explore the cognitive and neural mechanisms involved in resolving ambiguity in the identity of speech sounds using syllables that contain ambiguous phonetic segments (e.g., intermediate sounds between /b/ and /g/ as in “blade” and “glade”). We used an audio-morphing procedure to create a large set of natural-sounding minimal pairs that contain phonetically ambiguous onset or offset consonants (differing in place, manner, or voicing). These ambiguous segments occurred in different lexical contexts (i.e., in words or pseudowords, such as blade–glade or blem–glem) and in different phonological environments (i.e., with neighboring syllables that differed in lexical status, such as blouse–glouse). These stimuli allowed us to explore the impact of phonetic ambiguity on the speed and accuracy of lexical decision responses (Experiment 1), semantic categorization responses (Experiment 2), and the magnitude of BOLD fMRI responses during attentive comprehension (Experiment 3). For both behavioral and neural measures, the observed effects of phonetic ambiguity were influenced by lexical context, leading to slower responses and increased activity in the left inferior frontal gyrus for high-ambiguity syllables that distinguish pairs of words, but not for equivalent pseudowords. These findings suggest lexical involvement in the resolution of phonetic ambiguity. Implications for speech perception and the role of inferior frontal regions are discussed.
