Categorical Speech Representation in Human Superior Temporal Gyrus

Speech perception requires the rapid and effortless extraction of meaningful phonetic information from a highly variable acoustic signal. A powerful example of this phenomenon is categorical speech perception, in which a continuum of acoustically varying sounds is transformed into perceptually distinct phoneme categories. We found that the neural representation of speech sounds is categorically organized in the human posterior superior temporal gyrus. Using intracranial high-density cortical surface arrays, we found that listening to synthesized speech stimuli varying in small and acoustically equal steps evoked distinct and invariant cortical population response patterns that were organized by their sensitivities to critical acoustic features. Phonetic category boundaries were similar between neurometric and psychometric functions. Although speech-sound responses were distributed, spatially discrete cortical loci were found to underlie specific phonetic discrimination. Our results provide direct evidence for acoustic-to–higher order phonetic level encoding of speech sounds in human language receptive cortex.

[1]  H. Gastaut,et al.  Epilepsy and the functional anatomy of the human brain , 1954 .

[2]  R.N.Dej.,et al.  Epilepsy and the Functional Anatomy of the Human Brain , 1954, Neurology.

[3]  G. A. Miller,et al.  An Analysis of Perceptual Confusions Among Some English Consonants , 1955 .

[4]  B. C. Griffith,et al.  The discrimination of speech sounds within and across phoneme boundaries. , 1957, Journal of experimental psychology.

[5]  A M Liberman,et al.  Perception of the speech code. , 1967, Psychological review.

[6]  D. Hubel,et al.  Receptive fields and functional architecture of monkey striate cortex , 1968, The Journal of physiology.

[7]  M. Merzenich,et al.  Representation of the cochlear partition of the superior temporal plane of the macaque monkey. , 1973, Brain research.

[8]  S. Blumstein,et al.  Perceptual invariance and onset spectra for stop consonants in different vowel environments. , 1980, The Journal of the Acoustical Society of America.

[9]  R N Shepard,et al.  Multidimensional Scaling, Tree-Fitting, and Clustering , 1980, Science.

[10]  J. Perkell,et al.  Invariance and variability in speech processes , 1987 .

[11]  A M Liberman,et al.  A specialization for speech perception. , 1989, Science.

[12]  Joanne L. Miller,et al.  Speech Perception , 1990, Springer Handbook of Auditory Research.

[13]  S. Harnad Categorical Perception: The Groundwork of Cognition , 1990 .

[14]  M M Haglund,et al.  Cortical localization of temporal lobe language sites in patients with gliomas. , 1994, Neurosurgery.

[15]  P Iverson,et al.  Mapping the perceptual magnet effect for speech using signal detection theory and multidimensional scaling. , 1995, The Journal of the Acoustical Society of America.

[16]  R. Lesser,et al.  Auditory Speech Processing in the Left Temporal Lobe: An Electrical Interference Study , 1995, Brain and Language.

[17]  Marilyn M. Vihman,et al.  Phonological Development , 2014 .

[18]  A. Liberman,et al.  On the relation of speech to language , 2000, Trends in Cognitive Sciences.

[19]  E. T. Possing,et al.  Human temporal lobe activation by speech and nonspeech sounds. , 2000, Cerebral cortex.

[20]  P. Kuhl,et al.  Perceptual magnet and phoneme boundary effects in speech perception: Do they arise from a common mechanism? , 2000, Perception & psychophysics.

[21]  J. E. Hind,et al.  Auditory cortex on the human posterior superior temporal gyrus , 2000, The Journal of comparative neurology.

[22]  B. Gordon,et al.  Induced electrocorticographic gamma activity during auditory perception. Brazier Award-winning article, 2001. , 2001, Clinical neurophysiology : official journal of the International Federation of Clinical Neurophysiology.

[23]  David Bimler,et al.  Categorical perception of facial expressions of emotion: Evidence from multidimensional scaling , 2001 .

[24]  K. Kiehl,et al.  Detection of Sounds in the Auditory Stream: Event-Related fMRI Evidence for Differential Activation to Speech and Nonspeech , 2001, Journal of Cognitive Neuroscience.

[25]  B. Gordon,et al.  Induced electrocorticographic gamma activity during auditory perception , 2001, Clinical Neurophysiology.

[26]  H. Scheich,et al.  Phonetic Perception and the Temporal Cortex , 2002, NeuroImage.

[27]  R. Diehl,et al.  Speech Perception , 2004, Annual review of psychology.

[28]  O. Creutzfeldt,et al.  Neuronal activity in the human lateral temporal lobe , 1989, Experimental Brain Research.

[29]  Sophie K. Scott,et al.  The functional neuroanatomy of prelexical processing in speech perception , 2004, Cognition.

[30]  O. Creutzfeldt,et al.  Neuronal activity in the human lateral temporal lobe , 2004, Experimental Brain Research.

[31]  D. Poeppel,et al.  Dorsal and ventral streams: a framework for understanding aspects of the functional anatomy of language , 2004, Cognition.

[32]  Emily B. Myers,et al.  The Perception of Voice Onset Time: An fMRI Investigation of Phonetic Category Structure , 2005, Journal of Cognitive Neuroscience.

[33]  David A. Medler,et al.  Cerebral Cortex doi:10.1093/cercor/bhi040 Cerebral Cortex Advance Access published February 9, 2005 , 2022 .

[34]  R Todd Constable,et al.  Differentiation of speech and nonspeech processing within primary auditory cortex. , 2006, The Journal of the Acoustical Society of America.

[35]  Wilbert Heeringa,et al.  UC Berkeley Phonology Lab Annual Report 2005 , 2006 .

[36]  Matthew Richardson,et al.  Phonetic processing areas revealed by sinewave speech and acoustically similar non-speech , 2006, NeuroImage.

[37]  Roy D. Patterson,et al.  Locating the initial stages of speech–sound processing in human temporal cortex , 2006, NeuroImage.

[38]  Rajeev D. S. Raizada,et al.  Selective Amplification of Stimulus Differences during Categorical Processing of Speech , 2007, Neuron.

[39]  Stephen P. Boyd,et al.  An Interior-Point Method for Large-Scale $\ell_1$-Regularized Least Squares , 2007, IEEE Journal of Selected Topics in Signal Processing.

[40]  Stephen P. Boyd,et al.  An Interior-Point Method for Large-Scale l1-Regularized Logistic Regression , 2007, J. Mach. Learn. Res..

[41]  Jeffrey R. Binder,et al.  Left Posterior Temporal Regions are Sensitive to Auditory Categorization , 2008, Journal of Cognitive Neuroscience.

[42]  N. Logothetis,et al.  A voice region in the monkey brain , 2008, Nature Neuroscience.

[43]  Charles B. Mikell,et al.  Categorical speech representation in human superior temporal gyrus. , 2010, Neurosurgery.

[44]  Robert T. Knight,et al.  Spatiotemporal imaging of cortical activation during verb generation and picture naming , 2010, NeuroImage.