The Tracking of Speech Envelope in the Human Cortex

Humans are highly adept at processing speech. Recently, it has been shown that slow temporal information in speech (i.e., the envelope of speech) is critical for speech comprehension. Furthermore, it has been found that evoked electric potentials in human cortex are correlated with the speech envelope. However, it has been unclear whether this essential linguistic feature is encoded differentially in specific regions, or whether it is represented throughout the auditory system. To answer this question, we recorded neural data with high temporal resolution directly from the cortex while human subjects listened to a spoken story. We found that the gamma activity in human auditory cortex robustly tracks the speech envelope. The effect is so marked that it is observed during a single presentation of the spoken story to each subject. The effect is stronger in regions situated relatively early in the auditory pathway (belt areas) compared to other regions involved in speech processing, including the superior temporal gyrus (STG) and the posterior inferior frontal gyrus (Broca's region). To further distinguish whether speech envelope is encoded in the auditory system as a phonological (speech-related), or instead as a more general acoustic feature, we also probed the auditory system with a melodic stimulus. We found that belt areas track melody envelope weakly, and as the only region considered. Together, our data provide the first direct electrophysiological evidence that the envelope of speech is robustly tracked in non-primary auditory cortex (belt areas in particular), and suggest that the considered higher-order regions (STG and Broca's region) partake in a more abstract linguistic analysis.

[1]  S. Rosen Temporal information in speech: acoustic, auditory and linguistic aspects. , 1992, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.

[2]  J L Lancaster,et al.  Automated Talairach Atlas labels for functional brain mapping , 2000, Human brain mapping.

[3]  浜中 淑彦 Carl Wernicke;Der aphasische Symptomencomplex--Eine psychologische Studie auf anatomischer Basis(「失語症候群--解剖学的基礎に立つ心理学的研究」,Max Cohn & Weigert,Breslau,1874) , 1975 .

[4]  Patrick Chauvel,et al.  Temporal envelope processing in the human left and right auditory cortices. , 2004, Cerebral cortex.

[5]  N. Birbaumer,et al.  BCI2000: a general-purpose brain-computer interface (BCI) system , 2004, IEEE Transactions on Biomedical Engineering.

[6]  R V Shannon,et al.  Speech Recognition with Primarily Temporal Cues , 1995, Science.

[7]  Gerwin Schalk,et al.  A Practical Guide to Brain–Computer Interfacing with BCI2000: General-Purpose Software for Brain-Computer Interface Research, Data Acquisition, Stimulus Presentation, and Brain Monitoring , 2010 .

[8]  J. Rauschecker,et al.  Hierarchical Organization of the Human Auditory Cortex Revealed by Functional Magnetic Resonance Imaging , 2001, Journal of Cognitive Neuroscience.

[9]  J. Eggermont,et al.  Spatial representation of neural responses to natural and altered conspecific vocalizations in cat auditory cortex. , 2007, Journal of neurophysiology.

[10]  D. Purves,et al.  The Statistical Structure of Human Speech Sounds Predicts Musical Universals , 2003, The Journal of Neuroscience.

[11]  J. Wolpaw,et al.  Decoding flexion of individual fingers using electrocorticographic signals in humans , 2009, Journal of neural engineering.

[12]  P. Matthews,et al.  Defining a left-lateralized response specific to intelligible speech using fMRI. , 2003, Cerebral cortex.

[13]  F. Gibbs,et al.  The Localization of Intracranial Lesions by Electroencephalography , 1938 .

[14]  P. Broca,et al.  Remarques sur le siege de la faculte du langage articule suivies d'une observation d'aphemie , 1861 .

[15]  J. M. Bouma,et al.  Wechsler Adult Intelligence Scale - WAIS-III en WAIS-IV , 2012 .

[16]  N. Geschwind Disconnexion syndromes in animals and man. II. , 1965, Brain : a journal of neurology.

[17]  David A. Medler,et al.  Cerebral Cortex doi:10.1093/cercor/bhi040 Cerebral Cortex Advance Access published February 9, 2005 , 2022 .

[18]  C. Schreiner,et al.  Representation of spectral and temporal envelope of twitter vocalizations in common marmoset primary auditory cortex. , 2002, Journal of neurophysiology.

[19]  Rajesh P. N. Rao,et al.  Cortical electrode localization from X-rays and simple mapping for electrocorticographic research: The “Location on Cortex” (LOC) package for MATLAB , 2007, Journal of Neuroscience Methods.

[20]  G. Ojemann,et al.  Cortical language localization in left, dominant hemisphere. An electrical stimulation mapping investigation in 117 patients. , 1989, Journal of neurosurgery.

[21]  N. Geschwind Disconnexion syndromes in animals and man. I. , 1965, Brain : a journal of neurology.

[22]  D. Abrams,et al.  Right-Hemisphere Auditory Cortex Is Dominant for Coding Syllable Patterns in Speech , 2008, The Journal of Neuroscience.

[23]  N. Thakor,et al.  Electrocorticographic amplitude predicts finger positions during slow grasping motions of the hand , 2010, Journal of neural engineering.

[24]  J. Rauschecker,et al.  Multiple stages of auditory speech perception reflected in event-related FMRI. , 2007, Cerebral cortex.

[25]  Jos J Eggermont,et al.  Neuronal responses in cat primary auditory cortex to natural and altered species-specific calls , 2000, Hearing Research.

[26]  I. Peretz,et al.  Evidence for the role of the right auditory cortex in fine pitch resolution , 2008, Neuropsychologia.

[27]  R. Plomp,et al.  Effect of reducing slow temporal modulations on speech reception. , 1994, The Journal of the Acoustical Society of America.

[28]  A. Boemio,et al.  Hierarchical and asymmetric temporal sensitivity in human auditory cortices , 2005, Nature Neuroscience.

[29]  J. Rauschecker,et al.  Maps and streams in the auditory cortex: nonhuman primates illuminate human speech processing , 2009, Nature Neuroscience.

[30]  M M Merzenich,et al.  Representation of a species-specific vocalization in the primary auditory cortex of the common marmoset: temporal and spectral characteristics. , 1995, Journal of neurophysiology.

[31]  S. Scott,et al.  Identification of a pathway for intelligible speech in the left temporal lobe. , 2000, Brain : a journal of neurology.

[32]  H. Goodglass Boston diagnostic aphasia examination , 2013 .

[33]  C. Wernicke Der aphasische Symptomencomplex: Eine psychologische Studie auf anatomischer Basis , 1874 .

[34]  Y Sininger,et al.  Temporal and speech processing de ® cits in auditory neuropathy , 1999 .

[35]  R. Plomp,et al.  Effect of temporal envelope smearing on speech reception. , 1994, The Journal of the Acoustical Society of America.

[36]  J. Wolpaw,et al.  Decoding two-dimensional movement trajectories using electrocorticographic signals in humans , 2007, Journal of neural engineering.

[37]  D. Poeppel,et al.  The cortical organization of speech processing , 2007, Nature Reviews Neuroscience.

[38]  J. J. Ryan,et al.  Wechsler Adult Intelligence Scale-III , 2001 .

[39]  R. Zatorre,et al.  Structure and function of auditory cortex: music and speech , 2002, Trends in Cognitive Sciences.

[40]  E. T. Possing,et al.  Human temporal lobe activation by speech and nonspeech sounds. , 2000, Cerebral cortex.

[41]  D. Abrams,et al.  Abnormal Cortical Processing of the Syllable Rate of Speech in Poor Readers , 2009, The Journal of Neuroscience.

[42]  I. Fried,et al.  Coupling between Neuronal Firing Rate, Gamma LFP, and BOLD fMRI Is Related to Interneuronal Correlations , 2007, Current Biology.

[43]  Christopher K. Kovach,et al.  Temporal Envelope of Time-Compressed Speech Represented in the Human Auditory Cortex , 2009, The Journal of Neuroscience.

[44]  E Ahissar,et al.  Speech comprehension is correlated with temporal response patterns recorded from auditory cortex , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[45]  Rafael Malach,et al.  Invariance of firing rate and field potential dynamics to stimulus modulation rate in human auditory cortex , 2011, Human brain mapping.

[46]  K. Miller Broadband Spectral Change: Evidence for a Macroscale Correlate of Population Firing Rate? , 2010, The Journal of Neuroscience.