Neural coding of the temporal envelope of speech : Relation to modulation transfer functions

Continuous speech shows pronounced low-frequency modulations in its temporal envelope. Modulation frequencies near the average syllabic rate of 3-4 Hz are the most prominent. Degradations in these low-frequency modulations reduce speech intelligibility (Drullman et al., 1994; Drullman et al., 1994; Houtgast and Steeneken, 1973), while speech processed to have minimal spectral information is intelligible providing that low-frequency modulations are preserved (Shannon et al., 1995). Thus, low-frequency modulations are both necessary and almost sufficient for accurate speech reception. The ability of auditory neurons to encode amplitude modulation has been characterized using modulation transfer functions (MTFs) (reviews by Eggermont, 1993; Langner, 1992). The MTF expresses, as a function of frequency, the complex ratio (magnitude and phase) of the modulation in the neural response to the modulation in the acoustic stimulus. Previous neurophysiological studies have rarely focused on low modulation frequencies most important for speech reception, and have not explicitly related the MTFs of auditory neurons to their response to speech. In the present study, we measured both MTFs and responses to a speech utterance for single units in the auditory nerve (AN), cochlear nucleus (CN), and inferior colliculus (IC) of anesthetized cats. We also developed a functional model for predicting the neural response to speech based in part on the MTF.

[1]  T. Houtgast,et al.  The Modulation Transfer Function in Room Acoustics as a Predictor of Speech Intelligibility , 1973 .

[2]  M. Sachs,et al.  Rate versus level functions for auditory-nerve fibers in cats: tone-burst stimuli. , 1974, The Journal of the Acoustical Society of America.

[3]  Adrian Rees,et al.  Responses of neurons in the inferior colliculus of the rat to AM and FM tones , 1983, Hearing Research.

[4]  Jong Kyoung Kim,et al.  Speech recognition , 1983, 1983 IEEE International Solid-State Circuits Conference. Digest of Technical Papers.

[5]  Adrian Rees,et al.  Dynamic properties of the responses of single neurons in the inferior colliculus of the rat , 1986, Hearing Research.

[6]  C. Schreiner,et al.  Periodicity coding in the inferior colliculus of the cat. I. Neuronal mechanisms. , 1988, Journal of neurophysiology.

[7]  L. Carney,et al.  Temporal coding of resonances by low-frequency auditory nerve fibers: single-fiber responses and a population model. , 1988, Journal of neurophysiology.

[8]  Robert D Frisina,et al.  Encoding of amplitude modulation in the gerbil cochlear nucleus: I. A hierarchy of enhancement , 1990, Hearing Research.

[9]  Robert D Frisina,et al.  Encoding of amplitude modulation in the gerbil cochlear nucleus: II. Possible neural mechanisms , 1990, Hearing Research.

[10]  Gerald Langner,et al.  Periodicity coding in the auditory system , 1992, Hearing Research.

[11]  T. Yin,et al.  Responses to amplitude-modulated tones in the auditory nerve of the cat. , 1992, The Journal of the Acoustical Society of America.

[12]  R. Plomp,et al.  Effect of reducing slow temporal modulations on speech reception. , 1994, The Journal of the Acoustical Society of America.

[13]  R. Plomp,et al.  Effect of temporal envelope smearing on speech reception. , 1994, The Journal of the Acoustical Society of America.

[14]  R V Shannon,et al.  Speech Recognition with Primarily Temporal Cues , 1995, Science.

[15]  B. Delgutte,et al.  Neural correlates of the pitch of complex tones. I. Pitch and pitch salience. , 1996, Journal of neurophysiology.

[16]  Bertrand Delgutte,et al.  Auditory Neural Processing of Speech , 2002 .