Features of stimulation affecting tonal-speech perception: implications for cochlear prostheses.

Tone languages differ from English in that the pitch pattern of a single-syllable word conveys lexical meaning. In the present study, the dependence of tonal-speech perception on features of the stimulation was examined using an acoustic simulation of a CIS-type speech-processing strategy for cochlear prostheses. Contributions of spectral features of the speech signals were assessed by varying the number of filter bands, while contributions of temporal envelope features were assessed by varying the low-pass cutoff frequency used for extracting the amplitude envelopes. Ten normal-hearing native Mandarin Chinese speakers were tested. When the low-pass cutoff frequency was fixed at 512 Hz, consonant, vowel, and sentence recognition improved as a function of the number of channels and reached a plateau at 4 to 6 channels. Subjective judgments of sound quality continued to improve as the number of channels increased to 12, the highest number tested. Tone recognition, i.e., recognition of the four Mandarin tone patterns, depended on both the number of channels and the low-pass cutoff frequency. This trade-off between temporal and spectral cues indicates that temporal cues can compensate for diminished spectral cues in tone recognition, and vice versa. An additional tone recognition experiment using syllables of equal duration showed a marked decrease in performance, indicating that duration cues contribute to tone recognition. A third experiment showed that recognition of processed FM patterns that mimic Mandarin tone patterns was poor when temporal envelope and duration cues were removed.
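The role of the envelope low-pass cutoff can be illustrated with a minimal sketch. It is not the processing used in the study: the first-order filter, the 150 Hz modulation rate (standing in for a voice fundamental), the 1500 Hz carrier, and the 4 Hz comparison cutoff are all assumptions chosen for illustration; only the 512 Hz cutoff comes from the abstract. The sketch rectifies one band's signal, low-pass filters it, and measures how much of the 150 Hz fluctuation survives in the envelope.

```python
import math

def one_pole_lowpass(x, cutoff_hz, fs):
    """First-order IIR low-pass; an assumed stand-in for the study's envelope filter."""
    alpha = 1.0 - math.exp(-2.0 * math.pi * cutoff_hz / fs)
    out, state = [], 0.0
    for v in x:
        state += alpha * (v - state)
        out.append(state)
    return out

def envelope(x, cutoff_hz, fs):
    """Full-wave rectify, then low-pass filter at the given cutoff."""
    return one_pole_lowpass([abs(v) for v in x], cutoff_hz, fs)

def modulation_depth(env, skip):
    """(max - min) / (max + min) of the envelope after the filter has settled."""
    seg = env[skip:]
    hi, lo = max(seg), min(seg)
    return (hi - lo) / (hi + lo)

fs = 16000        # sampling rate in Hz (assumed)
f0 = 150.0        # hypothetical voice fundamental frequency in Hz
carrier = 1500.0  # hypothetical centre frequency of one analysis band in Hz

# One second of a carrier that is fully amplitude-modulated at f0.
x = [(1.0 + math.cos(2.0 * math.pi * f0 * n / fs))
     * math.sin(2.0 * math.pi * carrier * n / fs)
     for n in range(fs)]

skip = fs // 2  # discard the first 0.5 s so the slow filter has settled
depth_512 = modulation_depth(envelope(x, 512.0, fs), skip)  # cutoff from the abstract
depth_4 = modulation_depth(envelope(x, 4.0, fs), skip)      # hypothetical low cutoff

print(f"modulation depth at 512 Hz cutoff: {depth_512:.2f}")
print(f"modulation depth at   4 Hz cutoff: {depth_4:.2f}")
```

Under these assumptions, the 512 Hz cutoff leaves the 150 Hz fluctuation largely intact in the envelope, while the 4 Hz cutoff smooths most of it away. This is one way to see why the envelope cutoff controls how much periodicity information is available as a temporal cue.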
