Speech encoding strategies for multielectrode cochlear implants: a digital signal processor approach.

The following processing strategies have been implemented on an experimental laboratory system of a cochlear implant digital speech processor (CIDSP) for the Nucleus 22-channel cochlear prosthesis. The first approach (PES, Pitch Excited Sampler) is based on the maximum peak channel vocoder concept whereby the time-varying spectral energy of a number of frequency bands is transformed into electrical stimulation parameters for up to 22 electrodes. The pulse rate at any electrode is controlled by the voice pitch of the input speech signal. The second approach (CIS, Continuous Interleaved Sampler) uses a stimulation pulse rate which is independent of the input signal. The algorithm continuously scans all specified frequency bands (typically between four and 22) and samples their energy levels. As only one electrode can be stimulated at any instance of time, the maximally achievable rate of stimulation is limited by the required stimulus pulse widths (determined individually for each subject) and some additional constraints and parameters. A number of variations of the CIS approach have, therefore, been implemented which either maximize the number of quasi-simultaneous stimulation channels or the pulse rate on a reduced number of electrodes. Evaluation experiments with five experienced cochlear implant users showed significantly better performance in consonant identification tests with the new processing strategies than with the subjects' own wearable speech processors; improvements in vowel identification tasks were rarely observed. Modifications of the basic PES- and CIS strategies resulted in large variations of identification scores. Information transmission analysis of confusion matrices revealed a rather complex pattern across conditions and speech features. Optimization and fine-tuning of processing parameters for these coding strategies will require more data both from speech identification and discrimination evaluations and from psychophysical experiments.

[1]  N Dillier,et al.  Wearable digital speech processor for cochlear implants using a TMS320C25. , 1990, Acta oto-laryngologica. Supplementum.

[2]  M. D. Wang,et al.  Consonant confusions in noise: a study of perceptual features. , 1973, The Journal of the Acoustical Society of America.

[3]  P. Arabie,et al.  Auditory versus phonetic accounts of observed confusions between consonant phonemes. , 1979, The Journal of the Acoustical Society of America.

[4]  Graeme M. Clark,et al.  Preliminary results with a six spectral maxima speech processor for The University of Melbourne/Nucleus multiple electrode cochlear implant , 1991 .

[5]  J. Flanagan Speech Analysis, Synthesis and Perception , 1971 .

[6]  N Dillier,et al.  [Perception of acoustic speech features with single channel cochlear implants and hearing aids. A comparative analysis]. , 1988, HNO.

[7]  Norbert Dillier,et al.  Some aspects of temporal coding for single-channel electrical stimulation of the cochlea , 1985, Hearing Research.

[8]  G. A. Miller,et al.  An Analysis of Perceptual Confusions Among Some English Consonants , 1955 .

[9]  D. D. Greenwood A cochlear frequency-position function for several species--29 years later. , 1990, The Journal of the Acoustical Society of America.

[10]  William M. Rabinowitz,et al.  Better speech recognition with cochlear implants , 1991, Nature.

[11]  M. Skinner,et al.  Performance of postlinguistically deaf adults with the Wearable Speech Processor (WSP III) and Mini Speech Processor (MSP) of the Nucleus Multi-Electrode Cochlear Implant. , 1991, Ear and hearing.

[12]  P. Stypulkowski,et al.  Temporal response patterns of single auditory nerve fibers elicited by periodic electrical stimuli , 1987, Hearing Research.