This experiment examined the effects of spectral resolution and fine spectral structure on recognition of spectrally asynchronous sentences by normal-hearing and cochlear implant listeners. Sentence recognition was measured in six normal-hearing subjects listening to either full-spectrum or noise-band processors and five Nucleus-22 cochlear implant listeners fitted with 4-channel continuous interleaved sampling (CIS) processors. For the full-spectrum processor, the speech signals were divided into either 4 or 16 channels. For the noise-band processor, after band-pass filtering into 4 or 16 channels, the envelope of each channel was extracted and used to modulate noise of the same bandwidth as the analysis band, thus eliminating the fine spectral structure available in the full-spectrum processor. For the 4-channel CIS processor, the amplitude envelopes extracted from four bands were transformed to electric currents by a power function and the resulting electric currents were used to modulate pulse trains delivered to four electrode pairs. For all processors, the output of each channel was time-shifted relative to other channels, varying the channel delay across channels from 0 to 240 ms (in 40-ms steps). Within each delay condition, all channels were desynchronized such that the cross-channel delays between adjacent channels were maximized, thereby avoiding local pockets of channel synchrony. Results show no significant difference between the 4- and 16-channel full-spectrum speech processor for normal-hearing listeners. Recognition scores dropped significantly only when the maximum delay reached 200 ms for the 4-channel processor and 240 ms for the 16-channel processor. When fine spectral structures were removed in the noise-band processor, sentence recognition dropped significantly when the maximum delay was 160 ms for the 16-channel noise-band processor and 40 ms for the 4-channel noise-band processor. There was no significant difference between implant listeners using the 4-channel CIS processor and normal-hearing listeners using the 4-channel noise-band processor. The results imply that when fine spectral structures are not available, as in the implant listener's case, increased spectral resolution is important for overcoming cross-channel asynchrony in speech signals.
[1]
S. Soli,et al.
Development of the Hearing in Noise Test for the measurement of speech reception thresholds in quiet and in noise.
,
1994,
The Journal of the Acoustical Society of America.
[2]
William M. Rabinowitz,et al.
Better speech recognition with cochlear implants
,
1991,
Nature.
[3]
R V Shannon,et al.
A computer interface for psychophysical and speech research with the Nucleus cochlear implant.
,
1990,
The Journal of the Acoustical Society of America.
[4]
Q J Fu,et al.
Effects of noise and spectral resolution on vowel and consonant recognition: acoustic and electric hearing.
,
1998,
The Journal of the Acoustical Society of America.
[5]
R E Remez,et al.
CODING OF THE SPEECH SPECTRUM IN THREE TIME‐VARYING SINUSOIDS a
,
1983,
Annals of the New York Academy of Sciences.
[6]
R V Shannon,et al.
Speech Recognition with Primarily Temporal Cues
,
1995,
Science.
[7]
Steven Greenberg,et al.
Speech intelligibility in the presence of cross-channel spectral asynchrony
,
1998,
Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).
[8]
R. M. Warren,et al.
Spectral redundancy: Intelligibility of sentences heard through narrow spectral slits
,
1995,
Perception & psychophysics.
[9]
M F Dorman,et al.
The recognition of sentences in noise by normal-hearing listeners using simulations of cochlear-implant signal processors with 6-20 channels.
,
1998,
The Journal of the Acoustical Society of America.
[10]
R V Shannon,et al.
Effects of amplitude nonlinearity on phoneme recognition by cochlear implant users and normal-hearing listeners.
,
1998,
The Journal of the Acoustical Society of America.
[11]
Steven Greenberg,et al.
Speech intelligibility derived from exceedingly sparse spectral information
,
1998,
ICSLP.