Effect of bandwidth extension to telephone speech recognition in cochlear implant users.

The present study investigated a bandwidth extension method to enhance telephone speech understanding for cochlear implant (CI) users. The acoustic information above telephone speech transmission range (i.e., 3400 Hz) was estimated based on trained models describing the relation between narrow-band and wide-band speech. The effect of the bandwidth extension method was evaluated with IEEE sentence recognition tests in seven CI users. Results showed a relatively modest but significant improvement in the speech recognition with the proposed method. The effect of bandwidth extension method was also observed to be highly dependent on individual CI users.

[1]  J. Ito,et al.  Hearing ability by telephone of patients with cochlear implants , 1999, Otolaryngology--head and neck surgery : official journal of American Academy of Otolaryngology-Head and Neck Surgery.

[2]  Helen E Cullington,et al.  An investigation into the effect of limiting the frequency bandwidth of speech on speech recognition in adult cochlear implant users , 2004, International journal of audiology.

[3]  Peter Jax,et al.  On artificial bandwidth extension of telephone speech , 2003, Signal Process..

[4]  W. Bastiaan Kleijn,et al.  Avoiding over-estimation in bandwidth extension of telephony speech , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[5]  Alexander Kain,et al.  Spectral voice conversion for text-to-speech synthesis , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).

[6]  M Terry,et al.  Processing the telephone speech signal for the hearing impaired. , 1992, Ear and hearing.

[7]  Eric Moulines,et al.  Continuous probabilistic transform for voice conversion , 1998, IEEE Trans. Speech Audio Process..

[8]  M Terry,et al.  Telephone usage in the hearing-impaired population. , 1992, Ear and hearing.

[9]  John Makhoul,et al.  High-frequency regeneration in speech coding systems , 1979, ICASSP.

[10]  IEEE Recommended Practice for Speech Quality Measurements , 1969, IEEE Transactions on Audio and Electroacoustics.

[11]  Margaret W Skinner,et al.  Nucleus® 24 Advanced Encoder Conversion Study: Performance versus Preference , 2002, Ear and hearing.

[12]  P Seligman,et al.  Architecture of the Spectra 22 speech processor. , 1995, The Annals of otology, rhinology & laryngology. Supplement.