Improving performance in noise for hearing aids and cochlear implants using coherent modulation filtering

This study evaluated the maximal attainable performance of speech enhancement strategies based on coherent modulation filtering. An optimal adaptive coherent modulation filtering algorithm was designed to enhance known signals from a target talker in two-talker babble noise. The algorithm was evaluated in a closed-set, speech-recognition-in-noise task. The speech reception threshold (SRT) was measured using a one-down, one-up adaptive procedure. Five hearing-impaired subjects and five cochlear implant users were tested in three processing conditions: (1) original sounds; (2) fixed coherent modulation filtered sounds; and (3) optimal coherent modulation filtered sounds. Six normal-hearing subjects were tested with a 6-channel cochlear implant simulation of sounds processed in the same three conditions. Significant improvements in SRTs were observed when the signal was processed with the optimal coherent modulation filtering algorithm. There was no benefit when the signal was processed with the fixed modulation filter. The current study suggested that coherent modulation filtering might be a promising method for front-end processing in hearing aids and cochlear implants. An approach such as hidden Markov models could be used to generalize the optimal coherent modulation filtering algorithm to unknown utterances and to extend it to open-set speech.

[1]  Les E. Atlas,et al.  Coherent envelope detection for modulation filtering of speech , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[2]  Jong Ho Won,et al.  Spectral-Ripple Resolution Correlates with Speech Reception in Noise in Cochlear Implant Users , 2007, Journal of the Association for Research in Otolaryngology.

[3]  Paris Smaragdis,et al.  Convolutive Speech Bases and Their Application to Supervised Speech Separation , 2007, IEEE Transactions on Audio, Speech, and Language Processing.

[4]  Misha Pavel,et al.  Intelligibility of speech with filtered time trajectories of spectral envelopes , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[5]  N. Viemeister Temporal modulation transfer functions based upon modulation thresholds. , 1979, The Journal of the Acoustical Society of America.

[6]  H Levitt,et al.  Noise reduction in hearing aids: a review. , 2001, Journal of rehabilitation research and development.

[7]  J. Dugundji,et al.  Envelopes and pre-envelopes of real waveforms , 1958, IRE Trans. Inf. Theory.

[8]  Neal F. Viemeister,et al.  Temporal factors in audition: A systems analysis approach , 1977 .

[9]  Bruce J Gantz,et al.  Speech recognition in noise for cochlear implant listeners: benefits of residual acoustic hearing. , 2004, The Journal of the Acoustical Society of America.

[10]  O Ghitza,et al.  On the upper cutoff frequency of the auditory critical-band envelope detectors in the context of speech perception. , 2001, The Journal of the Acoustical Society of America.

[11]  Qin Li,et al.  Homomorphic modulation spectra , 2004, 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[12]  Les E. Atlas,et al.  Scalable and progressive audio codec , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[13]  N. Viemeister,et al.  Temporal modulation transfer functions in normal-hearing and hearing-impaired listeners. , 1985, Audiology : official organ of the International Society of Audiology.

[14]  Fan-Gang Zeng,et al.  Encoding frequency Modulation to improve cochlear implant performance in noise , 2005, IEEE Transactions on Biomedical Engineering.

[15]  Les E. Atlas,et al.  Coherent modulation spectral filtering for single-channel music source separation , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[16]  T. G. Forrest,et al.  The role of frequency selectivity in measures of auditory and vibrotactile temporal resolution. , 1992, The Journal of the Acoustical Society of America.

[17]  Yuji Murahara,et al.  Modulation enhancement of speech as a preprocessing for reverberant chambers with the hearing-impaired , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).

[18]  C. Schreiner,et al.  Periodicity coding in the inferior colliculus of the cat. II. Topographical organization. , 1988, Journal of neurophysiology.

[19]  Les E. Atlas,et al.  Frequency Reassignment for Coherent Modulation Filtering , 2006, 2006 IEEE International Conference on Acoustics Speech and Signal Processing Proceedings.

[20]  Peter Ladefoged,et al.  On the Fusion of Sounds Reaching Different Sense Organs , 1957 .

[21]  R B Gardner,et al.  Mistuning a harmonic of a vowel: grouping and phase effects on vowel quality. , 1986, The Journal of the Acoustical Society of America.

[22]  R. Plomp,et al.  Effect of temporal envelope smearing on speech reception. , 1994, The Journal of the Acoustical Society of America.

[23]  Steven Greenberg,et al.  The modulation spectrogram: in pursuit of an invariant representation of speech , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[24]  Xiaoqin Wang,et al.  Neural representations of sinusoidal amplitude and frequency modulations in the primary auditory cortex of awake primates. , 2002, Journal of neurophysiology.

[25]  R V Shannon,et al.  Speech Recognition with Primarily Temporal Cues , 1995, Science.

[26]  Zachary M. Smith,et al.  Chimaeric sounds reveal dichotomies in auditory perception , 2002, Nature.

[27]  Stephen T. Neely,et al.  Signals, Sound, and Sensation , 1997 .

[28]  A R Moller Unit responses in the rat cochlear nucleus to tones of rapidly varying frequency and amplitude. , 1971, Acta physiologica Scandinavica.

[29]  H. Levitt Transformed up-down methods in psychoacoustics. , 1971, The Journal of the Acoustical Society of America.

[30]  W. M. Rabinowitz,et al.  Standardization of a test of speech perception in noise. , 1979, Journal of speech and hearing research.

[31]  P. Heil,et al.  Frequency and periodicity are represented in orthogonal maps in the human auditory cortex: evidence from magnetoencephalography , 1997, Journal of Comparative Physiology A.

[32]  C. Schreiner,et al.  Representation of amplitude modulation in the auditory cortex of the cat. I. The anterior auditory field (AAF) , 1986, Hearing Research.