Improved perception of speech in noise and Mandarin tones with acoustic simulations of harmonic coding for cochlear implants.

Harmonic and temporal fine structure (TFS) information are important cues for speech perception in noise and music perception. However, due to the inherently coarse spectral and temporal resolution in electric hearing, the question of how to deliver harmonic and TFS information to cochlear implant (CI) users remains unresolved. A harmonic-single-sideband-encoder [(HSSE); Nie et al. (2008). Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing; Lie et al., (2010). Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing] strategy has been proposed that explicitly tracks the harmonics in speech and transforms them into modulators conveying both amplitude modulation and fundamental frequency information. For unvoiced speech, HSSE transforms the TFS into a slowly varying yet still noise-like signal. To investigate its potential, four- and eight-channel vocoder simulations of HSSE and the continuous-interleaved-sampling (CIS) strategy were implemented, respectively. Using these vocoders, five normal-hearing subjects' speech recognition performance was evaluated under different masking conditions; another five normal-hearing subjects' Mandarin tone identification performance was also evaluated. Additionally, the neural discharge patterns evoked by HSSE- and CIS-encoded Mandarin tone stimuli were simulated using an auditory nerve model. All subjects scored significantly higher with HSSE than with CIS vocoders. The modeling analysis demonstrated that HSSE can convey temporal pitch cues better than CIS. Overall, the results suggest that HSSE is a promising strategy to enhance speech perception with CIs.

[1]  R. Shannon Temporal modulation transfer functions in patients with cochlear implants. , 1992, The Journal of the Acoustical Society of America.

[2]  C J Darwin,et al.  Listening to speech in the presence of other sounds , 2008, Philosophical Transactions of the Royal Society B: Biological Sciences.

[3]  Fan-Gang Zeng,et al.  Temporal pitch in electric hearing , 2002, Hearing Research.

[4]  Fan-Gang Zeng,et al.  Cochlear implant speech recognition with speech maskers. , 2004, The Journal of the Acoustical Society of America.

[6]  Paul J. Abbas,et al.  Effects of electrode-to-fiber distance on temporal neural response with electrical stimulation , 2004, IEEE Transactions on Biomedical Engineering.

[7]  Bryan E Pfingst,et al.  Features of stimulation affecting tonal-speech perception: implications for cochlear prostheses. , 2002, The Journal of the Acoustical Society of America.

[8]  Ning Zhou,et al.  Tone perception and production in pediatric cochlear implants users , 2011, Acta oto-laryngologica.

[9]  Brian C J Moore,et al.  Effects of moderate cochlear hearing loss on the ability to benefit from temporal fine structure information in speech. , 2008, The Journal of the Acoustical Society of America.

[10]  S. Soli,et al.  Development of the Hearing in Noise Test for the measurement of speech reception thresholds in quiet and in noise. , 1994, The Journal of the Acoustical Society of America.

[11]  Jan Wouters,et al.  Perception of Mandarin Chinese with cochlear implants using enhanced temporal pitch cues , 2012, Hearing Research.

[12]  M R Leek,et al.  FO processing and the separation of competing speech signals by listeners with normal hearing and with hearing loss. , 1998, Journal of speech, language, and hearing research : JSLHR.

[13]  IEEE Recommended Practice for Speech Quality Measurements , 1969, IEEE Transactions on Audio and Electroacoustics.

[14]  Les E. Atlas,et al.  Single sideband encoder for music coding in cochlear implants , 2008, 2008 IEEE International Conference on Acoustics, Speech and Signal Processing.

[15]  Marc Moonen,et al.  Improved Music Perception with Explicit Pitch Coding in Cochlear Implants , 2006, Audiology and Neurotology.

[16]  Les E. Atlas,et al.  Coherent envelope detection for modulation filtering of speech , 2005, Proceedings. (ICASSP '05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005..

[17]  Brian C J Moore,et al.  Speech perception problems of the hearing impaired reflect inability to use temporal fine structure , 2006, Proceedings of the National Academy of Sciences.

[18]  Alexandra Kaider,et al.  Envelope Versus Fine Structure Speech Coding Strategy: A Crossover Study , 2011, Otology & neurotology : official publication of the American Otological Society, American Neurotology Society [and] European Academy of Otology and Neurotology.

[19]  D. D. Greenwood A cochlear frequency-position function for several species--29 years later. , 1990, The Journal of the Acoustical Society of America.

[20]  R. Shannon,et al.  Speech recognition in noise as a function of the number of spectral channels: comparison of acoustic hearing and cochlear implants. , 2001, The Journal of the Acoustical Society of America.

[21]  Ning Zhou,et al.  Lexical Tone Perception with HiResolution and HiResolution 120 Sound-Processing Strategies in Pediatric Mandarin-Speaking Cochlear Implant Users , 2009, Ear and hearing.

[22]  Les E. Atlas,et al.  Harmonic coherent demodulation for improving sound coding in cochlear implants , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[23]  Les E. Atlas,et al.  Time-Frequency Coherent Modulation Filtering of Nonstationary Signals , 2009, IEEE Transactions on Signal Processing.

[24]  Andrew E. Vandali,et al.  Development of a temporal fundamental frequency coding strategy for cochlear implants. , 2011, The Journal of the Acoustical Society of America.

[25]  Thomas F. Quatieri,et al.  Speech analysis/Synthesis based on a sinusoidal representation , 1986, IEEE Trans. Acoust. Speech Signal Process..

[26]  J. L. Flanagan,et al.  Parametric coding of speech spectra , 1980 .

[27]  B. Moore,et al.  Benefit of high-rate envelope cues in vocoder processing: effect of number of channels and spectral region. , 2008, The Journal of the Acoustical Society of America.

[28]  Qin Li,et al.  Time-variant least squares harmonic modeling , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..

[29]  B. Moore The Role of Temporal Fine Structure Processing in Pitch Perception, Masking, and Speech Perception for Normal-Hearing and Hearing-Impaired People , 2008, Journal of the Association for Research in Otolaryngology.

[30]  A. Oxenham Pitch Perception and Auditory Stream Segregation: Implications for Hearing Loss and Cochlear Implants , 2008, Trends in amplification.

[31]  Bruce J Gantz,et al.  Speech recognition in noise for cochlear implant listeners: benefits of residual acoustic hearing. , 2004, The Journal of the Acoustical Society of America.

[32]  Blake S. Wilson,et al.  Cochlear implants: A remarkable past and a brilliant future , 2008, Hearing Research.

[33]  R. Carlyon,et al.  The role of resolved and unresolved harmonics in pitch perception and frequency modulation discrimination. , 1994, The Journal of the Acoustical Society of America.

[34]  Jay T. Rubinstein,et al.  Stochastic Population Model for Electrical Stimulation of the Auditory Nerve , 2009, IEEE Transactions on Biomedical Engineering.

[35]  Fan-Gang Zeng,et al.  Speech recognition with varying numbers and types of competing talkers by normal-hearing, cochlear-implant, and implant simulation subjects. , 2008, The Journal of the Acoustical Society of America.

[36]  Jong Ho Won,et al.  Sensitivity of psychophysical measures to signal processor modifications in cochlear implant users , 2010, Hearing Research.

[37]  Fan-Gang Zeng,et al.  Encoding frequency Modulation to improve cochlear implant performance in noise , 2005, IEEE Transactions on Biomedical Engineering.

[38]  Richard L Freyman,et al.  Speech intelligibility in cochlear implant simulations: Effects of carrier type, interfering noise, and subject experience. , 2007, The Journal of the Acoustical Society of America.

[39]  M F Dorman,et al.  The recognition of sentences in noise by normal-hearing listeners using simulations of cochlear-implant signal processors with 6-20 channels. , 1998, The Journal of the Acoustical Society of America.

[40]  Clemens Zierhofer,et al.  Temporal fine structure in cochlear implants: Preliminary speech perception results in Cantonese-speaking implant users , 2010, Acta oto-laryngologica.

[41]  Robert Mannell,et al.  Relative Contributions of Temporal Envelope and Fine Structure Cues to Lexical Tone Recognition in Hearing-Impaired Listeners , 2011, Journal of the Association for Research in Otolaryngology.