Speech perception with F0mod, a cochlear implant pitch coding strategy

Abstract Objective: The fundamental frequency modulation (F0mod) sound processing strategy was developed to improve pitch perception with cochlear implants. In previous work it has been shown to improve performance in a number of pitch-related tasks such as pitch ranking, familiar melody identification, and Mandarin Chinese tone identification. The objective of the current study was to compare speech perception with F0mod and the standard clinical advanced combination encoder (ACE) strategy. Study sample: Seven cochlear-implant listeners were recruited from the clinical population of the University Hospital Leuven. Design: F0mod was implemented on a real-time system. Speech recognition in quiet and noise was measured for seven cochlear-implant listeners, comparing F0mod with ACE, using three different Dutch-language speech materials. Additionally the F0 estimator used was evaluated physically, and pitch ranking performance was compared between F0mod and ACE. Results: Immediately after switch-on of the F0mod strategy, speech recognition in quiet and noise were similar for ACE and F0mod, for four out of seven listeners. The remaining three listeners were subjected to a short training protocol with F0mod, after which their performance was reassessed, and a significant improvement was found. Conclusions: As F0mod improves pitch perception, for the seven subjects tested it did not interfere with speech recognition in quiet and noise, and has a low computational complexity, it seems promising for implementation in a clinical sound processor.

[1]  P. Boersma ACCURATE SHORT-TERM ANALYSIS OF THE FUNDAMENTAL FREQUENCY AND THE HARMONICS-TO-NOISE RATIO OF A SAMPLED SOUND , 1993 .

[2]  Astrid van Wieringen,et al.  LIST and LINT: Sentences and numbers for quantifying speech understanding in severely impaired listeners for Flanders and the Netherlands , 2008, International journal of audiology.

[3]  Stuart Rosen,et al.  Enhancement of temporal periodicity cues in cochlear implants: effects on prosodic perception and vowel identification. , 2005, The Journal of the Acoustical Society of America.

[4]  R V Shannon,et al.  Speech Recognition with Primarily Temporal Cues , 1995, Science.

[5]  Marc Moonen,et al.  Improved Music Perception with Explicit Pitch Coding in Cochlear Implants , 2006, Audiology and Neurotology.

[6]  Tom Francart,et al.  Psychophysics, Fitting, and Signal Processing for Combined Hearing Aid and Cochlear Implant Stimulation , 2010, Ear and hearing.

[7]  Jan Wouters,et al.  APEX 3: a multi-purpose test platform for auditory psychophysical experiments , 2008, Journal of Neuroscience Methods.

[8]  Marc Moonen,et al.  Relative contributions of temporal and place pitch cues to fundamental frequency discrimination in cochlear implantees. , 2004, The Journal of the Acoustical Society of America.

[9]  Aaron E. Rosenberg,et al.  A comparative performance study of several pitch detection algorithms , 1976 .

[10]  G. Studebaker A "rationalized" arcsine transform. , 1985, Journal of speech and hearing research.

[11]  L Geurts,et al.  Coding of the fundamental frequency in continuous interleaved sampling processors for cochlear implants. , 2001, The Journal of the Acoustical Society of America.

[12]  Xin Luo,et al.  Encoding pitch contours using current steering. , 2010, The Journal of the Acoustical Society of America.

[13]  Hugh J. McDermott,et al.  Pitch ranking ability of cochlear implant recipients: a comparison of sound-processing strategies. , 2005, The Journal of the Acoustical Society of America.

[14]  R. Shannon Temporal modulation transfer functions in patients with cochlear implants. , 1992, The Journal of the Acoustical Society of America.

[15]  Gail S Donaldson,et al.  Within-Subjects Comparison of the HiRes and Fidelity120 Speech Processing Strategies: Speech Perception and Its Relation to Place-Pitch Sensitivity , 2011, Ear and hearing.

[16]  Andrew E Vandali,et al.  Enhancement of temporal cues to pitch in cochlear implants: effects on pitch ranking. , 2012, The Journal of the Acoustical Society of America.

[17]  H J McDermott,et al.  Pitch matching of amplitude-modulated current pulse trains by cochlear implantees: the effect of modulation depth. , 1995, The Journal of the Acoustical Society of America.

[18]  Paul C. Bagshaw,et al.  Enhanced pitch tracking and the processing of F0 contours for computer aided intonation teaching , 1993, EUROSPEECH.

[19]  K. Plant,et al.  Speech Perception as a Function of Electrical Stimulation Rate: Using the Nucleus 24 Cochlear Implant System , 2000, Ear and hearing.

[20]  Philipos C Loizou,et al.  Effect of filter spacing on melody recognition: acoustic and electric hearing. , 2007, The Journal of the Acoustical Society of America.

[21]  Birger Kollmeier,et al.  A Spanish matrix sentence test for assessing speech reception thresholds in noise , 2012, International journal of audiology.

[22]  Hideki Kawahara,et al.  YIN, a fundamental frequency estimator for speech and music. , 2002, The Journal of the Acoustical Society of America.

[23]  Jan Wouters,et al.  Better place-coding of the fundamental frequency in cochlear implants. , 2004, The Journal of the Acoustical Society of America.

[24]  Paul Boersma,et al.  Praat, a system for doing phonetics by computer , 2002 .

[25]  Xin Luo,et al.  Current Steering with Partial Tripolar Stimulation Mode in Cochlear Implants , 2013, Journal of the Association for Research in Otolaryngology.

[26]  Andrew E. Vandali,et al.  Development of a temporal fundamental frequency coding strategy for cochlear implants. , 2011, The Journal of the Acoustical Society of America.

[27]  A. M. Mimpen,et al.  Speech-reception threshold for sentences as a function of age and noise level. , 1979, The Journal of the Acoustical Society of America.

[28]  Jan Wouters,et al.  Perception of Mandarin Chinese with cochlear implants using enhanced temporal pitch cues , 2012, Hearing Research.

[29]  Astrid van Wieringen,et al.  Improved fundamental frequency coding in cochlear implant signal processing. , 2009, The Journal of the Acoustical Society of America.

[30]  Stuart Rosen,et al.  Enhancing temporal cues to voice pitch in continuous interleaved sampling cochlear implants. , 2004, The Journal of the Acoustical Society of America.

[31]  J P Martens,et al.  Pitch and voiced/unvoiced determination with an auditory model. , 1992, The Journal of the Acoustical Society of America.