Zerocrossing-based fine structure representation to convey Mandarin tonal information: A study on the noise effect

Because the fine structure cue of the speech signal is believed to be important for pitch perception in cochlear implant (CI) users, ongoing studies seek novel CI speech processing strategies that incorporate this cue. A speech synthesis model with a zerocrossing-based fine structure representation has recently been developed. This paper further examines its ability to convey Mandarin tonal information in noisy conditions. An acoustic simulation experiment on Mandarin tone identification was conducted. The experimental results indicated that the zerocrossing-based speech processing strategy conveyed Mandarin tonal information more efficiently than the traditional continuous-interleaved-sampling (CIS) method, even when the Mandarin speech was contaminated by speech-spectrum-shaped noise at low signal-to-noise ratios. The zerocrossing technique may therefore facilitate the development of novel CI speech processors that enhance pitch perception for cochlear implant users who speak tonal languages such as Mandarin.
