Mandarin Chinese Tone Recognition with an Artificial Neural Network

Mandarin Chinese tone patterns vary in one of the four ways, i.e, (1) high level; (2) rising; (3) low falling and rising; and (4) high falling. The present study is to examine the efficacy of an artificial neural network in recognizing these tone patterns. Speech data were recorded from 12 children (3-6 years of age) and 15 adults. All subjects were native Mandarin Chinese speakers. The fundamental frequencies (F0) of each monosyllabic word of the speech data were extracted with an autocorrelation method. The pitch data(i.e., the F0 contours) were the inputs to a feed-forward backpropagation artificial neural network. The number of inputs to the neural network varied from 1 to 16 and the hidden layer of the network contained neurons that varied from 1 to 16 in number. The output of the network consisted of four neurons representing the four tone patterns of Mandarin Chinese. After being trained with the Levenberg-Marquardt optimization, the neural network was able to successfully classify the tone patterns with an accuracy of about 90% correct for speech samples from both adults and children. The artificial neural network may provide an objective and effective way of assessing tone production in prelingually-deafened children who have received cochlear implants.

[1]  Pak-Chung Ching,et al.  Tone recognition of isolated Cantonese syllables , 1995, IEEE Trans. Speech Audio Process..

[2]  Alexander L. Francis,et al.  The perception of Cantonese lexical tones by early-deafened cochlear implantees. , 2002, The Journal of the Acoustical Society of America.

[3]  Fan-Gang Zeng,et al.  Mandarin tone recognition in cochlear-implant subjects , 2004, Hearing Research.

[4]  M. Skinner,et al.  Optimization of Speech Processor Fitting Strategies for Chinese‐Speaking Cochlear Implantees , 1998, The Laryngoscope.

[5]  Liang Tao,et al.  Perception of acoustically modified Mandarin tones , 2006 .

[6]  Hintat Cheung,et al.  Perception and Production of Mandarin Tones in Prelingually Deaf Children with Cochlear Implants , 2004, Ear and hearing.

[7]  Sin-Horng Chen,et al.  Mandarin tone recognition by multi-layer perceptron , 1990, International Conference on Acoustics, Speech, and Signal Processing.

[8]  Y Hui,et al.  Chinese tonal language rehabilitation following cochlear implantation in children. , 2000, Acta oto-laryngologica.

[9]  Huang Ts,et al.  Tone perception of Mandarin-speaking postlingually deaf implantees using the Nucleus 22-Channel Cochlear Mini System. , 1995 .

[10]  Raymond D. Kent,et al.  Acoustic Analysis of Speech , 2009 .

[11]  C A van Hasselt,et al.  Cantonese tone perception ability of cochlear implant children in comparison with normal-hearing children. , 2002, International journal of pediatric otorhinolaryngology.

[12]  Y R Wang,et al.  Tone recognition of continuous Mandarin speech assisted with prosodic information. , 1994, The Journal of the Acoustical Society of America.

[13]  Shangkai Gao,et al.  A novel speech-processing strategy incorporating tonal information for cochlear implants , 2004, IEEE Transactions on Biomedical Engineering.

[14]  Demin Han,et al.  Tone production in Mandarin-speaking children with cochlear implants: a preliminary study , 2004, Acta oto-laryngologica.

[15]  S Y Liu,et al.  Nucleus 22-channel cochlear mini-system implantations in Mandarin-speaking patients. , 1996, The American journal of otology.

[16]  J. Jenkins,et al.  Perception and production of lexical tones by 3-year-old, Mandarin-speaking children. , 2005, Journal of speech, language, and hearing research : JSLHR.