Paper Information - Standard Speaker Selection in Speech Synthesis for Mandarin Tone Learning

The teaching speech that L2 learners imitate plays a key role in learning Mandarin tones. It has been found that synthesized teaching speech is more acceptable to learners when it resembles their own speech. Voice modification technology can synthesize such teaching speech from the speech of a standard Chinese speaker and the learner’s own speech. The choice of standard Chinese speaker, however, affects the quality of the synthesized speech. This paper studies how to select the standard Chinese speaker for teaching speech synthesis. Speaker features including MFCC, pitch, and rhythm are compared, and a Gaussian Mixture Model is used to select the most appropriate Chinese speaker. Perceptual experiments show that modification using the Chinese speech most similar to the learner’s speech in MFCC yields the best teaching speech in both phonetic and tonal quality.
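
The abstract does not spell out how the Gaussian Mixture Model is applied, so the following is only a minimal sketch of one plausible reading: fit a small diagonal-covariance GMM to each candidate standard speaker's feature frames (for example MFCC vectors extracted elsewhere), then pick the candidate whose model assigns the highest average log-likelihood to the learner's frames. The function names, the number of mixture components, and the scoring direction are all assumptions for illustration, not the authors' implementation.

```python
import numpy as np


def _component_log_prob(X, weights, means, variances):
    """Log of weight * Gaussian density per frame and component, shape (n, k)."""
    diff = X[:, None, :] - means[None, :, :]
    log_gauss = -0.5 * (
        np.sum(diff ** 2 / variances, axis=2)
        + np.sum(np.log(2.0 * np.pi * variances), axis=1)
    )
    return np.log(weights) + log_gauss


def fit_diag_gmm(X, k=2, iters=50, seed=0):
    """Fit a diagonal-covariance GMM to frames X of shape (n_frames, n_dims) with EM."""
    rng = np.random.default_rng(seed)
    n, _ = X.shape
    # Initialise means from random frames, variances from the global variance.
    means = X[rng.choice(n, size=k, replace=False)]
    variances = np.tile(X.var(axis=0) + 1e-6, (k, 1))
    weights = np.full(k, 1.0 / k)
    for _ in range(iters):
        # E-step: responsibility of each component for each frame.
        log_p = _component_log_prob(X, weights, means, variances)
        log_norm = np.logaddexp.reduce(log_p, axis=1, keepdims=True)
        resp = np.exp(log_p - log_norm)
        # M-step: re-estimate weights, means, and (floored) variances.
        nk = resp.sum(axis=0) + 1e-10
        weights = nk / n
        means = (resp.T @ X) / nk[:, None]
        variances = np.maximum((resp.T @ X ** 2) / nk[:, None] - means ** 2, 1e-6)
    return weights, means, variances


def avg_log_likelihood(X, gmm):
    """Average per-frame log-likelihood of frames X under a fitted GMM."""
    log_p = _component_log_prob(X, *gmm)
    return float(np.logaddexp.reduce(log_p, axis=1).mean())


def select_standard_speaker(learner_frames, candidates, k=2):
    """Return the candidate whose GMM best explains the learner's frames.

    `candidates` maps a hypothetical speaker name to that speaker's
    feature frames, shape (n_frames, n_dims).
    """
    scores = {
        name: avg_log_likelihood(learner_frames, fit_diag_gmm(frames, k=k))
        for name, frames in candidates.items()
    }
    return max(scores, key=scores.get), scores
```

The same scoring could be run separately on MFCC, pitch, or rhythm features to compare which similarity criterion picks which speaker. Feature extraction itself is outside this sketch.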
