Automatic pronunciation evaluation of foreign speakers using unknown text

Abstract In this study we present various techniques to evaluate the pronunciation of students of a foreign language without any knowledge of the uttered text. Previous attempts have shown that it is feasible to evaluate the pronunciation of a non-native speaker by having implicit or explicit knowledge of the uttered text, provided that enough utterances are available. Our approach is to use characteristics of the mother tongue (SOURCE language) of the speaker in the evaluation of his/her pronunciation. We recorded 20 Greek students speaking English (TARGET language) and evaluated their pronunciation using algorithms that include characteristics of the SOURCE language (Greek). We show that the pronunciation scores that are based on both TARGET- and SOURCE-language characteristics have better correlation with the human scores than those based only on characteristics of the TARGET language. As in previous studies, we found that the best-performing algorithms for automatic evaluation of pronunciation are based on speech recognition technology.

[1]  Yoshinori Yamada,et al.  Assistive speech technology for persons with speech impairments , 2000, Speech Commun..

[2]  Steve J. Young,et al.  Phone-level pronunciation scoring and assessment for interactive language learning , 2000, Speech Commun..

[3]  Steve J. Young,et al.  Estimation of models for non-native speech in computer-assisted language learning based on linear model combination , 1998, ICSLP.

[4]  Mitch Weintraub,et al.  Automatic evaluation and training in English pronunciation , 1990, ICSLP.

[5]  Yoon Kim,et al.  Automatic pronunciation scoring for language instruction , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[6]  M. Witt,et al.  Performance Measures for Phone-level Pronunciation Teaching in Call , 1998 .

[7]  Keikichi Hirose,et al.  Teaching the pronunciation of Japanese double-mora phonemes using speech recognition technology , 2000, Speech Commun..

[8]  Vassilios Digalakis,et al.  Combination of machine scores for automatic grading of pronunciation quality , 2000, Speech Commun..

[9]  Robert V. Brill,et al.  Applied Statistics and Probability for Engineers , 2004, Technometrics.

[10]  Nikos Chatzichrisafis,et al.  Large vocabulary continuous speech recognition in greek: corpus and an automatic dictation system , 2003, INTERSPEECH.

[11]  Keikichi Hirose,et al.  A CALL system using speech recognition to train the pronunciation of Japanese long vowels, the mora nasal and mora obstruents , 1997, EUROSPEECH.

[12]  Steve J. Young,et al.  Language learning based on non-native speech recognition , 1997, EUROSPEECH.

[13]  David B. Pisoni,et al.  Two Experiments on Automatic Scoring of Spoken Language Proficiency , 2000 .

[14]  Lou Boves,et al.  Different aspects of expert pronunciation quality ratings and their relation to scores produced by speech recognition algorithms , 2000, Speech Commun..

[15]  Amir Najmi,et al.  An interactive dialog system for learning Japanese , 2000, Speech Commun..

[16]  Vassilios Digalakis,et al.  Genones: optimizing the degree of mixture tying in a large vocabulary hidden Markov model based speech recognizer , 1994, Proceedings of ICASSP '94. IEEE International Conference on Acoustics, Speech and Signal Processing.

[17]  Keikichi Hirose,et al.  A method for measuring the intelligibility and nonnativeness of phone quality in foreign language pronunciation training , 1998, ICSLP.

[18]  Rodolfo Delmonte,et al.  SLIM prosodic automatic tools for self-learning instruction , 2000, Speech Commun..

[19]  Mitch Weintraub,et al.  Automatic text-independent pronunciation scoring of foreign language student speech , 1996, Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP '96.

[20]  Mitch Weintraub,et al.  Automatic scoring of pronunciation quality , 2000, Speech Commun..

[21]  George R. Doddington CSR Corpus Development , 1992, HLT.