Design of the pronunciation dictionary for an English CAPT system

Computer Assisted Pronunciation Training (CAPT) systems judge overall pronunciation quality and point out pronunciation errors by recognizing the user's utterances, with the aim of improving users' oral proficiency. However, the performance of current error detection systems does not meet users' expectations. This paper carefully designs the pronunciation dictionary for the speech recognition engine of a CAPT system, based on observations of the spectral properties of certain phonemes. First, the phonetic symbol /R/ used in some modern English dictionaries is replaced by two symbols, /R/ (as in the word "result") and /ER/ (as in the word "bear"), because of the large spectral deviation between these two cases. Second, rather than representing a consonant cluster as a sequence of consonants, we represent some consonant clusters with a single symbol, according to the phonetic properties of the consonants. Finally, we divide the phoneme /l/ into clear /l/ and dark /l/ according to differences in the manner and place of articulation, and we divide /n/ in the same way. The result is a new pronunciation dictionary that is better suited to the speech recognition engine of the CAPT system. The HMMs are trained on the TIMIT database and evaluated on a database involving 40 Chinese undergraduates. Experimental results demonstrate the effectiveness of the new pronunciation dictionary.
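The three dictionary changes described in the abstract can be illustrated with a minimal Python sketch. The abstract does not give the actual rules or symbol inventory, so everything here is an assumption made for illustration: the vowel set, the example cluster table, the output symbols `L_DARK` and `N_DARK`, the function name `remap_pronunciation`, and the simple "followed by a vowel or not" context test used to choose between the two variants of /R/, /l/ and /n/.

```python
# Illustrative sketch only: symbols and rules below are assumptions,
# not the inventory or rules used in the paper.

# Assumed ARPAbet-style vowel symbols.
VOWELS = {"AA", "AE", "AH", "AO", "AW", "AY", "EH", "EY",
          "IH", "IY", "OW", "OY", "UH", "UW"}

# Hypothetical consonant clusters that are merged into a single symbol.
CLUSTERS = {("T", "R"): "TR", ("D", "R"): "DR"}

# Hypothetical context-dependent variants: (before a vowel, elsewhere).
VARIANTS = {
    "R": ("R", "ER"),
    "L": ("L", "L_DARK"),
    "N": ("N", "N_DARK"),
}


def remap_pronunciation(phones):
    """Rewrite one dictionary pronunciation (a list of phone symbols)."""
    # Step 1: replace each listed two-consonant cluster by one symbol.
    merged = []
    i = 0
    while i < len(phones):
        pair = tuple(phones[i:i + 2])
        if pair in CLUSTERS:
            merged.append(CLUSTERS[pair])
            i += 2
        else:
            merged.append(phones[i])
            i += 1

    # Step 2: split R, L and N according to the following phone.
    result = []
    for i, phone in enumerate(merged):
        if phone in VARIANTS:
            next_phone = merged[i + 1] if i + 1 < len(merged) else None
            before_vowel, elsewhere = VARIANTS[phone]
            result.append(before_vowel if next_phone in VOWELS else elsewhere)
        else:
            result.append(phone)
    return result
```

Under these assumed rules, `remap_pronunciation(["B", "EH", "R"])` ("bear") maps the final /R/ to `ER`, while the initial /R/ in "result" is left as `R`. A real dictionary would need rules derived from the spectral observations the paper refers to.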

Bo Zhang | Jing Xu | Bo Peng | JinXin Liu
