论文信息 - The EURONOUNCE corpus of non-native Polish for ASR-based pronunciation tutoring system

The EURONOUNCE corpus of non-native Polish for ASR-based pronunciation tutoring system

This paper gives a detailed information on the design of the speech corpus for the purpose of developing an ASR-based pronunciation tutoring system. In the first place, assumptions on the structure of the corpus are presented. Then collection of text material, recordings and procedure of annotation of the resulting speech corpus are described. In the end, preliminary results of the analysis of pronunciation errors are discussed. They provide information which is important for ASR training and testing on the one hand, and automatic error detection on the other hand.

[1] I. R. MacKay,et al. Factors affecting strength of perceived foreign accent in a second language. , 1995, The Journal of the Acoustical Society of America.

[2] Kristin Precoda,et al. The SRI EduSpeak System: Recognition and Pronunciation Scoring for Language Learning , 2007 .

[3] Yik-Cheung Tam,et al. PLASER: Pronunciation Learning via Automatic Speech Recognition , 2003, HLT-NAACL 2003.

[4] Oliver Jokisch,et al. The use of CALL in acquiring foreign language pronunciation and prosody - General specifications for Euronounce Project , 2009 .

[5] John C. Wells,et al. Overcoming phonetic interference , 1999 .

[6] Rüdiger Hoffmann,et al. Pronunciation Learning and Foreign Accent Reduction by an Audiovisual Feedback System , 2005, ACII.

[7] Helmer Strik,et al. Automatic Speech Recognition for second language learning: How and why it actually works , 2003 .

[8] Eric Atwell,et al. The ISLE corpus: Italian and German spoken learner's English , 2003 .

[9] Min Liu,et al. A Look at the Research on Computer-Based Technology Use in Second Language Learning , 2002 .

[10] Wolfgang Menzel,et al. Phonetic Rules for Diagnosis of Pronunciation Errors , 2000, KONVENS.