Pronunciation Error Detection Using Dynamic Time Warping Algorithm

In the paper a pronunciation error detection method has been presented, wchich is based on word structural features. A lowcomplexity classifier has been proposed, that is not concentrated on a limited base of error patterns, but is flexible enough to find unspecified mispronunciations. Two classification variants using Dynamic Time Warping (DTW) algorithm has been tested on speech corpus containing recordings of 30 people.

[1]  Ren-Yuan Lyu,et al.  Pronunciation Error Detection for Computer Assisted Pronunciation Teaching in Mandarin , 2008, 2008 6th International Symposium on Chinese Spoken Language Processing.

[2]  Aaron E. Rosenberg,et al.  A connected speech recognition system based on spotting diphone-like segments--Preliminary results , 1987, ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[3]  Shuang Xu,et al.  Automatic pronunciation error detection based on linguistic knowledge and pronunciation space , 2009, 2009 IEEE International Conference on Acoustics, Speech and Signal Processing.

[4]  Keikichi Hirose,et al.  Syllable: A self-contained unit to model pronunciation variation , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[5]  Martin J. Russell,et al.  The STAR system: an interactive pronunciation tutor for young children , 2000, Comput. Speech Lang..

[6]  Ren-Yuan Lyu,et al.  Data-Driven Approach to Pronunciation Error Detection for Computer Assisted Language Teaching , 2007, Seventh IEEE International Conference on Advanced Learning Technologies (ICALT 2007).

[7]  Lin-Shan Lee,et al.  Improved approaches of modeling and detecting Error Patterns with empirical analysis for Computer-Aided Pronunciation Training , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[8]  Helmer Strik,et al.  The goodness of pronunciation algorithm: a detailed performance study , 2009, SLaTE.

[9]  Keikichi Hirose,et al.  Automatic Chinese pronunciation error detection using SVM trained with structural features , 2012, 2012 IEEE Spoken Language Technology Workshop (SLT).

[10]  Silke M. Witt,et al.  Use of speech recognition in computer-assisted language learning , 2000 .

[11]  Grazyna Demenko,et al.  The EURONOUNCE corpus of non-native Polish for ASR-based pronunciation tutoring system , 2009, SLaTE.

[12]  Helmer Strik,et al.  Comparing classifiers for pronunciation error detection , 2007, INTERSPEECH.

[13]  Oliver Jokisch,et al.  An audiovisual feedback system for acquiring L2 pronunciation and L2 prosody , 2009, SLaTE.

[14]  Harriet J. Nock,et al.  Pronunciation modeling by sharing gaussian densities across phonetic models , 1999, EUROSPEECH.

[15]  Grazyna Demenko,et al.  Applying speech and language technology to foreign language education , 2009, 2009 International Multiconference on Computer Science and Information Technology.

[16]  Grazyna Demenko,et al.  The Use of Speech Technology in Foreign Language Pronunciation Training , 2010 .

[17]  Sanjeev Khudanpur,et al.  Pronunciation change in conversational speech and its implications for automatic speech recognition , 2004, Comput. Speech Lang..

[18]  Steve J. Young,et al.  Phone-level pronunciation scoring and assessment for interactive language learning , 2000, Speech Commun..