论文信息 - Automatic Pronunciation Error Detection Based on Extended Pronunciation Space Using the Unsupervised Clustering of Pronunciation Errors

Automatic Pronunciation Error Detection Based on Extended Pronunciation Space Using the Unsupervised Clustering of Pronunciation Errors

Calculating posterior probability within a standard pronunciation space (SPS) is a common method in automatic pronunciation error detection (APED). However, to pronunciation errors outside the SPS, this kind of methods can only give an approximate solution, that may be not right in many applications. This paper expands the SPS to include more pronunciation errors, introduces a Bhattacharyya distance based clustering of pronunciation errors, and thus refines more detailed acoustic models for APED within the extended pronunciation space (EPS). The relationship between the performance of APED system and the number of cluster or the size of the EPS is well studied. The experimental results show that, compared with the APED based on the SPS, the APED based on the EPS using adaptive unsupervised clustering of pronunciation errors can achieve a better performance and the average scoring error rate (ASER) decreases from 0.412 to 0.301, relatively reducing by 26.94%.

Long Zhang | Haifeng Li

[1] Seiichi Nakagawa,et al. Automatic evaluation of English pronunciation by Japanese speakers using various acoustic features and pattern recognition techniques , 2010, INTERSPEECH.

[2] Yu Hu,et al. Pronunciation Space Models for Pronunciation Evaluation , 2008, 2008 6th International Symposium on Chinese Spoken Language Processing.

[3] Wang Ren-hua. The Electronic PSC Testing System , 2006 .

[4] Kiyohiro Shikano,et al. Isolated word recognition using phoneme-like templates , 1983, ICASSP.

[5] Satoshi Nakamura,et al. Automatic pronunciation scoring of words and sentences independent from the non-native's first language , 2009, Comput. Speech Lang..

[6] Ye Weiping. Survey of Automatic Pronunciation Error Detection , 2009 .

[7] Helmer Strik,et al. Comparing different approaches for automatic pronunciation error detection , 2009, Speech Commun..

[8] T. Kailath. The Divergence and Bhattacharyya Distance Measures in Signal Selection , 1967 .