论文信息 - Improved automatic English proficiency rating of unconstrained speech with multiple corpora

Improved automatic English proficiency rating of unconstrained speech with multiple corpora

The performance of machine learning classifiers in automatically scoring the English proficiency of unconstrained speech has been explored. Suprasegmental measures were computed by software, which identifies the basic elements of Brazil’s model in human discourse. This paper explores machine learning training with multiple corpora to improve two of those algorithms: prominent syllable detection and tone choice classification. The results show that machine learning training with the Boston University Radio News Corpus can improve automatic English proficiency scoring of unconstrained speech from a Pearson’s correlation of 0.677–0.718. This correlation is higher than any other existing computer programs for automatically scoring the proficiency of unconstrained speech and is approaching that of human raters in terms of inter-rater reliability.

David O. Johnson | Okim Kang | Romy Ghanem

[1] C. D. Gelatt,et al. Optimization by Simulated Annealing , 1983, Science.

[2] T. Landauer,et al. A Solution to Plato's Problem: The Latent Semantic Analysis Theory of Acquisition, Induction, and Representation of Knowledge. , 1997 .

[3] Okim Kang,et al. Relative salience of suprasegmental features on judgments of L2 comprehensibility and accentedness , 2010 .

[4] Jill Burstein,et al. AUTOMATED ESSAY SCORING WITH E‐RATER® V.2.0 , 2004 .

[5] Daniel Povey,et al. The Kaldi Speech Recognition Toolkit , 2011 .

[6] Xiaoming Xi,et al. Automatic scoring of non-native spontaneous speech in tests of spoken English , 2009, Speech Commun..

[7] Martin Chodorow,et al. Computer Analysis of Essay Content for Automated Score Prediction , 1998 .

[8] D. Rubin,et al. Suprasegmental Measures of Accentedness and Judgments of Language Learner Proficiency in Oral English , 2010 .

[9] Claudia Leacock. Scoring Free-Responses Automatically: A Case Study of a Large-Scale Assessment , 2004 .

[10] M. Chodorow,et al. BEYOND ESSAY LENGTH: EVALUATING E-RATER®'S PERFORMANCE ON TOEFL® ESSAYS , 2004 .

[11] Robert Tibshirani,et al. Classification by Pairwise Coupling , 1997, NIPS.