Towards Automatic Scoring of Non-Native Spontaneous Speech

This paper investigates the feasibility of automated scoring of spoken English proficiency of non-native speakers. Unlike existing automated assessments of spoken English, our data consists of spontaneous spoken responses to complex test items. We perform both a quantitative and a qualitative analysis of these features using two different machine learning approaches. (1) We use support vector machines to produce a score and evaluate it with respect to a mode baseline and to human rater agreement. We find that scoring based on support vector machines yields accuracies approaching inter-rater agreement in some cases. (2) We use classification and regression trees to understand the role of different features and feature classes in the characterization of speaking proficiency by human scorers. Our analysis shows that across all the test items most or all the feature classes are used in the nodes of the trees suggesting that the scores are, appropriately, a combination of multiple components of speaking proficiency. Future research will concentrate on extending the set of features and introducing new feature classes to arrive at a scoring model that comprises additional relevant aspects of speaking proficiency.

[1]  Helmer Strik,et al.  Using speech recognition technology to assess foreign speakers' pronunciation of Dutch , 1997 .

[2]  David B. Pisoni,et al.  Two Experiments on Automatic Scoring of Spoken Language Proficiency , 2000 .

[3]  Helmer Strik,et al.  Automatic evaluation of Dutch pronunciation by using speech recognition technology , 1997, 1997 IEEE Workshop on Automatic Speech Recognition and Understanding Proceedings.

[4]  Xiaoming Xi,et al.  Extracting meaningful speech features to support diagnostic feedback: an ECD approach to automated scoring , 2006 .

[5]  Janet Holmes,et al.  Sociolinguistics : selected readings , 1972 .

[6]  Lyle F. Bachman 语言测试要略 = Fundamental considerations in language testing , 1990 .

[7]  M. Swain,et al.  THEORETICAL BASES OF COMMUNICATIVE APPROACHES TO SECOND LANGUAGE TEACHING AND TESTING , 1980 .

[8]  Vladimir Cherkassky,et al.  The Nature Of Statistical Learning Theory , 1997, IEEE Trans. Neural Networks.

[9]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[10]  Kristin Precoda,et al.  The SRI EduSpeak System: Recognition and Pronunciation Scoring for Language Learning , 2007 .

[11]  神保 尚武 言語運用能力(Communicative Competence) , 1987 .

[12]  Jill Burstein,et al.  Automated Essay Scoring : A Cross-disciplinary Perspective , 2003 .

[13]  Lawrence M. Rudner,et al.  An Overview of Three Approaches to Scoring Written Essays by Computer. ERIC Digest. , 2001 .

[14]  Brian North,et al.  The development of a common framework scale of language proficiency , 2000 .