Feature selection for automated speech scoring

Automated scoring systems used for the evaluation of spoken or written responses in language assessments need to balance good empirical performance with the interpretability of the scoring models. We compare several methods of feature selection for such scoring systems and show that the use of shrinkage methods such as Lasso regression makes it possible to rapidly build models that both satisfy the requirements of validity and intepretability, crucial in assessment contexts as well as achieve good empirical performance.

[1]  Jian Cheng,et al.  Automatic Assessment of the Speech of Young English Learners , 2014, BEA@ACL.

[2]  Martin Chodorow,et al.  Automated Scoring Using A Hybrid Feature Identification Technique , 1998, ACL.

[3]  Maxine Eskénazi,et al.  An overview of spoken language technology for education , 2009, Speech Commun..

[4]  Charles L. Lawson,et al.  Solving least squares problems , 1976, Classics in applied mathematics.

[5]  J. Goeman L1 Penalized Estimation in the Cox Proportional Hazards Model , 2009, Biometrical journal. Biometrische Zeitschrift.

[6]  Charles L. Lawson,et al.  Solving least squares problems , 1976, Classics in applied mathematics.

[7]  Jill Burstein,et al.  Automated Essay Scoring : A Cross-disciplinary Perspective , 2003 .

[8]  Stan Lipovetsky,et al.  Linear regression with special coefficient features attained via parameterization in exponential, logistic, and multinomial-logit forms , 2009, Math. Comput. Model..

[9]  David M. Williamson,et al.  Automated essay scoring: Psychometric guidelines and practices , 2013 .

[10]  D. Ruppert The Elements of Statistical Learning: Data Mining, Inference, and Prediction , 2004 .

[11]  R. Tibshirani Regression Shrinkage and Selection via the Lasso , 1996 .

[12]  David M. Williamson,et al.  A Framework for Evaluation and Use of Automated Scoring , 2012 .

[13]  Xiaoming Xi,et al.  Automatic scoring of non-native spontaneous speech in tests of spoken English , 2009, Speech Commun..

[14]  R. Tibshirani,et al.  Regression shrinkage and selection via the lasso: a retrospective , 2011 .

[15]  Mee Young Park,et al.  L1‐regularization path algorithm for generalized linear models , 2007 .

[16]  Jian Cheng,et al.  Validating automated speaking tests , 2010 .

[17]  William Wresch,et al.  The Imminence of Grading Essays by Computer-25 Years Later , 1993 .