A Readable Read: Automatic Assessment of Language Learning Materials based on Linguistic Complexity

Corpora and web texts can become a rich language learning resource if we have a means of assessing whether they are linguistically appropriate for learners at a given proficiency level. In this paper, we aim at addressing this issue by presenting the first approach for predicting linguistic complexity for Swedish second language learning material on a 5-point scale. After showing that the traditional Swedish readability measure, L\"asbarhetsindex (LIX), is not suitable for this task, we propose a supervised machine learning model, based on a range of linguistic features, that can reliably classify texts according to their difficulty level. Our model obtained an accuracy of 81.3% and an F-score of 0.8, which is comparable to the state of the art in English and is considerably higher than previously reported results for other languages. We further studied the utility of our features with single sentences instead of full texts since sentences are a common linguistic unit in language learning exercises. We trained a separate model on sentence-level data with five classes, which yielded 63.4% accuracy. Although this is lower than the document level performance, we achieved an adjacent accuracy of 92%. Furthermore, we found that using a combination of different features, compared to using lexical features alone, resulted in 7% improvement in classification accuracy at the sentence level, whereas at the document level, lexical features were more dominant. Our models are intended for use in a freely accessible web-based language learning platform for the automatic generation of exercises.

[1]  Sofie Johansson Kokkinakis,et al.  Introducing the Swedish Kelly-list, a new lexical e-resource for Swedish , 2012, LREC.

[2]  Felice Dell'Orletta,et al.  Assessing the Readability of Sentences: Which Corpora and Features? , 2014, BEA@ACL.

[3]  吉島 茂,et al.  文化と言語の多様性の中のCommon European Framework of Reference for Languages: Learning, teaching, assessment (CEFR)--それは基準か? (第10回明海大学大学院応用言語学研究科セミナー 講演) , 2008 .

[4]  Kirsten Beissner “I see what you mean” , 2003 .

[5]  Arthur C. Graesser,et al.  Coh-Metrix , 2011 .

[6]  Richard Johansson,et al.  Automatic Selection of Suitable Sentences for Language Learning Exercises , 2013 .

[7]  Lars Borin,et al.  A flexible language learning platform based on language resources and web services , 2014, LREC.

[8]  Cédrick Fairon,et al.  An “AI readability” Formula for French as a Foreign Language , 2012, EMNLP.

[9]  Thomas M. Segler Investigating the Selection of Example Sentences for Unknown Target Words in ICALL Reading Texts for L2 German , 2007 .

[10]  António Branco,et al.  Rolling out Text Categorization for Language Learning Assessment Supported by Language Technology , 2014, PROPOR.

[11]  Markus Forsberg,et al.  SALDO: a touch of yin to WordNet’s yang , 2013, Lang. Resour. Evaluation.

[12]  Walt Detmar Meurers,et al.  On Improving the Accuracy of Readability Classification using Insights from Second Language Acquisition , 2012, BEA@NAACL-HLT.

[13]  Walt Detmar Meurers,et al.  Assessing the relative reading level of sentence pairs for text simplification , 2014, EACL.

[14]  Elizabeth Salesky,et al.  Exploiting Morphological, Grammatical, and Semantic Correlates for Improved Text Difficulty Assessment , 2014, BEA@ACL.

[15]  Kevyn Collins-Thompson,et al.  A Language Modeling Approach to Predicting Reading Difficulty , 2004, NAACL.

[16]  Kevyn Collins-Thompson,et al.  Computational Assessment of Text Readability: A Survey of Current and Future Research Running title: Computational Assessment of Text Readability , 2014 .

[17]  Maxine Eskénazi,et al.  Combining Lexical and Grammatical Features to Improve Readability Measures for First and Second Language Texts , 2007, NAACL.

[18]  Yuji Matsumoto MaltParser: A language-independent system for data-driven dependency parsing , 2005 .

[19]  Kevyn Collins-Thompson,et al.  Computational Assessment of Text Readability : A Survey of Past , Present , and Future Research 1 , 2014 .

[20]  Thea van der Geest,et al.  Online Test Tool to Determine the CEFR Reading Comprehension Level of Text , 2013, DSAI.

[21]  Richard Johansson,et al.  Rule-based and machine learning approaches for second language sentence-level readability , 2014, BEA@ACL.

[22]  Mari Ostendorf,et al.  Reading Level Assessment Using Support Vector Machines and Statistical Language Models , 2005, ACL.

[23]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[24]  Jun Ni,et al.  Feature-Based Assessment of Text Readability , 2013, 2013 Seventh International Conference on Internet Computing for Engineering and Science.

[25]  Elena Volodina,et al.  You Get what You Annotate: A Pedagogically Annotated Corpus of Coursebooks for Swedish as a Second Language , 2014 .

[26]  Yi-Ting Huang,et al.  A Robust Estimation Scheme of Reading Difficulty for Second Language Learners , 2011, 2011 IEEE 11th International Conference on Advanced Learning Technologies.

[27]  Arne Jönsson,et al.  Features Indicating Readability in Swedish Text , 2013, NODALIDA.