User Modeling in Language Learning with Macaronic Texts

Foreign language learners can acquire new vocabulary by using cognate and context clues when reading. To measure such incidental comprehension, we devise an experimental framework that involves reading mixed-language “macaronic” sentences. Using data collected via Amazon Mechanical Turk, we train a graphical model to simulate a human subject’s comprehension of foreign words, based on cognate clues (edit distance to an English word), context clues (pointwise mutual information), and prior exposure. Our model does a reasonable job at predicting which words a user will be able to understand, which should facilitate the automatic construction of comprehensible text for personalized foreign language education.

[1]  Philipp Koehn,et al.  Creating Interactive Macaronic Interfaces for Language Learning , 2016, ACL.

[2]  J. Bruner,et al.  The role of tutoring in problem solving. , 1976, Journal of child psychology and psychiatry, and allied disciplines.

[3]  Hod Lipson,et al.  Generating Code-switched Text for Lexical Learning , 2014, ACL.

[4]  Philipp Koehn,et al.  Analyzing Learner Understanding of Novel L2 Vocabulary , 2016, CoNLL.

[5]  Nitin Madnani,et al.  The E-rater® Automated Essay Scoring System , 2013 .

[6]  S. Krashen We Acquire Vocabulary and Spelling by Reading: Additional Evidence for the Input Hypothesis , 1989 .

[7]  Nitin Madnani,et al.  Identifying High-Level Organizational Elements in Argumentative Discourse , 2012, NAACL.

[8]  Markus Dreyer,et al.  Graphical Models over Multiple Strings , 2009, EMNLP.

[9]  Philipp Koehn,et al.  Moses: Open Source Toolkit for Statistical Machine Translation , 2007, ACL.

[10]  Michael I. Jordan,et al.  Loopy Belief Propagation for Approximate Inference: An Empirical Study , 1999, UAI.

[11]  Jeffrey Pennington,et al.  GloVe: Global Vectors for Word Representation , 2014, EMNLP.

[12]  Hal Daumé,et al.  Frustratingly Easy Domain Adaptation , 2007, ACL.

[13]  Philipp Koehn,et al.  Findings of the 2015 Workshop on Statistical Machine Translation , 2015, WMT@EMNLP.

[14]  Gözde Özbal,et al.  Automation and Evaluation of the Keyword Method for Second Language Learning , 2014, ACL.

[15]  Diane J. Litman,et al.  Natural Language Processing for Enhancing Teaching and Learning , 2016, AAAI.

[16]  Stephen J. Wright,et al.  Hogwild: A Lock-Free Approach to Parallelizing Stochastic Gradient Descent , 2011, NIPS.

[17]  Chris Callison-Burch,et al.  Open Source Toolkit for Statistical Machine Translation: Factored Translation Models and Lattice Decoding , 2006 .

[18]  Nitin Madnani,et al.  ETS: Discriminative Edit Models for Paraphrase Scoring , 2012, *SEMEVAL.