Automatic Identification of Nutritious Contexts for Learning Vocabulary Words

Vocabulary knowledge is crucial to literacy development and academic success. Previous research has shown learning the meaning of a word requires encountering it in diverse informative contexts. In this work, we try to identify “nutritious” contexts for a word – contexts that help students build a rich mental representation of the word’s meaning. Using crowdsourced ratings of vocabulary contexts retrieved from the web, AVER learns models to score unseen contexts for unseen words. We specify the features used in the models, measure their individual informativeness, evaluate AVER’s cross-validated accuracy in scoring contexts for unseen words, and compare its agreement with the human ratings against the humans’ agreement with each other. The automated scores are not good enough to replace human ratings, but should reduce human effort by identifying contexts likely to be worth rating by hand, subject to a tradeoff between the number of contexts inspected by hand, and how many of them a human judge will consider nutritious.

[1]  Karen Mezynski Issues Concerning the Acquisition of Knowledge: Effects of Vocabulary Training on Reading Comprehension , 1983 .

[2]  Margaret G. McKeown,et al.  Bringing Words to Life: Robust Vocabulary Instruction , 2002 .

[3]  Jack Mostow,et al.  Generating Example Contexts to Illustrate a Target Word Sense , 2011, BEA@ACL.

[4]  Paul van den Broek,et al.  Reading for Meaning: Fostering Comprehension in the Middle Grades. Language and Literacy Series. , 1999 .

[5]  John R. Anderson,et al.  Practice and Forgetting Effects on Vocabulary Memory: An Activation-Based Model of the Spacing Effect , 2005, Cogn. Sci..

[6]  Liu Liu,et al.  Generating example contexts to help children learn word meaning† , 2012, Natural Language Engineering.

[7]  Steven A. Stahl,et al.  The Effects of Vocabulary Instruction: A Model-Based Meta-Analysis , 1986 .

[8]  M. Maby,et al.  Vocabulary Learning and Instruction , 2019 .

[9]  David Reinking,et al.  The Effects of Computer-Mediated Texts on the Vocabulary Learning and Comprehension of Intermediate-Grade Readers , 1990 .

[10]  Yoav Goldberg,et al.  A Dataset of Syntactic-Ngrams over Time from a Very Large Corpus of English Books , 2013, *SEMEVAL.

[11]  Gregory Aist,et al.  Helping Children Learn Vocabulary during Computer-Assisted Oral Reading , 2000, J. Educ. Technol. Soc..

[12]  Maxine Eskenazi,et al.  Retrieval of Authentic Documents for Reader-Specific Lexical Practice , 2004 .

[13]  Mihai Surdeanu,et al.  The Stanford CoreNLP Natural Language Processing Toolkit , 2014, ACL.

[14]  William E. Nagy,et al.  Teaching Word Meanings , 2005 .

[15]  Donald Shankweiler,et al.  Beyond Phonological Processes: Print Exposure and Orthographic Processing , 2013 .

[16]  A. Biemiller,et al.  An effective method for building meaning vocabulary in primary grades , 2006 .

[17]  Jack Mostow,et al.  Inferring Selectional Preferences from Part-Of-Speech N-grams , 2012, EACL.

[18]  Peter Kolb,et al.  DISCO: A Multilingual Database of Distributionally Similar Words , 2008 .