TransPhoner: automated mnemonic keyword generation

We present TransPhoner, a system that generates keywords for a variety of scenarios, including vocabulary learning, phonetic transliteration, and creative wordplay. We select effective keywords by considering phonetic, orthographic, and semantic word similarity, together with the imageability of the concept each word denotes. In an online vocabulary learning study, keywords provided by TransPhoner improved learner performance, and the improvement was more pronounced for harder words. Participants rated TransPhoner keywords as more helpful than a random keyword baseline and almost as helpful as manually selected keywords. Participant comments also indicated higher engagement in the learning task and a greater desire to continue learning. We demonstrate additional applications to tasks such as pure phonetic transliteration, generation of mnemonics for complex vocabulary, and topic-based transformation of song lyrics.
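The idea of combining word similarity with imageability can be illustrated with a small sketch. This is not TransPhoner's actual scoring model, which the abstract does not specify. It assumes a plain normalized edit distance as a stand-in for orthographic similarity, a hypothetical imageability table with values in [0, 1], and arbitrary illustrative weights. Phonetic and semantic similarity are left out for brevity. The names `rank_keywords`, `w_sim`, and `w_img` are invented for this example.

```python
from typing import Dict, List, Tuple


def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance between two strings (unit costs)."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def similarity(a: str, b: str) -> float:
    """Edit distance rescaled to [0, 1], where 1.0 means identical strings."""
    longest = max(len(a), len(b))
    if longest == 0:
        return 1.0
    return 1.0 - edit_distance(a, b) / longest


def rank_keywords(
    target: str,
    candidates: List[str],
    imageability: Dict[str, float],
    w_sim: float = 0.7,  # assumed weight, not taken from the paper
    w_img: float = 0.3,  # assumed weight, not taken from the paper
) -> List[Tuple[str, float]]:
    """Rank candidates by a weighted sum of similarity and imageability."""
    scored = [
        (c, w_sim * similarity(target, c) + w_img * imageability.get(c, 0.0))
        for c in candidates
    ]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


if __name__ == "__main__":
    # Hypothetical imageability values, made up for illustration only.
    ratings = {"rat": 0.9, "baton": 0.7, "ration": 0.4}
    for word, score in rank_keywords("raton", ["rat", "baton", "ration"], ratings):
        print(f"{word}: {score:.3f}")
```

A weighted sum is only one way to combine the signals. The point of the sketch is that a candidate which closely matches the target word can still be outranked by a slightly less similar word that is easier to picture.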
