Automatic Distractor Generation for Domain Specific Texts

This paper presents a system which uses Natural Language Processing techniques to generate multiple-choice questions. The system implements different methods to find distractors semantically similar to the correct answer. For this task, a corpus-based approach is applied to measure similarities. The target language is Basque and the questions are used for learners' assessment in the science domain. In this article we present the results of an evaluation carried out with learners to measure the quality of the automatically generated distractors.

[1]  Hinrich Schütze,et al.  Automatic Word Sense Discrimination , 1998, Comput. Linguistics.

[2]  Kepa Sarasola,et al.  Semiautomatic Labelling of Semantic Features , 2002, COLING.

[3]  Montse Maritxalar,et al.  ArikIturri: An Automatic Question Generator Based on Corpora and NLP Techniques , 2006, Intelligent Tutoring Systems.

[4]  Eneko Agirre,et al.  Personalizing PageRank for Word Sense Disambiguation , 2009, EACL.

[5]  Le An Ha,et al.  Semantic Similarity of Distractors in Multiple-Choice Tests: Extrinsic Evaluation , 2009 .

[6]  Piek Vossen,et al.  The MEANING Multilingual Central Repository , 2004 .

[7]  Adam Kilgarriff,et al.  Automatic Cloze Generation for English Proficiency Testing , 2009 .

[8]  Peter D. Turney Mining the Web for Synonyms: PMI-IR versus LSA on TOEFL , 2001, ECML.

[9]  Dominic Widdows,et al.  Discovering Corpus-Specific Word Senses , 2003, EACL.

[10]  Danielle S. McNamara,et al.  Handbook of latent semantic analysis , 2007 .

[11]  Michael Heilman,et al.  A Selection Strategy to Improve Cloze Question Quality , 2008 .

[12]  Hiroshi Nakagawa,et al.  Assisting cloze test making with a web application , 2007 .

[13]  Luc De Raedt,et al.  Machine Learning: ECML 2001 , 2001, Lecture Notes in Computer Science.

[14]  Elhuyar Fundazioa,et al.  ZT Corpus Annotation and tools for Basque corpora , .

[15]  Carlo Strapparava,et al.  Domain Kernels for Word Sense Disambiguation , 2005, ACL.

[16]  T. Landauer,et al.  Indexing by Latent Semantic Analysis , 1990 .

[17]  Eiichiro Sumita,et al.  Measuring Non-native Speakers’ Proficiency of English by Using a Test with Automatically-Generated Fill-in-the-Blank Questions , 2005 .

[18]  Hinrich Sch Automatic Word Sense Discrimination , 1998 .