A System for Generating Multiple Choice Questions: With a Novel Approach for Sentence Selection

Multiple Choice Question (MCQ) plays a major role in educational assessment as well as in active learning. In this paper we present a system that generates MCQs automatically using a sports domain text as input. All the sentences in a text are not capable of generating MCQs; the first step of the system is to select the informative sentences. We propose a novel technique to select informative sentences by using topic modeling and parse structure similarity. The parse structure similarity is computed between the parse structure of an input sentence and a set of reference parse structures. In order to compile the reference set we use a number of existing MCQs collected from the web. Keyword selection is done with the help of occurrence of domain specific word and named entity word in the sentence. Distractors are generated using a set of rules and name dictionary. Experimental results demonstrate that the proposed technique is quite accurate.

[1]  Andreas Papasalouros,et al.  Automatic Generation Of Multiple Choice Questions From Domain Ontologies , 2008, e-Learning.

[2]  Manish Agarwal,et al.  Automatic Gap-fill Question Generation from Text Books , 2011, BEA@ACL.

[3]  Le An Ha,et al.  A computer-aided environment for generating multiple-choice test items , 2006, Natural Language Engineering.

[4]  Andrey Kurtasov A System for Generating Cloze Test Items from Russian-Language Text , 2013, RANLP.

[5]  Michael Heilman,et al.  A Selection Strategy to Improve Cloze Question Quality , 2008 .

[6]  Mukta Majumder,et al.  Automatic selection of informative sentences: The sentences that can generate multiple choice questions , 2014 .

[7]  Sujan Kumar Saha,et al.  Automatic Generation of Multiple Choice Questions Using Wikipedia , 2013, PReMI.

[8]  Beatrice Santorini,et al.  Part-of-Speech Tagging Guidelines for the Penn Treebank Project (3rd Revision) , 1990 .

[9]  Manish Agarwal,et al.  Automatic Question Generation using Discourse Cues , 2011, BEA@ACL.

[10]  R. Mitkov,et al.  Computer-Aided Generation of Multiple-Choice Tests , 2003, International Conference on Natural Language Processing and Knowledge Engineering, 2003. Proceedings. 2003.

[11]  Maxine Eskénazi,et al.  Automatic Generation of Cloze Question Stems , 2012, PROPOR.

[12]  D. Nicol E‐assessment by design: using multiple‐choice tests to good effect , 2007 .

[13]  Montse Maritxalar,et al.  Automatic Distractor Generation for Domain Specific Texts , 2010, IceTAL.

[14]  David Coniam A Preliminary Inquiry into Using Corpus Word Frequency Data in the Automatic Generation of English Language Cloze Tests , 2013 .

[15]  Beatrice Santorini Part-of-speech tagging guidelines for the penn treebank project , 1990 .

[16]  Le An Ha,et al.  Generating Multiple-Choice Test Items from Medical Text: A Pilot Study , 2006, INLG.

[17]  Montse Maritxalar,et al.  ArikIturri: An Automatic Question Generator Based on Corpora and NLP Techniques , 2006, Intelligent Tutoring Systems.

[18]  Christopher D. Manning,et al.  The Stanford Typed Dependencies Representation , 2008, CF+CDPE@COLING.

[19]  Maxine Eskénazi,et al.  Automatic Question Generation for Vocabulary Assessment , 2005, HLT.

[20]  Manish Agarwal,et al.  Automatic Cloze-Questions Generation , 2013, RANLP.