Automatic selection of informative sentences: The sentences that can generate multiple choice questions

Traditional education cannot meet the expectation and requirement of a Smart City; it require more advance forms like active learning, ICT education etc. Multiple choice questions (MCQs) play an important role in educational assessment and active learning which has a key role in Smart City education. MCQs are effective to assess the understanding of well-defined concepts. A fraction of all the sentences of a text contain well-defined concepts or information that can be asked as a MCQ. These informative sentences are required to be identified first for preparing multiple choice questions manually or automatically. In this paper we propose a technique for automatic identification of such informative sentences that can act as the basis of MCQ. The technique is based on parse structure similarity. A reference set of parse structures is compiled with the help of existing MCQs. The parse structure of a new sentence is compared with the reference structures and if similarity is found then the sentence is considered as a potential candidate. Next a rule-based post-processing module works on these potential candidates to select the final set of informative sentences. The proposed approach is tested in sports domain, where many MCQs are easily available for preparing the reference set of structures. The quality of the system selected sentences is evaluated manually. The experimental result shows that the proposed technique is quite promising.

[1]  Le An Ha,et al.  Generating Multiple-Choice Test Items from Medical Text: A Pilot Study , 2006, INLG.

[2]  David Coniam A Preliminary Inquiry into Using Corpus Word Frequency Data in the Automatic Generation of English Language Cloze Tests , 2013 .

[3]  Le An Ha,et al.  A computer-aided environment for generating multiple-choice test items , 2006, Natural Language Engineering.

[4]  Sujan Kumar Saha,et al.  Automatic Generation of Multiple Choice Questions Using Wikipedia , 2013, PReMI.

[5]  Andreas Papasalouros,et al.  Automatic Generation Of Multiple Choice Questions From Domain Ontologies , 2008, e-Learning.

[6]  Montse Maritxalar,et al.  ArikIturri: An Automatic Question Generator Based on Corpora and NLP Techniques , 2006, Intelligent Tutoring Systems.

[7]  Manish Agarwal,et al.  Automatic Cloze-Questions Generation , 2013, RANLP.

[8]  Maxine Eskénazi,et al.  Automatic Question Generation for Vocabulary Assessment , 2005, HLT.

[9]  Andrew McCallum,et al.  Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data , 2001, ICML.

[10]  Montse Maritxalar,et al.  Automatic Distractor Generation for Domain Specific Texts , 2010, IceTAL.

[11]  Maxine Eskénazi,et al.  Automatic Generation of Cloze Question Stems , 2012, PROPOR.

[12]  D. Nicol E‐assessment by design: using multiple‐choice tests to good effect , 2007 .

[13]  Constantin Gurdgiev,et al.  Smarter Cities for Smarter Growth: How Cities Can Optimize Their Systems for the Talent-Based Economy , 2010 .

[14]  José Ramón Gil-García,et al.  Understanding Smart Cities: An Integrative Framework , 2012, HICSS.

[15]  Carlo Giovannella,et al.  Scenarios for active learning in smart territories , 2013, IxD&A.

[16]  Manish Agarwal,et al.  Automatic Gap-fill Question Generation from Text Books , 2011, BEA@ACL.

[17]  R. Mitkov,et al.  Computer-Aided Generation of Multiple-Choice Tests , 2003, International Conference on Natural Language Processing and Knowledge Engineering, 2003. Proceedings. 2003.

[18]  Ralph Grishman,et al.  A Maximum Entropy Approach to Named Entity Recognition , 1999 .