Using the text to evaluate short answers for reading comprehension exercises

Short answer questions for reading comprehension are a common task in foreign language learning. Automatic short answer scoring is the task of automatically assessing the semantic content of a student’s answer, marking it e.g. as correct or incorrect. While previous approaches mainly focused on comparing a learner answer to some reference answer provided by the teacher, we explore the use of the underlying reading texts as additional evidence for the classification. First, we conduct a corpus study targeting the links between sentences in reading texts for learners of German and answers to reading comprehension questions based on those texts. Second, we use the reading text directly for classification, considering three different models: an answer-based classifier extended with textual features, a simple text-based classifier, and a model that combines the two according to confidence of the text-based classification. The most promising approach is the first one, results for which show that textual features improve classification accuracy. While the other two models do not improve classification accuracy, they do investigate the role of the text and suggest possibilities for developing automatic answer scoring systems with less supervision needed from instructors.

[1]  James H. Martin,et al.  Identifying science concepts and student misconceptions in an interactive essay writing tutor , 2012, BEA@NAACL-HLT.

[2]  Walt Detmar Meurers,et al.  Short Answer Assessment: Establishing Links Between Research Strands , 2012, BEA@NAACL-HLT.

[3]  Detmar Meurers,et al.  Integrating parallel analysis modules to evaluate the meaning of answers to reading comprehension questions , 2011 .

[4]  Manaal Faruqui,et al.  Training and Evaluating a German Named Entity Recognizer with Semantic Generalization , 2010, KONVENS.

[5]  Detmar Meurers,et al.  Creation and Analysis of a Reading Comprehension Exercise Corpus : Towards Evaluating Meaning in Context , 2012 .

[6]  Stephen G. Pulman,et al.  Automatic Short Answer Marking , 2005, ACL 2005.

[7]  Walt Detmar Meurers Diagnosing Meaning Errors in Short Answers to Reading Comprehension Questions , 2008 .

[8]  Helmut Schmidt,et al.  Probabilistic part-of-speech tagging using decision trees , 1994 .

[9]  Stacey Bailey,et al.  Content Assessment in Intelligent Computer-aided Language Learning: Meaning Error Diagnosis for English as a Second Language , 2008 .

[10]  Walt Detmar Meurers,et al.  Evaluating the Meaning of Answers to Reading Comprehension Questions: A Semantics-Based Approach , 2012, BEA@NAACL-HLT.

[11]  L. S. Shapley,et al.  College Admissions and the Stability of Marriage , 2013, Am. Math. Mon..

[12]  Walt Detmar Meurers,et al.  Evaluating Answers to Reading Comprehension Questions in Context: Results for German and the Role of Information Structure , 2011, TextInfer@EMNLP.

[13]  Walter Daelemans,et al.  TiMBL: Tilburg Memory-Based Learner, version 2.0, Reference guide , 1998 .

[14]  Rodney D. Nielsen,et al.  Learning to Assess Low-Level Conceptual Understanding , 2008, FLAIRS Conference.

[15]  Rada Mihalcea,et al.  Text-to-Text Semantic Similarity for Automatic Short Answer Grading , 2009, EACL.

[16]  Christian Chiarcos,et al.  A New Hybrid Dependency Parser for German , 2009 .

[17]  Eugene S. Edgington,et al.  Randomization Tests , 2011, International Encyclopedia of Statistical Science.

[18]  Rada Mihalcea,et al.  Learning to Grade Short Answer Questions using Semantic Similarity Measures and Dependency Graph Alignments , 2011, ACL.

[19]  Stephen G. Pulman,et al.  Information Extraction and Machine Learning: Auto-Marking Short Free Text Responses to Science Questions , 2005, AIED.

[20]  Martin Chodorow,et al.  C-rater: Automated Scoring of Short-Answer Questions , 2003, Comput. Humanit..

[21]  Ido Dagan,et al.  Assessing the Role of Discourse References in Entailment Inference , 2010, ACL.

[22]  Helmut Feldweg,et al.  GermaNet - a Lexical-Semantic Net for German , 1997 .

[23]  Christian Chiarcos,et al.  Von der Form zur Bedeutung: Texte automatisch verarbeiten/From Form to Meaning: Processing Texts Automatically , 2009 .