Identifying Current Issues in Short Answer Grading

Given a query answer (e.g. "The entire program.", "main() function."), the task of Short Answer Grading (SAG) is to assign a score (e.g. 5 or 0) reflecting the answer's correctness with respect to a reference answer. SAG is expected to be useful in many real-world applications, such as the automated assessment of student answers in examinations. In recent years, a number of datasets have been released, such as SciEntsBank [3] and X-CSD [5], which has led to the development of a number of computational models for SAG [6, 1]. However, the performance of SAG systems is still limited, which hampers their application in real-world settings. For example, a state-of-the-art system for SciEntsBank achieved a weighted F1 score of only 0.643 in 5-way scoring [7]. Furthermore, the literature has not yet explored what issues remain in building a better SAG system.

This paper aims to make these issues explicit. To this end, we create a simple SAG system that is easy to analyze yet comparable to state-of-the-art systems. We employ a simple k-Nearest Neighbors (kNN)-based system, in which the instances, namely answers, are represented by additive word vectors. Our experiments show that the kNN-based system achieves reasonable performance compared to state-of-the-art approaches. In addition, our detailed analysis of the system's behavior highlights some remaining issues in SAG.
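The kNN scheme described above can be sketched as follows. This is a minimal illustration, not the authors' implementation: the toy word vectors, vocabulary, and labeled answers below are hypothetical stand-ins for the pretrained embeddings and training data a real SAG system would use.

```python
import numpy as np

# Hypothetical toy word vectors; a real system would load pretrained embeddings.
WORD_VECS = {
    "entire":   np.array([1.0, 0.0]),
    "whole":    np.array([0.9, 0.0]),
    "program":  np.array([1.0, 0.1]),
    "main":     np.array([0.0, 1.0]),
    "function": np.array([0.1, 1.0]),
}

def answer_vector(answer, vecs):
    """Represent an answer as the sum of its word vectors (additive composition)."""
    dim = len(next(iter(vecs.values())))
    v = np.zeros(dim)
    for token in answer.lower().split():
        if token in vecs:  # out-of-vocabulary tokens are simply skipped
            v += vecs[token]
    return v

def cosine(a, b):
    """Cosine similarity, with zero vectors mapped to similarity 0."""
    na, nb = np.linalg.norm(a), np.linalg.norm(b)
    if na == 0.0 or nb == 0.0:
        return 0.0
    return float(a @ b / (na * nb))

def knn_grade(query, labeled, vecs, k=3):
    """Score a query answer by majority vote over its k nearest labeled answers."""
    qv = answer_vector(query, vecs)
    ranked = sorted(labeled,
                    key=lambda pair: cosine(qv, answer_vector(pair[0], vecs)),
                    reverse=True)
    top_scores = [score for _, score in ranked[:k]]
    return max(set(top_scores), key=top_scores.count)

# Illustrative labeled answers (score 5 = correct, 0 = incorrect).
labeled = [("the whole program", 5), ("entire program", 5), ("main function", 0)]
print(knn_grade("the entire program", labeled, WORD_VECS, k=3))  # → 5
```

Because the instance representation is just a sum of word vectors, the system's predictions can be traced directly back to the nearest training answers, which is what makes it easy to analyze.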