Analyses for elucidating current question answering technology

In this paper, we take a detailed look at the performance of components of an idealized question answering system on two different tasks: the TREC Question Answering task and a set of reading comprehension exams. We carry out three types of analysis: inherent properties of the data, feature analysis, and performance bounds. Based on these analyses we explain some of the performance results of the current generation of Q/A systems and make predictions on future work. In particular, we present four findings: (1) Q/A system performance is correlated with answer repetition; (2) relative overlap scores are more effective than absolute overlap scores; (3) equivalence classes on scoring functions can be used to quantify performance bounds; and (4) perfect answer typing still leaves a great deal of ambiguity for a Q/A system because sentences often contain several items of the same type.
