论文信息 - Data-driven type checking in open domain question answering - 字舞流文

Data-driven type checking in open domain question answering

Abstract Many open domain question answering systems answer questions by first harvesting a large number of candidate answers, and then picking the most promising one from the list. One criterion for this answer selection is type checking: deciding whether the candidate answer is of the semantic type expected by the question. We define a general strategy for building redundancy-based type checkers, built around the notions of comparison set and scoring method, where the former provide a set of potential answer types and the latter are meant to capture the relation between a candidate answer and an answer type. Our focus is on scoring methods. We discuss nine such methods, provide a detailed experimental comparison and analysis of these methods, and find that the best performing scoring method performs at the same level as knowledge-intensive methods, although our experiments do not reveal a clear-cut answer on the question whether any of the scoring methods we consider should be preferred over the others.

Valentin Jijkoun | M. de Rijke | Maarten de Rijke | Stefan Schlobach | David Ahn | V. Jijkoun | S. Schlobach | David Ahn

[1] Sanda M. Harabagiu,et al. The Structure and Performance of an Open-Domain Question Answering System , 2000, ACL.

[2] Gilad Mishne,et al. Using Wikipedia at the TREC QA Track , 2004, TREC.

[3] Valentin Jijkoun,et al. Answer Selection in a Multi-stream Open Domain Question Answering System , 2004, ECIR.

[4] J. Ross Quinlan,et al. Induction of Decision Trees , 1986, Machine Learning.

[5] Charles L. A. Clarke,et al. Statistical Selection of Exact Answers (MultiText Experiments for TREC 2002) , 2002, TREC.

[6] Bonnie Webber,et al. Information Fusion for Answering Factoid Questions , 2003 .

[7] M. de Rijke,et al. Type Checking in Open-Domain Question Answering , 2004, ECAI.

[8] Kenneth Ward Church,et al. Using Statistics in Lexical Analysis , 2003, Lexical Acquisition: Exploiting On-Line Resources to Build a Lexicon.

[9] George A. Miller,et al. WordNet: A Lexical Database for English , 1995, HLT.

[10] Dragomir R. Radev,et al. Question-answering by predictive annotation , 2000, SIGIR '00.

[11] Bernardo Magnini,et al. Is It the Right Answer? Exploiting Web Redundancy for Answer Validation , 2002, ACL.

[12] Brigitte Grau,et al. The Question Answering System QALC at LIMSI, Experiments in Using Web and WordNet , 2002, TREC.

[13] Walter Daelemans,et al. TiMBL: Tilburg Memory-Based Learner, version 2.0, Reference guide , 1998 .

[14] Eduard H. Hovy,et al. Question Answering in Webclopedia , 2000, TREC.

[15] Tat-Seng Chua,et al. National University of Singapore at the TREC 13 Question Answering Main Task , 2004, TREC.

[16] M. de Rijke,et al. The University of Amsterdam at TREC 2008: Blog, Enterprise, and Relevance Feedback , 2008 .

[17] Jimmy J. Lin,et al. Question answering from the web using knowledge annotation and knowledge mining techniques , 2003, CIKM '03.

[18] Hinrich Schütze,et al. Book Reviews: Foundations of Statistical Natural Language Processing , 1999, CL.

[19] Ellen M. Voorhees,et al. Overview of the TREC 2004 Novelty Track. , 2005 .

[20] M. de Rijke,et al. Tequesta: The University of Amsterdam's Textual Question Answering System , 2001, TREC.

[21] Sanda M. Harabagiu,et al. Performance Issues and Error Analysis in an Open-Domain Question Answering System , 2002, ACL.

[22] Ted Dunning,et al. Accurate Methods for the Statistics of Surprise and Coincidence , 1993, CL.

[23] Gilad Mishne,et al. The University of Amsterdam at the TREC 2003 Question Answering Track , 2003, TREC.

[24] Gilad Mishne,et al. Making Stone Soup: Evaluating a Recall-Oriented Multi-stream Question Answering System for Dutch , 2004, CLEF.

[25] C. J. van Rijsbergen,et al. A Case Study for Automatic Query Expansion Based on Divergence , 2004 .

[26] Jennifer Chu-Carroll,et al. A Multi-Strategy and Multi-Source Approach to Question Answering , 2002, TREC.

[27] Donna K. Harman,et al. The Text REtrieval Conference (TREC) , 1999, NTCIR.

[28] R. Payne. Geographic names information system , 1983 .

[29] Jennifer Chu-Carroll,et al. IBM's PIQUANT in TREC2003 , 2003, TREC.

[30] Scott Miller,et al. TREC 2002 QA at BBN: Answer Selection and Confidence Estimation , 2002, TREC.