Type Checking in Open-Domain Question Answering

Open domain question answering (QA) systems have to bridge the potential vocabulary mismatch between a question and its candidate answers. One can view this as a recall problem and address it accordingly. Recall oriented strategies to QA may generate considerable amounts of noise. To combat this, many open domain QA systems contain an explicit filtering or re-ranking component, which often check whether the answer is of the correct semantic type. Particular classes of questions expect specific answer types to which all of their answers should belong. We compare two kinds of strategies for answer type checking for open domain QA. One is redundancy-based and builds on the intuition that the amount of implicit knowledge which connects an answer to a question can be estimated by exploiting the redundancy of information available on the web. The other is knowledge-intensive, and exploits structured and semi-structured data sources to determine, with high confidence, the semantic type of suggested answers.

[1]  Bernardo Magnini,et al.  Is It the Right Answer? Exploiting Web Redundancy for Answer Validation , 2002, ACL.

[2]  Scott Miller,et al.  TREC 2002 QA at BBN: Answer Selection and Confidence Estimation , 2002, TREC.

[3]  George A. Miller,et al.  WordNet: A Lexical Database for English , 1995, HLT.

[4]  Dragomir R. Radev,et al.  Question-answering by predictive annotation , 2000, SIGIR '00.

[5]  Sanda M. Harabagiu,et al.  Performance issues and error analysis in an open-domain question answering system , 2003, TOIS.

[6]  Eduard H. Hovy,et al.  Question Answering in Webclopedia , 2000, TREC.

[7]  Jennifer Chu-Carroll,et al.  A Multi-Strategy and Multi-Source Approach to Question Answering , 2002, TREC.

[8]  Donna K. Harman,et al.  The Text REtrieval Conference (TREC) , 1999, NTCIR.

[9]  M. de Rijke,et al.  Tequesta: The University of Amsterdam's Textual Question Answering System , 2001, TREC.

[10]  Bonnie Webber,et al.  Information Fusion for Answering Factoid Questions , 2003 .

[11]  Hinrich Schütze,et al.  Book Reviews: Foundations of Statistical Natural Language Processing , 1999, CL.

[12]  R. Payne Geographic names information system , 1983 .

[13]  Sanda M. Harabagiu,et al.  The Structure and Performance of an Open-Domain Question Answering System , 2000, ACL.

[14]  Valentin Jijkoun,et al.  Answer Selection in a Multi-stream Open Domain Question Answering System , 2004, ECIR.

[15]  Jennifer Chu-Carroll,et al.  IBM's PIQUANT in TREC2003 , 2003, TREC.

[16]  Brigitte Grau,et al.  The Question Answering System QALC at LIMSI, Experiments in Using Web and WordNet , 2002, TREC.

[17]  Jimmy J. Lin,et al.  Question answering from the web using knowledge annotation and knowledge mining techniques , 2003, CIKM '03.