Question Answering by Searching Large Corpora With Linguistic Methods

In this paper we describe the QuALiM Question Answering system which uses linguistic analysis of questions as well as candidate sentences in its answer finding process. To this end we have developed a rephrasing algorithm based on linguistic patterns that describe the structure of questions and candidate sentences and where precisely to find the answer in the candidate sentences. With this method and a fall-back strategy, both using the web as their primary data source, we participated in TREC 2004. We present our official results and a follow-up evaluation to elucidate the contribution of the methods used.

[1]  Ellen M. Voorhees,et al.  Overview of the TREC-9 Question Answering Track , 2000, TREC.

[2]  Sanda M. Harabagiu,et al.  Answer Mining by Combining Extraction Techniques with Abductive Reasoning , 2003, Text Retrieval Conference.

[3]  Jimmy J. Lin,et al.  Omnibase: Uniform Access to Heterogeneous Data for Question Answering , 2002, NLDB.

[4]  Zhiping Zheng,et al.  AnswerBus question answering system , 2002 .

[5]  Leonard Bolc,et al.  Natural language question answering systems , 1980 .

[6]  James A. Hendler,et al.  The Semantic Web" in Scientific American , 2001 .

[7]  Harris Wu,et al.  Probabilistic question answering on the web , 2002, WWW '02.

[8]  Oren Etzioni,et al.  Scaling question answering to the Web , 2001, WWW '01.

[9]  Ellen M. Voorhees,et al.  The TREC-8 Question Answering Track Report , 1999, TREC.

[10]  Daniel Dominic Sleator,et al.  Parsing English with a Link Grammar , 1995, IWPT.

[11]  John D. Lafferty,et al.  A Robust Parsing Algorithm for Link Grammars , 1995, IWPT.

[12]  Sanda M. Harabagiu,et al.  COGEX: A Logic Prover for Question Answering , 2003, NAACL.

[13]  Martin M. Soubbotin Patterns of Potential Answer Expressions as Clues to the Right Answers , 2001, TREC.

[14]  Boris Katz,et al.  Annotating the World Wide Web using Natural Language , 1997, RIAO.

[15]  Ellen M. Voorhees,et al.  Overview of the TREC 2004 Novelty Track. , 2005 .

[16]  Martin M. Soubbotin,et al.  Use of Patterns for Detection of Likely Answer Strings: A Systematic Approach , 2002, TREC.

[17]  Bernardo Magnini,et al.  Combining Linguistic Processing and Web Mining for Question Answering: ITC-irst at TREC 2004 , 2004, TREC.

[18]  Dania Egedi,et al.  A freely available wide coverage morphological analyzer for English , 1992, COLING 1992.

[19]  Oren Etzioni,et al.  Scaling question answering to the Web , 2001, WWW '01.

[20]  George A. Miller,et al.  Introduction to WordNet: An On-line Lexical Database , 1990 .

[21]  Harris Wu,et al.  Probabilistic question answering on the Web , 2005, J. Assoc. Inf. Sci. Technol..

[22]  Jimmy J. Lin,et al.  Web question answering: is more always better? , 2002, SIGIR '02.

[23]  Daniel Marcu,et al.  Multiple-Engine Question Answering in TextMap , 2003, TREC.