Selectively Using Relations to Improve Precision in Question Answering

Despite the intuition that linguistically sophisticated techniques should be beneficial to question answering, real gains in performance have yet to be demonstrated empirically in a reliable manner. Systems built around sophisticated linguistic analysis generally perform worse than their linguistically-uninformed cousins. We believe that the key to effective application of natural language processing technology is to selectively employ it only when helpful, without abandoning simpler techniques. To this end, we identify two linguistic phenomena that current information extraction driven systems have difficulty with, and demonstrate how syntactic processing can help. By indexing syntactic relations that can be reliably extracted from corpus text and matching questions with documents at the relation level, we demonstrate that syntactic analysis enables a question answering system to successfully handle these phenomena, thereby improving precision.

[1]  Miss A.O. Penney (b) , 1974, The New Yale Book of Quotations.

[2]  Joel L Fagan,et al.  Experiments in Automatic Phrase Indexing For Document Retrieval: A Comparison of Syntactic and Non-Syntactic Methods , 1987 .

[3]  W. Bruce Croft,et al.  An Approach to Natural Language Processing for Document Retrieval. , 1987, Annual International ACM SIGIR Conference on Research and Development in Information Retrieval.

[4]  Boris Katz,et al.  Using English for Indexing and Retrieving , 1991 .

[5]  Julian Kupiec,et al.  MURAX: a robust linguistic approach for question answering using an on-line encyclopedia , 1993, SIGIR.

[6]  Dekang Lin,et al.  Principle-Based Parsing Without Overgeneration , 1993, ACL.

[7]  Tomek Strzalkowski,et al.  Natural Language Information Retrieval: TREC-8 Report , 1994, TREC.

[8]  Alan F. Smeaton,et al.  Indexing Structures Derived from Syntax in TREC-3: System Description , 1994, TREC.

[9]  Natasa Milic-Frayling,et al.  Evaluation of Syntactic Phrase Indexing -- CLARIT NLP Track Report , 1996, TREC.

[10]  Boris Katz,et al.  Annotating the World Wide Web using Natural Language , 1997, RIAO.

[11]  Avi Arampatzis,et al.  Phrase-based Information Retrieval , 1998 .

[12]  Kenneth C. Litkowski Question-Answering Using Semantic Relation Triples , 1999, TREC.

[13]  Antonio Cisternino,et al.  PiQASso: Pisa Question Answering System , 2001, TREC.

[14]  Gideon S. Mann,et al.  Analyses for elucidating current question answering technology , 2001, Natural Language Engineering.

[15]  Performance Issues and Error Analysis in an Open-Domain Question Answering System , 2002, ACL.