Passage Scoring for Question Answering via Bayesian Inference on Lexical Relations

Many researchers have used lexical networks and ontologies to mitigate synonymy and polysemy problems in Question Answering (QA), systems coupled with taggers, query classifiers, and answer extractors in complex and ad-hoc ways. We seek to make QA systems reproducible with shared and modest human effort, carefully separating knowledge from algorithms. To this end, we propose an aesthetically “clean” Bayesian inference scheme for exploiting lexical relations for passage-scoring for QA . The factors which contribute to the efficacy of Bayesian Inferencing on lexical relations are soft word sense disambiguation, parameter smoothing which ameliorates the data sparsity problem and estimation of joint probability over words which overcomes the deficiency of naive-bayes-like approaches.

[1]  Hang Li,et al.  Learning Word Association Norms Using Tree Cut Pair Models , 1996, ICML.

[2]  Judea Pearl,et al.  Probabilistic reasoning in intelligent systems - networks of plausible inference , 1991, Morgan Kaufmann series in representation and reasoning.

[3]  J. Wiebe Constructing Bayesian Networks from WordNet for Word-SenseDisambiguation : Representational and Processing Issues , 1998 .

[4]  Sanda M. Harabagiu,et al.  FALCON: Boosting Knowledge for Answer Engines , 2000, TREC.

[5]  Charles L. A. Clarke,et al.  Exploiting redundancy in question answering , 2001, SIGIR '01.

[6]  Boris Katz,et al.  From Sentence Processing to Information Access on the World Wide Web , 1997 .

[7]  Mark Sanderson,et al.  Word sense disambiguation and information retrieval , 1994, SIGIR '94.

[8]  Oren Etzioni,et al.  Scaling question answering to the Web , 2001, WWW '01.

[9]  Ellen M. Voorhees,et al.  Overview of the TREC-9 Question Answering Track , 2000, TREC.

[10]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[11]  Christiane Fellbaum,et al.  Book Reviews: WordNet: An Electronic Lexical Database , 1999, CL.

[12]  Thorsten Brants,et al.  Natural Language Processing in Information Retrieval , 2003, CLIN.

[13]  Pushpak Bhattacharyya,et al.  Text Representation with WordNet Synsets Using Soft Sense Disambiguation , 2003, Ingénierie des Systèmes d Inf..

[14]  Chris Buckley,et al.  Implementation of the SMART Information Retrieval System , 1985 .

[15]  Adwait Ratnaparkhi,et al.  A Maximum Entropy Model for Part-Of-Speech Tagging , 1996, EMNLP.