This paper concerns about Question Answering system in which a statistical language modeling approach is used. The main objective is to build a simple system for question answering without the need for highly tuned linguistic modules which need more human work and is very difficult to find any bugs if any. A mathematical model for answer retrieval and answer extraction is derived, which does not use any linguistic information or annotated data. It makes use of word tokens and web data. We take a statistical, noisy-channel approach and treat QA as a whole as a classification problem. We present a fully data-driven mathematical model for estimating the probability of a candidate answer given a question. In doing so we largely remove the need for ad-hoc weights and parameters that were a feature of many TREC systems.
[1]
Dietrich Klakow,et al.
A statistical classification approach to question answering using Web data
,
2005,
2005 International Conference on Cyberworlds (CW'05).
[2]
Sadaoki Furui,et al.
TREC 2005 Question Answering Experiments at Tokyo Institute of Technology
,
2007,
TREC.
[3]
Eric Brill,et al.
Automatic Question Answering: Beyond the Factoid
,
2004,
NAACL.
[4]
Adwait Ratnaparkhi,et al.
IBM's Statistical Question Answering System
,
2000,
TREC.
[5]
Susan T. Dumais,et al.
An Analysis of the AskMSR Question-Answering System
,
2002,
EMNLP.
[6]
Salim Roukos,et al.
IBM's Statistical Question Answering System-TREC 11
,
2001,
TREC.