A recent trend in the exploitation of unstructured text content is the use of natural language question answering (NLQA) systems. NLQA is an elaboration of traditional information retrieval techniques for satisfying a user's information needs, where the goal is not simply to retrieve relevant documents but to additionally extract specific passages and semantic entities from these documents as candidate answers to a natural language question. NLQA is thus a tight integration of natural language processing (NLP), information retrieval (IR) and information extraction (IE) designed to circumvent the deep and brittle analysis of questions in favor of shallow but robust comprehension, to ultimately achieve a broad domain question-answering competence. It is argued here that the key to achieving good quality answers in a high-throughput setting lies in a system's ability to construct rich queries that incorporate knowledge from multiple sources.
[1]
Steven J. Maiorano.
Finding Answers in Large Collections of Texts: Paragraph Indexing W Abductive Inference
,
1999
.
[2]
Kristian J. Hammond,et al.
Natural Language Processing in the FAQ Finder System: Results and Prospects
,
1997
.
[3]
Oren Etzioni,et al.
Scaling question answering to the Web
,
2001,
WWW '01.
[4]
George A. Miller,et al.
WordNet: A Lexical Database for English
,
1995,
HLT.
[5]
W. Bruce Croft,et al.
The INQUERY Retrieval System
,
1992,
DEXA.
[6]
Sanda M. Harabagiu,et al.
LASSO: A Tool for Surfing the Answer Net
,
1999,
TREC.