Named Entity Recognition for Question Answering

Current text-based question answering (QA) systems usually contain a named entity recogniser (NER) as a core component. Named entity recognition has traditionally been developed as a component for information extraction systems, and current techniques are focused on this end use. However, no formal assessment has been done on the characteristics of a NER within the task of question answering. In this paper we present a NER that aims at higher recall by allowing multiple entity labels to strings. The NER is embedded in a question answering system and the overall QA system performance is compared to that of one with a traditional variation of the NER that only allows single entity labels. It is shown that the added noise produced introduced by the additional labels is offset by the higher recall gained, therefore enabling the QA system to have a better chance to find the answer.

[1]  Hwee Tou Ng,et al.  Named Entity Recognition: A Maximum Entropy Approach Using Global Information , 2002, COLING.

[2]  Beth M. Sundheim,et al.  Overview of Results of the MUC-6 Evaluation , 1995, MUC.

[3]  Yorick Wilks,et al.  GATE: an environment to support research and development in natural language engineering , 1996, Proceedings Eighth IEEE International Conference on Tools with Artificial Intelligence.

[4]  Lynette Hirschman,et al.  Deep Read: A Reading Comprehension System , 1999, ACL.

[5]  Timo Järvinen,et al.  A non-projective dependency parser , 1997, ANLP.

[6]  Beth M. Sundheim Overview of results of the MUC-6 evaluation , 1995, MUC.

[7]  Diego Mollá Aliod,et al.  Answerfinder: Question Answering by Combining Lexical, Syntactic and Semantic Information , 2004, ALTA.

[8]  Dan Roth,et al.  Learning Question Classifiers , 2002, COLING.

[9]  Ellen M. Voorhees,et al.  The TREC-8 Question Answering Track Report , 1999, TREC.

[10]  Fernando Llopis,et al.  Reducing Question Answering Input Data Using Named Entity Recognition , 2005, TSD.

[11]  Quintin Armour The role of named entities in text classification , 2005 .

[12]  Diego Molla Aliod,et al.  Learning of Graph-based Question Answering Rules , 2006 .

[13]  Ted Briscoe,et al.  Parser evaluation: a survey and a new proposal , 1998, LREC.

[14]  Diego Mollá Aliod,et al.  Learning of Graph Rules for Question Answering , 2005, ALTA.

[15]  Diego Mollá Aliod,et al.  AnswerFinder at TREC 2005 , 2005, TREC.

[16]  G Demetriou,et al.  Two applications of information extraction to biological science journal articles: enzyme interactions and protein structures. , 1999, Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing.