A Text Mining Approach for Definition Question Answering

This paper describes a method for definition question answering based on the use of surface text patterns. The method is specially suited to answer questions about person’s positions and acronym’s descriptions. It considers two main steps. First, it applies a sequence-mining algorithm to discover a set of definition-related text patterns from the Web. Then, using these patterns, it extracts a collection of concept-description pairs from a target document database, and applies the sequence-mining algorithm to determine the most adequate answer to a given question. Experimental results on the Spanish CLEF 2005 data set indicate that this method can be a practical solution for answering this kind of definition questions, reaching a precision as high as 84%.

[1]  David J. Hand,et al.  Pattern Detection and Discovery , 2002, Pattern Detection and Discovery.

[2]  José Francisco Martínez Trinidad,et al.  A New Algorithm for Fast Discovery of Maximal Sequential Patterns in a Document Collection , 2006, CICLing.

[3]  Tat-Seng Chua,et al.  Unsupervised learning of soft patterns for generating definitions from online news , 2004, WWW '04.

[4]  Martin M. Soubbotin Patterns of Potential Answer Expressions as Clues to the Right Answers , 2001, TREC.

[5]  Eduard H. Hovy,et al.  Offline Strategies for Online Question Answering: Answering Questions Before They Are Asked , 2003, ACL.

[6]  Eduard Hovy,et al.  Towards terascale knowledge acquisition , 2004, COLING 2004.

[7]  Helena Ahonen-Myka Discovery of Frequent Word Sequences in Text , 2002, Pattern Detection and Discovery.

[8]  M. de Rijke,et al.  Overview of the CLEF 2005 Multilingual Question Answering Track , 2005, CLEF.

[9]  Paolo Rosso,et al.  INAOE-UPV Joint Participation in CLEF 2005: Experiments in Monolingual Question Answering , 2005, CLEF.

[10]  Horacio Rodríguez Hontoria,et al.  Los sistemas de búsqueda de respuestas desde una perspectiva actual , 2003 .

[11]  Fredric C. Gey,et al.  Accessing Multilingual Information Repositories, 6th Workshop of the Cross-Language Evalution Forum, CLEF 2005, Vienna, Austria, 21-23 September, 2005, Revised Selected Papers , 2006, CLEF.

[12]  Eduard H. Hovy,et al.  Learning surface text patterns for a Question Answering System , 2002, ACL.