Question Answering for Spanish Based on Lexical and Context Annotation

Question Answering has become a promising research field whose aim is to provide more natural access to the information than traditional document retrieval techniques. In this work, an approach centered in the use of context at a lexical level has been followed in order to identify possible answers to short factoid questions stated by the user in natural language. The methods applied at different stages of the system as well as an architecture for question answering are described. The evaluation of this approach was made following QA@CLEF03 criteria on a corpus of over 200,000 news in Spanish. The paper shows and discusses the results achieved by the system.

[1]  Dragomir R. Radev,et al.  The Use of Predictive Annotation for Question Answering in TREC8 , 1999, TREC.

[2]  Lynette Hirschman,et al.  Natural language question answering: the view from here , 2001, Natural Language Engineering.

[3]  Fernando Llopis,et al.  Question Answering in Spanish , 2003, CLEF.

[4]  Maarten de Rijke,et al.  The Multiple Language Question Answering Track at CLEF 2003 , 2003, CLEF.

[5]  Eduard H. Hovy,et al.  Learning surface text patterns for a Question Answering System , 2002, ACL.

[6]  Horacio Rodríguez Hontoria,et al.  Los sistemas de búsqueda de respuestas desde una perspectiva actual , 2003 .

[7]  Manuel Montes-y-Gómez,et al.  A Language Independent Method for Question Classification , 2004, COLING.

[8]  Carol Peters,et al.  Comparative Evaluation of Multilingual Information Access Systems , 2003, Lecture Notes in Computer Science.

[9]  Ian H. Witten,et al.  Data mining - practical machine learning tools and techniques, Second Edition , 2005, The Morgan Kaufmann series in data management systems.

[10]  Michael Stonebraker,et al.  The Morgan Kaufmann Series in Data Management Systems , 1999 .

[11]  Sergei Nirenburg,et al.  Automatic Question Answering , 2000, RIAO.

[12]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[13]  Manuel Montes-y-Gómez,et al.  Toward a Document Model for Question Answering Systems , 2004, AWIC.

[14]  Alberto Maria Segre,et al.  Programs for Machine Learning , 1994 .

[15]  Vladimir Vapnik,et al.  The Nature of Statistical Learning , 1995 .

[16]  Janusz Kacprzyk,et al.  Advances in Web Intelligence , 2003, Lecture Notes in Computer Science.

[17]  Leo Breiman,et al.  Bagging Predictors , 1996, Machine Learning.

[18]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[19]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques with Java implementations , 2002, SGMD.

[20]  Xavier Carreras,et al.  A Flexible Distributed Architecture for Natural Language Analyzers , 2002, LREC.