National University of Singapore at the TREC 13 Question Answering Main Task

In our TREC participation over the past two years, our efforts (Yang et al., 2002, 2003) have focused on incorporating external knowledge to boost document and passage retrieval performance in event-based open-domain question answering (QA). Despite our previous successes, we have identified three weaknesses of our system with respect to this year's task guidelines.

First, our system works at the surface level to extract answers, picking the first occurrence of a string that matches the question target type from the highest-ranked passage. As such, our answer extraction relies heavily on the results of passage retrieval and named entity tagging. However, a passage that contains the correct answer may also contain other strings of the same target type (Light et al., 2001), which can lead to an incorrect string being extracted. A technique is needed to select the answer string that bears the correct relationships to the other words in the question.

Second, our definitional QA system utilizes manually constructed definition patterns. While these patterns are precise in selecting definition sentences, their matching is strict (slot-by-slot matching using regular expressions), and they fail to match correct sentences that exhibit minor variations.

Third, this year's guidelines state that factoid and list questions are not independent; instead, they are all related to given topics. Under such a contextual QA scenario, we need to revise our framework to exploit existing topic-relevant knowledge in answering these questions.

Accordingly, we focus on the following three features in this year's TREC: (1) To give appropriate evidence to answer extraction, we use grammatical dependency relations among question terms to reinforce answer selection. In contrast to previous work on matching dependency relations exactly, we propose to measure the similarity between relations to rank answer strings. Short sketches illustrating these points follow below.
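As a concrete illustration of the first weakness, the following is a minimal sketch (our own, not the system's actual code) of surface-level extraction: return the first string in the top-ranked passage whose named entity type matches the question target type. The tag_entities argument is a hypothetical stand-in for any named entity tagger.

def surface_extract(passages, target_type, tag_entities):
    # passages: passage strings, ranked best-first.
    # tag_entities: hypothetical NE tagger returning (string, type) pairs.
    for passage in passages:
        for text, entity_type in tag_entities(passage):
            if entity_type == target_type:
                return text  # first string of the target type wins, right or wrong
    return None

For "When was Mozart born?" (target type DATE), a passage such as "Mozart died in 1791; he was born in 1756" yields two DATE candidates, and the first-occurrence heuristic returns the wrong one.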
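The second weakness can be seen in a small hedged example; the regular expression below is our invented illustration of a slot-by-slot definition pattern, not one drawn from the actual pattern set.

import re

# An invented slot-by-slot definition pattern: "<TERM> is a/an <definition>".
pattern = re.compile(r"^(?P<term>[A-Z][\w ]+) is an? (?P<definition>.+)$")

print(bool(pattern.match("Aspirin is a drug that relieves pain")))     # True: canonical phrasing
print(bool(pattern.match("Aspirin, a drug that relieves pain, ...")))  # False: appositive variant is missed

The second sentence carries the same definition, but the rigid slot sequence cannot absorb the minor syntactic variation.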
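Finally, for feature (1), here is a minimal sketch, under assumed inputs, of how similarity between dependency relations might rank candidates instead of requiring exact relation matches. rel_paths is a hypothetical helper returning the labeled dependency path between two words in a parsed sentence, and the overlap score is only a stand-in for the actual similarity measure.

def path_similarity(q_path, a_path):
    # Fraction of the question's relation labels preserved in the answer path;
    # a stand-in for a statistical or learned similarity measure.
    return sum(1 for rel in q_path if rel in a_path) / max(len(q_path), 1)

def rank_candidates(question_paths, candidates, rel_paths):
    # question_paths: {question term: dependency path to the wh-word}.
    # candidates: (answer string, parsed answer sentence) pairs.
    scored = []
    for answer, sentence in candidates:
        sims = [path_similarity(q_path, rel_paths(sentence, term, answer))
                for term, q_path in question_paths.items()]
        scored.append((sum(sims) / max(len(sims), 1), answer))
    return [answer for _, answer in sorted(scored, reverse=True)]

Under this scheme, a candidate whose relations to the question terms resemble those in the question outranks a same-type distractor that merely co-occurs in the passage.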