3 Comparing Improved Language Models for Sentence Retrieval in Question Answering

A retrieval system is an important part of a question answering framework. It reduces the number of documents to be considered for finding an answer. For further refinement, the documents are split into smaller chunks to deal with topic variability in larger documents. In our case, we divided the documents into single sentences. A language model based approach was then used to re-rank the sentence collection. For this purpose, we developed a new language model toolkit. It implements all standard language modeling techniques and is more flexible than other tools in terms of backing-off strategies, model combinations, and design of the retrieval vocabulary. With the aid of this toolkit we conducted re-ranking experiments with standard language model based smoothing methods. On top of these algorithms we developed some new, improved models including dynamic stop word reduction and stemming. We also experimented with query expansion depending on the type of a query. On a TREC corpus, we demonstrate that our proposed approaches provide a performance superior to the standard methods. In terms of ...

Proceedings of the 17th Meeting of Computational Linguistics in the Netherlands. Edited by: Peter Dirix, Ineke Schuurman, Vincent Vandeghinste, and Frank Van Eynde. Copyright © 2007 by the individual authors.
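The sentence re-ranking step described in the abstract can be illustrated with a minimal sketch. It is not the authors' toolkit: it assumes a query-likelihood ranking with Jelinek-Mercer (linear interpolation) smoothing, chosen here only as one commonly used smoothing method, since the abstract does not name the methods compared. The function names `tokenize` and `rerank`, the interpolation weight `lam`, and the add-one smoothing of the background model are all hypothetical choices made for this example.

```python
import math
from collections import Counter


def tokenize(text):
    """Lowercase, split on whitespace, and strip surrounding punctuation."""
    tokens = (tok.strip(".,;:!?\"'()") for tok in text.lower().split())
    return [tok for tok in tokens if tok]


def rerank(query, sentences, lam=0.5):
    """Return sentences ordered by smoothed query log-likelihood, best first.

    lam is an assumed interpolation weight between the sentence model
    and the background (collection) model; it is not taken from the paper.
    """
    tokenized = [tokenize(s) for s in sentences]

    # Background model estimated over the whole sentence collection.
    collection = Counter(tok for toks in tokenized for tok in toks)
    collection_size = sum(collection.values())
    vocab_size = len(collection)
    query_terms = tokenize(query)

    scored = []
    for sentence, toks in zip(sentences, tokenized):
        counts = Counter(toks)
        score = 0.0
        for term in query_terms:
            # Maximum-likelihood estimate from the sentence itself.
            p_sentence = counts[term] / len(toks) if toks else 0.0
            # Add-one smoothing (with one extra slot for unseen words)
            # keeps the background probability above zero.
            p_background = (collection[term] + 1) / (
                collection_size + vocab_size + 1
            )
            # Jelinek-Mercer interpolation of the two estimates.
            score += math.log(lam * p_sentence + (1 - lam) * p_background)
        scored.append((score, sentence))

    ranked = sorted(scored, key=lambda pair: pair[0], reverse=True)
    return [sentence for _, sentence in ranked]
```

Calling `rerank("capital of France", sentences)` on a small list of sentences returns them ordered from most to least likely to have generated the query. Stop word reduction, stemming, and query expansion, which the abstract mentions as improvements, would plug in at the `tokenize` step or by extending `query_terms`.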

A. Merkel | D. Klakow
