Incorporating prior knowledge into a transductive ranking algorithm for multi-document summarization
This paper presents a transductive approach to learning ranking functions for extractive multi-document summarization. In the first stage, the approach identifies topic themes within a document collection and uses them to select two initial sets of sentences: those relevant to a question and those irrelevant to it. It then iteratively trains a ranking function over these two sets by optimizing a ranking loss while fitting a prior model built on keywords. The output of the function is used to find further relevant and irrelevant sentences, and the process repeats until a stopping criterion is met.
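The iterative procedure described in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the linear scoring function, the pairwise hinge ranking loss, the squared-error prior term, the fixed number of sentences added per round, and the stopping rule are all choices made here for the example. The names `train_ranker`, `transductive_rank`, `lam`, and `grow` are hypothetical. The seed sets (which the paper derives from topic themes) and the keyword-based prior scores are taken as given inputs.

```python
def dot(w, x):
    return sum(wi * xi for wi, xi in zip(w, x))


def train_ranker(X, relevant, irrelevant, prior, lam=0.5, lr=0.1, epochs=200):
    """Fit a linear scorer by subgradient descent (assumed form, not the paper's).

    Objective: mean pairwise hinge loss over (relevant, irrelevant) pairs
    plus lam * mean squared distance between scores and prior scores.
    """
    dim = len(X[0])
    w = [0.0] * dim
    n_pairs = max(1, len(relevant) * len(irrelevant))
    for _ in range(epochs):
        grad = [0.0] * dim
        # Ranking term: a relevant sentence should outscore an irrelevant
        # one by a margin of 1.
        for i in relevant:
            for j in irrelevant:
                if dot(w, X[i]) - dot(w, X[j]) < 1.0:
                    for k in range(dim):
                        grad[k] -= (X[i][k] - X[j][k]) / n_pairs
        # Prior term: keep scores close to the keyword-based prior.
        for i in range(len(X)):
            err = dot(w, X[i]) - prior[i]
            for k in range(dim):
                grad[k] += lam * 2.0 * err * X[i][k] / len(X)
        w = [wk - lr * gk for wk, gk in zip(w, grad)]
    return w


def transductive_rank(X, seed_rel, seed_irr, prior, grow=1, max_rounds=10):
    """Alternate between training and enlarging the two sentence sets."""
    relevant, irrelevant = set(seed_rel), set(seed_irr)
    w = train_ranker(X, sorted(relevant), sorted(irrelevant), prior)
    for _ in range(max_rounds):
        unlabeled = [i for i in range(len(X))
                     if i not in relevant and i not in irrelevant]
        # Assumed stopping criterion: too few unlabeled sentences remain.
        if len(unlabeled) < 2 * grow:
            break
        unlabeled.sort(key=lambda i: dot(w, X[i]), reverse=True)
        relevant.update(unlabeled[:grow])      # highest-scoring sentences
        irrelevant.update(unlabeled[-grow:])   # lowest-scoring sentences
        w = train_ranker(X, sorted(relevant), sorted(irrelevant), prior)
    return w, relevant, irrelevant


# Toy data: two made-up features per sentence; prior = keyword-overlap score.
X = [[1.0, 0.0], [0.9, 0.1], [0.8, 0.2], [0.2, 0.8], [0.1, 0.9], [0.0, 1.0]]
prior = [1.0, 0.9, 0.8, 0.2, 0.1, 0.0]
w, rel, irr = transductive_rank(X, seed_rel=[0], seed_irr=[5], prior=prior)
```

In this sketch, the final scores `dot(w, x)` would be used to rank sentences for extraction; the weight `lam` controls how strongly the keyword prior constrains the learned function relative to the ranking loss.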