A light-weight summarizer based on language model with relative entropy
暂无分享,去创建一个
A new method for sentence extraction on the basis of language model with relative entropy is presented in this paper. The proposed technique first builds a sentence language model and document cluster language model respectively for the sentence and the documents. The sentences are then ranked according to the relative entropies of the estimated document language model with respect to the estimated sentence language model. The overall results on DUC and MSE corpus demonstrate that the proposed approach outperforms some of the best reported results for generic multi-document summarization.
[1] Elizabeth D. Liddy,et al. Advances in Automatic Text Summarization , 2001, Information Retrieval.
[2] Eduard H. Hovy,et al. Automatic Evaluation of Summaries Using N-gram Co-occurrence Statistics , 2003, NAACL.
[3] Thomas M. Cover,et al. Elements of Information Theory , 2005 .