A light-weight summarizer based on language model with relative entropy

A new method for sentence extraction on the basis of language model with relative entropy is presented in this paper. The proposed technique first builds a sentence language model and document cluster language model respectively for the sentence and the documents. The sentences are then ranked according to the relative entropies of the estimated document language model with respect to the estimated sentence language model. The overall results on DUC and MSE corpus demonstrate that the proposed approach outperforms some of the best reported results for generic multi-document summarization.