Effectively Leveraging Entropy and Relevance for Summarization

Document summarization has attracted a lot of research interest since the 1960s. However, it still remains a challenging task on how to extract effective feature for automatic summarization. In this paper, we extract two features called entropy and relevance to leverage information from different perspectives for summarization. Experiments on unsupervised and supervised methods testify the effectiveness of leveraging the two features.

[1]  Qiang Yang,et al.  Noise reduction through summarization for Web-page classification , 2007, Inf. Process. Manag..

[2]  Karen Spärck Jones Automatic summarising: The state of the art , 2007, Inf. Process. Manag..

[3]  Jammalamadaka Introduction to Linear Regression Analysis (3rd ed.) , 2003 .

[4]  Guang-Bin Huang,et al.  Extreme learning machine: a new learning scheme of feedforward neural networks , 2004, 2004 IEEE International Joint Conference on Neural Networks (IEEE Cat. No.04CH37541).

[5]  Yong Yu,et al.  Enhancing diversity, coverage and balance for summarization through structure learning , 2009, WWW '09.

[6]  Hua Li,et al.  Document Summarization Using Conditional Random Fields , 2007, IJCAI.

[7]  Eduard H. Hovy,et al.  Automatic Evaluation of Summaries Using N-gram Co-occurrence Statistics , 2003, NAACL.

[8]  Sun Park,et al.  Automatic generic document summarization based on non-negative matrix factorization , 2009, Inf. Process. Manag..

[9]  Rada Mihalcea,et al.  Language Independent Extractive Summarization , 2005, ACL.

[10]  Daniel Marcu,et al.  From discourse structures to text summaries , 1997 .

[11]  Elizabeth A. Peck,et al.  Introduction to Linear Regression Analysis , 2001 .

[12]  Alex Alves Freitas,et al.  Automatic Text Summarization Using a Machine Learning Approach , 2002, SBIA.

[13]  Xin Liu,et al.  Generic text summarization using relevance measure and latent semantic analysis , 2001, SIGIR '01.

[14]  Dianne P. O'Leary,et al.  Text summarization via hidden Markov models , 2001, SIGIR '01.

[15]  Hans Peter Luhn,et al.  The Automatic Creation of Literature Abstracts , 1958, IBM J. Res. Dev..

[16]  Ramiz M. Aliguliyev,et al.  A new sentence similarity measure and sentence based extractive technique for automatic text summarization , 2009, Expert Syst. Appl..