论文信息 - Efficient Voting-Based Extractive Automatic Text Summarization Using Prominent Feature Set

Efficient Voting-Based Extractive Automatic Text Summarization Using Prominent Feature Set

ABSTRACT Automatic text summarization (ATS) is the process of generating a summary by condensing text document by a computer machine. In this paper, we explored voting-based extractive approaches for text summarization. The main issue with most of the feature-based ATS methods is to find optimal feature weights for sentence scoring to optimize the quality of summary. Voting-based methods are sensitive to initial ranking process. We proposed reciprocal ranking-based sentence scoring approach that alleviates the feature weighting and initial ranking problem. The proposed approach uses a specific prominent set of features for initial ranking that further enhance the performance. Experimental results on Document Understating Conference 2002 data-set using ROUGE evaluation matrices shows that our proposed method performs better as compared to other voting-based methods.

Yogesh Kumar Meena | Dinesh Gopalani | D. Gopalani | Y. Meena

[1] Rajesh S. Prasad,et al. Implementation and Evaluation of Evolutionary Connectionist Approaches to Automated Text Summarization , 2010 .

[2] Yogesh Kumar Meena,et al. Analysis of Sentence Scoring Methods for Extractive Automatic Text Summarization , 2014, ICTCS '14.

[3] Edward A. Fox,et al. Combination of Multiple Searches , 1993, TREC.

[4] Breck Baldwin,et al. Dynamic Coreference-Based Summarization , 1998, EMNLP.

[5] Naomie Salim,et al. Voting Models for Summary Extraction from Text Documents , 2014, 2014 International Conference on IT Convergence and Security (ICITCS).

[6] Javed A. Aslam,et al. Models for metasearch , 2001, SIGIR '01.

[7] Antonio Zamora,et al. Automatic Abstracting Research at Chemical Abstracts Service , 1975, J. Chem. Inf. Comput. Sci..

[8] Naomie Salim,et al. Differential evolution cluster-based text summarization methods , 2013, 2013 INTERNATIONAL CONFERENCE ON COMPUTING, ELECTRICAL AND ELECTRONIC ENGINEERING (ICCEEE).

[9] Eduard H. Hovy,et al. Automated Text Summarization and the SUMMARIST System , 1998, TIPSTER.

[10] Lisa F. Rau,et al. Automatic Condensation of Electronic Publications by Sentence Selection , 1995, Inf. Process. Manag..

[11] Ralph Grishman,et al. Summarization System Integrated with Named Entity Tagging and IE pattern Discovery , 2002, LREC.

[12] Xiaoyue Liu,et al. An Extractive Text Summarizer Based on Significant Words , 2009, ICCPOL.

[13] Mark T. Maybury,et al. Advances in Automatic Text Summarization , 1999 .

[14] Chin-Yew Lin,et al. ROUGE: A Package for Automatic Evaluation of Summaries , 2004, ACL 2004.

[15] Dragomir R. Radev,et al. Experiments in Single and Multi-Document Summarization Using MEAD , 2001 .

[16] George D. C. Cavalcanti,et al. Assessing sentence scoring techniques for extractive text summarization , 2013, Expert Syst. Appl..

[17] James E. Rush,et al. Automatic abstracting and indexing. II. Production of indicative abstracts by application of contextual inference and syntactic coherence criteria , 1971 .

[18] Jun Ma,et al. A Comprehensive Method for Text Summarization Based on Latent Semantic Analysis , 2013, NLPCC.

[19] Craig MacDonald,et al. Voting for candidates: adapting data fusion techniques for an expert search task , 2006, CIKM '06.

[20] Rafael Dueire Lins,et al. A Context Based Text Summarization System , 2014, 2014 11th IAPR International Workshop on Document Analysis Systems.

[21] Fuji Ren,et al. GA, MR, FFNN, PNN and GMM based models for automatic text summarization , 2009, Comput. Speech Lang..

[22] Vasudeva Varma,et al. Sentence Extraction Based Single Document Summarization , .

[23] Naomie Salim,et al. Text summarization features selection method using pseudo Genetic-based model , 2012, 2012 International Conference on Information Retrieval & Knowledge Management.

[24] Gerard Salton,et al. Automatic Text Structuring and Summarization , 1997, Inf. Process. Manag..

[25] Edward A. Fox,et al. Research Contributions , 2014 .

[26] A. Kogilavani,et al. CLUSTERING AND FEATURE SPECIFIC SENTENCE EXTRACTION BASED SUMMARIZATION OF MULTIPLE DOCUMENTS , 2010 .

[27] Masaki Murata,et al. Sentence Extraction System Assembling Multiple Evidence , 2001, NTCIR.

[28] Craig MacDonald,et al. Voting techniques for expert search , 2008, Knowledge and Information Systems.

[29] Rada Mihalcea,et al. TextRank: Bringing Order into Text , 2004, EMNLP.

[30] Kenneth Ward Church,et al. Inverse Document Frequency (IDF): A Measure of Deviations from Poisson , 1995, VLC@ACL.

[31] H. P. Edmundson,et al. New Methods in Automatic Extracting , 1969, JACM.

[32] Elizabeth León Guzman,et al. Extractive single-document summarization based on genetic operators and guided local search , 2014, Expert Syst. Appl..

[33] Eduard H. Hovy,et al. Identifying Topics by Position , 1997, ANLP.

[34] Karen Spärck Jones. Automatic summarising: factors and directions , 1998, ArXiv.

[35] Tatsunori Mori,et al. Information Gain Ratio as Term Weight: The case of Summarization of IR Results , 2002, COLING.

[36] Phyllis B. Baxendale,et al. Machine-Made Index for Technical Literature - An Experiment , 1958, IBM J. Res. Dev..

[37] James P. Callan,et al. Combining document representations for known-item search , 2003, SIGIR.

[38] Naomie Salim,et al. Multi document summarization based on cross-document relation using voting technique , 2013, 2013 INTERNATIONAL CONFERENCE ON COMPUTING, ELECTRICAL AND ELECTRONIC ENGINEERING (ICCEEE).

[39] Hans Peter Luhn,et al. The Automatic Creation of Literature Abstracts , 1958, IBM J. Res. Dev..

[40] Martin F. Porter,et al. An algorithm for suffix stripping , 1997, Program.

[41] Wenjie Li,et al. Simultaneous Ranking and Clustering of Sentences: A Reinforcement Approach to Multi-Document Summarization , 2010, COLING.