Use of fuzzy logic and wordnet for improving performance of extractive automatic text summarization

Text Summarization produces a shorter version of large text documents by selecting most relevant information. Text summarization systems are of two types: extractive and abstractive. This paper focuses on extractive text summarization. In extractive text summarization, important sentences are selected based on certain important features. The importance of some extractive features is more than the some other features, so they should have the balance weight in computations. The purpose of this paper is to use fuzzy logic and wordnet synonyms to handle the issue of ambiguity and imprecise values with the traditional two value or multi-value logic and to consider the semantics of the text. Three different methods: fuzzy logic based method, bushy path method, and wordnet synonyms method are used to generate 3 summaries. Final summary is generated by selecting common sentences from all the 3 summaries and from rest of the sentences in union of all summaries, selection is done based on sentence location. The proposed methodology is compared with three individual methods i.e. fuzzy logic based summarizer, bushy path summarizer, and wordnet synonyms summarizer by evaluating the performance of each on 95 documents from standard DUC 2002 dataset using ROUGE evaluation metrics. The analysis shows that the proposed method gives better average precision, recall, and f-measure.

[1]  Phyllis B. Baxendale,et al.  Machine-Made Index for Technical Literature - An Experiment , 1958, IBM J. Res. Dev..

[2]  Francine Chen,et al.  A trainable document summarizer , 1995, SIGIR '95.

[3]  Rada Mihalcea,et al.  TextRank: Bringing Order into Text , 2004, EMNLP.

[4]  Esfandiar Eslami,et al.  Optimizing Machine Learning Approach Based on Fuzzy Logic in Text Summarization , 2009 .

[5]  A. R. Kulkarni,et al.  A DOMAIN-SPECIFIC AUTOMATIC TEXT SUMMARIZATION USING FUZZY LOGIC , 2013 .

[6]  Chin-Yew Lin,et al.  ROUGE: A Package for Automatic Evaluation of Summaries , 2004, ACL 2004.

[7]  Mary Ellen Okurowski,et al.  Trainable, Scalable Summarization Using Robust NLP and Machine Learning , 1998, ACL.

[8]  Rucha S. Dixit,et al.  Improvement of Text Summarization using Fuzzy Logic Based Method , 2012 .

[9]  Naomie Salim,et al.  Feature-Based Sentence Extraction Using Fuzzy Inference Rules , 2009, 2009 International Conference on Signal Processing Systems.

[10]  George A. Miller,et al.  Introduction to WordNet: An On-line Lexical Database , 1990 .

[11]  Dragomir R. Radev,et al.  LexRank: Graph-based Lexical Centrality as Salience in Text Summarization , 2004, J. Artif. Intell. Res..

[12]  Naomie Salim,et al.  Sentence Features Fusion for Text Summarization Using Fuzzy Logic , 2009, 2009 Ninth International Conference on Hybrid Intelligent Systems.

[13]  H. P. Edmundson,et al.  New Methods in Automatic Extracting , 1969, JACM.

[14]  S. Santhana Megala,et al.  Enriching Text Summarization using Fuzzy Logic , 2014 .

[15]  Naomie Salim,et al.  Fuzzy Logic Based Method for Improving Text Summarization , 2009, ArXiv.

[16]  Gerard Salton,et al.  Automatic Text Structuring and Summarization , 1997, Inf. Process. Manag..

[17]  S. A. Babar,et al.  Improving Performance of Text Summarization , 2015 .

[18]  Naomie Salim,et al.  Fuzzy Swarm Based Text Summarization , 2009 .

[19]  Savita S. Jadhav,et al.  Enforcing Text Summarization using Fuzzy Logic , 2014 .

[20]  Christian Callegari,et al.  Advances in Computing, Communications and Informatics (ICACCI) , 2015 .

[21]  Hans Peter Luhn,et al.  The Automatic Creation of Literature Abstracts , 1958, IBM J. Res. Dev..

[22]  Xiaoyue Liu,et al.  An Extractive Text Summarizer Based on Significant Words , 2009, ICCPOL.

[23]  Lisa F. Rau,et al.  Automatic Condensation of Electronic Publications by Sentence Selection , 1995, Inf. Process. Manag..