An improved key term weightage algorithm for text summarization using local context information and fuzzy graph sentence score

The process of text summarization is to identify the crux of the document. In the proposed work, summarization is done using three different algorithms. They are sentence based key term weightage, the two way local context information scoring (LCIS) and the fuzzy graph sentence scoring (FGSS). They are used to improve the weight of the key terms, identify LCIS and the centroid of the document by calculating FGSS score respectively. This intelligent system assigning sentence based weightage to the key terms is found to be effective. The present method is domain and language independent. It provides good harmonic mean in comparison with the earlier studies and does not require any training or testing.

[1]  M. F. Porter,et al.  An algorithm for suffix stripping , 1997 .

[2]  Naomie Salim,et al.  Fuzzy Genetic Semantic Based Text Summarization , 2011, 2011 IEEE Ninth International Conference on Dependable, Autonomic and Secure Computing.

[3]  Xiaoge Li,et al.  Chinese term extraction from web pages based on expected point-wise mutual information , 2016, 2016 12th International Conference on Natural Computation, Fuzzy Systems and Knowledge Discovery (ICNC-FSKD).

[4]  Rucha S. Dixit,et al.  Improvement of Text Summarization using Fuzzy Logic Based Method , 2012 .

[5]  Regina Barzilay,et al.  Using Lexical Chains for Text Summarization , 1997 .

[6]  Chin-Yew Lin,et al.  ROUGE: A Package for Automatic Evaluation of Summaries , 2004, ACL 2004.

[7]  Elena Lloret,et al.  A Text Summarization Approach under the Influence of Textual Entailment , 2016, NLPCS.

[8]  Sneha S. Desai,et al.  WordNet and Semantic similarity based approach for document clustering , 2016, 2016 International Conference on Computation System and Information Technology for Sustainable Solutions (CSITSS).

[9]  Gajanan K. Birajdar,et al.  Speech/music classification using visual and spectral chromagram features , 2020, J. Ambient Intell. Humaniz. Comput..

[10]  Vijay K. Mago,et al.  Calculating the similarity between words and sentences using a lexical database and corpus statistics , 2018, ArXiv.

[11]  Pooja Arora,et al.  Semantic Searching and Ranking of Documents using Hybrid Learning System and WordNet , 2012 .

[12]  Harish Garg,et al.  Linguistic single-valued neutrosophic prioritized aggregation operators and their applications to multiple-attribute group decision-making , 2018, Journal of Ambient Intelligence and Humanized Computing.

[13]  Tao Yang,et al.  Research and improvement of feature words weight based on TFIDF algorithm , 2016, 2016 IEEE Information Technology, Networking, Electronic and Automation Control Conference.

[14]  Alireza Pakgohar,et al.  A probabilistic relational model for keyword extraction , 2012, 2012 International Conference on Statistics in Science, Business and Engineering (ICSSBE).

[15]  Zhang Zuping,et al.  A Hierarchical Structured Self-Attentive Model for Extractive Document Summarization (HSSAS) , 2018, IEEE Access.

[16]  Mark Last,et al.  Graph-Based Keyword Extraction for Single-Document Summarization , 2008, COLING 2008.

[17]  Ramakrishnan Srikant,et al.  Fast algorithms for mining association rules , 1998, VLDB 1998.

[18]  Dragomir R. Radev,et al.  Centroid-based summarization of multiple documents: sentence extraction, utility-based evaluation, and user studies , 2000, ArXiv.

[19]  W. Bruce Croft,et al.  Document Summarization for Answering Non-Factoid Queries , 2018, IEEE Transactions on Knowledge and Data Engineering.

[20]  Masud Ibn Afjal,et al.  An extractive text summarization technique for Bengali document(s) using K-means clustering algorithm , 2017, 2017 IEEE International Conference on Imaging, Vision & Pattern Recognition (icIVPR).

[21]  Lin Min,et al.  Semi-supervised topic learning and representation method based on association rules and metadata , 2017, 2017 2nd IEEE International Conference on Computational Intelligence and Applications (ICCIA).

[22]  Mohammed Nabeel Iqbal,et al.  Some applications of intuitionistic fuzzy sets using new similarity measure , 2019, Journal of Ambient Intelligence and Humanized Computing.

[23]  Han-Joon Kim,et al.  News Keyword Extraction for Topic Tracking , 2008, 2008 Fourth International Conference on Networked Computing and Advanced Information Management.

[24]  Vladia Pinheiro,et al.  Ranking Keyphrases from Semantic and Syntactic Features of Textual Terms , 2015, 2015 Brazilian Conference on Intelligent Systems (BRACIS).