Single Document Summarization Using Natural Language Processing

The need for text summarization is crucial as we enter the era of in­ formation overload. However, the current implementations are specific to a domain or a genre of the source document. In this paper, we discuss an algo­ rithm for text summarization which is independent of domain and document source. This algorithm creates text summaries by analyzing the logical struc­ ture of the sentences. Sentences are parsed and important relationships are identified, stored in the form of a graph, thus graph corresponding to each sen­ tence in the document is generated and merged to form graph of the document, now this graph is clustered into sub-graphs which represent the different topics in the document. Then a graph scoring algorithm scores the graph, and helps in extracting the important sentences towards summary. To increase the coher­ ence of the summary, the pool of extracted sentences undergoes some transfor­ mation in a specified order, resulting in final sentences that form the summary of the document.

[1]  Yllias Chali,et al.  Text Summarization Using Lexical Chains , 2001 .

[2]  Marti A. Hearst Text Tiling: Segmenting Text into Multi-paragraph Subtopic Passages , 1997, CL.

[3]  B. Magnini,et al.  Keyphrase Extraction for Summarization Purposes : The LAKE System at DUC-2004 , 2004 .

[4]  Shalom Lappin,et al.  An Algorithm for Pronominal Anaphora Resolution , 1994, CL.

[5]  Phyllis B. Baxendale,et al.  Machine-Made Index for Technical Literature - An Experiment , 1958, IBM J. Res. Dev..

[6]  Francine Chen,et al.  A trainable document summarizer , 1995, SIGIR '95.

[7]  Michael Collins,et al.  A New Statistical Parser Based on Bigram Lexical Dependencies , 1996, ACL.

[8]  Sergey Brin,et al.  The Anatomy of a Large-Scale Hypertextual Web Search Engine , 1998, Comput. Networks.

[9]  Dragomir R. Radev,et al.  The University of Michigan at DUC 2004 , 2004 .

[10]  Gordon W. Paynter,et al.  Interactive document summarisation using automatically extracted keyphrases , 2002, Proceedings of the 35th Annual Hawaii International Conference on System Sciences.

[11]  H. P. Edmundson,et al.  New Methods in Automatic Extracting , 1969, JACM.

[12]  Michael Collins,et al.  Three Generative, Lexicalised Models for Statistical Parsing , 1997, ACL.

[13]  Karen Spärck Jones Automatic summarising: factors and directions , 1998, ArXiv.

[14]  Michael Collins,et al.  Three Generative, Lexicalised Models for Statistical Parsing , 1997, ACL.

[15]  Hans Peter Luhn,et al.  The Automatic Creation of Literature Abstracts , 1958, IBM J. Res. Dev..

[16]  M. Sanderson Book Reviews: Advances in Automatic Text Summarization , 2000, Computational Linguistics.

[17]  Vincent Kanade,et al.  Clustering Algorithms , 2021, Wireless RF Energy Transfer in the Massive IoT Era.

[18]  Mark T. Maybury,et al.  Advances in Automatic Text Summarization , 1999 .