Kannada text summarization using Latent Semantic Analysis

Text summarization reduces an original text document to a short description that retains the meaning and information content of the original. Generating summaries of very large documents manually is a difficult task for human beings. The linguistic and statistical features of a sentence can be used to estimate its importance. Latent Semantic Analysis (LSA) automatically captures the semantic relationships between sentences, much as a human reader would. In this paper, Singular Value Decomposition (SVD) is used to generate the summary. SVD finds the dimensions of the sentence vectors that are principal and mutually orthogonal. These two properties guarantee, respectively, relevance to the original text document and non-redundancy in the machine-generated summary.
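The SVD step described above can be illustrated with a minimal sketch. The abstract does not specify the weighting scheme or the sentence-selection rule, so everything below is an assumption made for illustration: raw term counts, whitespace tokenization, and a selection rule that takes, for each of the leading right singular vectors, the not-yet-chosen sentence with the largest absolute weight. The function names `build_term_sentence_matrix` and `lsa_summarize` are hypothetical, and the sample sentences are English placeholders rather than Kannada data.

```python
import numpy as np


def build_term_sentence_matrix(sentences):
    """Return a terms x sentences matrix of raw term counts.

    Assumption: whitespace tokenization and raw counts; the paper's
    actual preprocessing and weighting are not given in the abstract.
    """
    tokenized = [s.split() for s in sentences]
    vocab = sorted({tok for toks in tokenized for tok in toks})
    index = {term: i for i, term in enumerate(vocab)}
    matrix = np.zeros((len(vocab), len(sentences)))
    for j, toks in enumerate(tokenized):
        for tok in toks:
            matrix[index[tok], j] += 1.0
    return matrix


def lsa_summarize(sentences, num_sentences=2):
    """Pick up to num_sentences sentences, one per leading latent dimension."""
    k = max(0, min(num_sentences, len(sentences)))
    a = build_term_sentence_matrix(sentences)
    # Rows of vt are mutually orthogonal right singular vectors, ordered
    # by decreasing singular value (the "principal" dimensions first).
    _, _, vt = np.linalg.svd(a, full_matrices=False)
    chosen = []
    for row in vt[:k]:
        # Sentences ranked by absolute weight in this latent dimension.
        for idx in np.argsort(-np.abs(row)):
            if int(idx) not in chosen:
                chosen.append(int(idx))
                break
    # Emit the selected sentences in their original document order.
    return [sentences[i] for i in sorted(chosen)]


example = [
    "the cat sat on the mat",
    "the cat chased the mouse",
    "stock markets fell sharply today",
    "investors sold stock as markets fell",
]
summary = lsa_summarize(example, num_sentences=2)
print(summary)
```

Drawing one sentence per singular vector is what ties the two properties in the abstract to the output: the leading vectors cover the strongest topics (relevance), and their orthogonality discourages picking two sentences for the same topic (non-redundancy).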
