Text summarization is an emerging technique for finding out the summary of the text document. Text summarization is nothing but summarizing the content of given text document. Text summarization has got so uses such as Due to the massive amount of information getting increased on internet; it is difficult for the user to go through all the information available on web. Summarization techniques need to be used to reduce the users time in reading the whole information available on web. In this paper propose a Malayalam text summarization system which is based on MMR technique with successive threshold. Here the sentences are selected based on the concept of maximal marginal relevance. The key idea is to use a unit step function at each step to decide the maximum marginal relevance and the number of sentences present in the summary would be equal to the number of paragraphs or the average number of sentences present in the text document, which can be achieved by using successive threshold approach. We apply MMR approach on Malayalam text summarization task and achieve comparable results to the state of the art. KeywordsMaximum Marginal Relevance, Successive Threshold, Unit step function.
[1]
Marko Grobelnik,et al.
Extracting Summary Sentences Based on the Document Semantic Graph
,
2005
.
[2]
Jayashree,et al.
KEYWORD EXTRACTION BASED SUMMARIZATION OF CATEGORIZED KANNADA TEXT DOCUMENTS
,
2011
.
[3]
Dipanjan Das Andr,et al.
A Survey on Automatic Text Summarization
,
2007
.
[4]
Gurpreet Singh Lehal,et al.
A Survey of Text Summarization Extractive Techniques
,
2010
.
[5]
Elena Lloret.
Text Summarization: An Overview
,
2021,
Special Issue III.
[6]
Kamal Sarkar,et al.
Bengali text summarization by sentence extraction
,
2012,
ArXiv.
[7]
Sumam Mary Idicula,et al.
Text summarization for Malayalam documents — An experience
,
2014,
2014 International Conference on Data Science & Engineering (ICDSE).
[8]
Aysun Güran,et al.
A Comparison of Feature and Semantic-Based Summarization Algorithms for Turkish
,
2010
.