A Hybrid Method for Query based Automatic Summarization System

Automatic text summarization is one of the research goals of Natural Language Processing which relieves humans from studying each and every line in a text document to understand the underlying concepts in it. Automatic text summarization is aimed to create a brief outline of a given text covering the important points in the text. Automatic text summarization can be generic or query specific. This paper is focused on Query specific text summarization where a summary of the given text is constructed based on the given query. Query specific text summarization is based on the calculation of the relationship between sentences in the text document and the query given. Several statistical techniques and linguistic techniques have been developed to find the relationship between the given query and the sentences in the document. These methods when used alone could not give desired accuracy in the results. In this paper a sentence scoring method is defined based on existing sentence scoring methods. It attempts to combine the individual results of these methods to give a better assessment of the relationship between the sentences.

[1]  Mohsen Amini Salehi,et al.  A Comprehensive Survey on Text Summarization Systems , 2009, 2009 2nd International Conference on Computer Science and its Applications.

[2]  Pei-ying Zhang,et al.  Automatic text summarization based on sentences clustering and extraction , 2009, 2009 2nd IEEE International Conference on Computer Science and Information Technology.

[3]  Gerard Salton,et al.  Automatic Text Processing: The Transformation, Analysis, and Retrieval of Information by Computer , 1989 .

[4]  Paul M. B. Vitányi,et al.  The Google Similarity Distance , 2004, IEEE Transactions on Knowledge and Data Engineering.

[5]  J.R. Bellegarda,et al.  Exploiting latent semantic information in statistical language modeling , 2000, Proceedings of the IEEE.

[6]  George W. Adamson,et al.  The use of an association measure based on character structure to identify semantically related pairs of words and document titles , 1974, Inf. Storage Retr..

[7]  A. Ibrahim,et al.  Arabic text summarization using Rhetorical Structure Theory , 2012, 2012 8th International Conference on Informatics and Systems (INFOS).

[8]  Mark Wasson,et al.  Using Leading Text for News Summaries: Evaluation Results and Implications for Commercial Summarization Applications , 1998, ACL.

[9]  Youngjoong Ko,et al.  An effective sentence-extraction technique using contextual information and statistical approaches for text summarization , 2008, Pattern Recognition Letters.

[10]  Zuhair Bandar,et al.  A Method for Measuring Sentence Similarity and iIts Application to Conversational Agents , 2004, FLAIRS.