Multi-Document and Multi-Lingual Summarization using Neural Networks

This system proposes Multi-lingual (Tamil and English) Multi-document summarization by neural networks. The system involves three steps. In first step, the sentences of the documents are converted into vector form. In the second step weight values are assigned to vector form based on sentence features. Depend on sentence weight value, single document summarization is done. The output of single document summarization is used as an input for multi-document Summarization. Final step is a sentence selection, in which output summary is selected based on the similarity and dissimilarity measures. Sentence similarity and dissimilarity measures are used to compare the sentences. From that, resultant summary is produced. The proposed system can be able to summarize both Tamil and English online news papers.

[1]  Marti A. Hearst,et al.  A Critique and Improvement of an Evaluation Metric for Text Segmentation , 2002, CL.

[2]  A. Govardhan,et al.  Query-Based Summarizer Based on Similarity of Sentences and Word Frequency , 2011 .

[3]  Wei Peng,et al.  An Integrated Data-Driven Framework for Computing System Management , 2010, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[4]  Andrew V. Goldberg,et al.  An efficient cost scaling algorithm for the assignment problem , 1995, Math. Program..

[5]  Xiaojun Wan,et al.  Multi-document Summarization Using Minimum Distortion , 2010, 2010 IEEE International Conference on Data Mining.

[6]  K. Kaikhah Automatic text summarization with neural networks , 2004, 2004 2nd International IEEE Conference on 'Intelligent Systems'. Proceedings (IEEE Cat. No.04EX791).

[7]  Tao Jiang,et al.  Learning Image-Text Associations , 2009, IEEE Transactions on Knowledge and Data Engineering.

[8]  Inderjeet Mani,et al.  The Challenges of Automatic Summarization , 2000, Computer.

[9]  Gurpreet Singh Lehal,et al.  A Survey of Text Summarization Extractive Techniques , 2010 .

[10]  Dong-Hong Ji,et al.  MSBGA: A Multi-Document Summarization System Based on Genetic Algorithm , 2006, 2006 International Conference on Machine Learning and Cybernetics.

[11]  O. Sornil,et al.  An Automatic Text Summarization Approach using Content-Based and Graph-Based Characteristics , 2006, 2006 IEEE Conference on Cybernetics and Intelligent Systems.

[12]  Feifan Liu,et al.  Exploring Correlation Between ROUGE and Human Evaluation on Meeting Summaries , 2010, IEEE Transactions on Audio, Speech, and Language Processing.

[13]  Hongyan Liu,et al.  Multi-Document summarization based on improved features and clustering , 2010, Proceedings of the 6th International Conference on Natural Language Processing and Knowledge Engineering(NLPKE-2010).

[14]  Fuji Ren,et al.  Probabilistic neural network based text summarization , 2008, 2008 International Conference on Natural Language Processing and Knowledge Engineering.

[15]  Pankaj Gupta,et al.  Summarizing text by ranking text units according to shallow linguistic features , 2011, 13th International Conference on Advanced Communication Technology (ICACT2011).